[V3 Downloader] Revision tracking (#2571)

* feat(downloader): Install cog from specific commit in repo (initial commit)

- Repo and Installable have commit property now
- New class inheriting from Installable -
InstalledCog (old one from converters.py removed)
- New Repo.checkout() method, which is also async ctx manager
ref #2527

* fix(downloader): Keep information about repo's branch in config

- This is needed to make sure that repo can go back from detached state in some rare unexpected
cases
- current branch is determined by `git symbolic-ref` now as this command errors for detached
HEAD

* feat(downloader): Update repo without cogs, update single cog

The most important part of issue #2527 has been added here
- `[p]repo update` command added
- new conf format - nested dictionary repo_name->cog_name->cog_json
  installed libraries are now kept in conf too
  - `InstalledCog` renamed to `InstalledModule` - installed libraries use this class
  - `Downloader.installed_libraries()` and `Downloader.installed_modules()` added
  - `Downloader._add_to_installed()` and `Downloader._remove_from_installed()`
    now accept list of modules, of both cogs and libraries
- `[p]cog install` tells about fails of copying cog and installing shared libraries
- `[p]cog update` will truly update only chosen cogs (if provided) or cogs that need update
  - pinned cogs aren't checked
  - before update, repos are updated
  - to determine if update is needed `Repo.get_modified_modules()` is used
- `[p]cog pin` and `[p]cog unpin` commands for pinning/unpinning cogs added
- `Repo.checkout()` allows to choose ctx manager exit's checkout revision
- `Repo.install_cog()` returns `InstalledModule` now and raises CopyingError (maybe breaking?)
- `Repo.install_libraries()` returns 2-tuple of installed and failed libraries (maybe breaking?)
- `RepoManager.get_all_cogs()` added, which returns cogs from all repos
- `RepoManager.repos` property added, which contains tuple of `Repo`

* test(downloader): Repo.current_branch() throws an exception, when branch can't be determined

* style(downloader): rename _add_to_installed to _save_to_installed

This method is used for both adding and updating existing modules in Config

* refactor(downloader): add ctx.typing() for few commands

`[p]cog install` is nested hell, can't wait for moving install logic to separate method

* fix(downloader): refactor and fix `set` usage

* perf(downloader): update commits for ALL checked modules to omit diffs next time

This will also disable running git diff for cogs that have the same commit as the latest one

* style(downloader): few style improvements

- use of mutable object in method definition
- make Repo._get_full_sha1() public method
- too long
line
- don't use len to check if sequence is empty

* feat(downloader): add `[p]cog updateallfromrepos` and `[p]cog updatetoversion` commands

- moved cog update logic into `Downloader._cog_update_logic()` (lack of better name)
  - splitted
whole cog update process into smaller methods
  - might still need some improvements
- added new
methods to `Repo` class:
  - `is_on_branch()` to check if repo is currently checked out to branch

- `is_ancestor()` to check if one commit is ancestor of the other
- fix for
`Downloader._available_updates()` behaviour
broken by commit
5755ab08ba67556b3863e907c6f44d80f4f13d88

* feat(downloader): try to find last commit where module is still present

Enhancements:
- `Installable` now has `repo` attribute containing repo object or `None` if repo is
missing
- `Downloader._install_cogs()` and `Downloader._reinstall_libraries()` are able to install
modules from different commits of repo
- `Repo.checkout()` as ctx manager will now exit to commit
which was active before checking out
- unification of `rev` and `hash` terms:
All function
parameters are explicitly called `hash`, if it can only be commit's full sha1 hash or `rev` if it
can be anything that names a commit object, see
[link](https://git-scm.com/docs/git-rev-parse#_specifying_revisions)
- new
`Repo.get_last_module_occurence()` method, which gets module's Installable from last commit in which
it still occurs

* docs(downloader): Add basic description for `InstalledModule`

* fix(downloader): cog ignored during updates if its commit was missing

After config format update, commit string is empty until update and when such cog was checked and it
wasn't available in repo anymore, it was ignored

* refactor(downloader): Installing cogs from specific rev will pin them

* perf(downloader): Don't checkout when current commit equals target hash

- changes to `Repo.checkout()`:
  - `exit_to_rev` is now keyword only argument
  - added
`force_checkout` to force checkout even if `Repo.commit` value is the same as target hash

* refactor(downloader): Repo._run() stderr is redirected to debug log now

- added two keyword arguments:
  - `valid_exit_codes` which specifies valid exit codes, used to
determine if stderr should be sent as debug or error level in logging
  - `debug_only` which
specifies if stderr can be sent only as debug level in logging

* style(downloader): stop using `set` as arg name in `_load_repos()`

* feat(downloader): pass multiple cogs to `[p]cog (un)pin`

* refactor(downloader): accept module name instead of instance, fix spelling

* style(downloader): few small style changes

* fix(downloader): add type annotations + fixes based on them

- fix wrong type annotations and add a lot of new ones
- add checks for `Installable.repo` being `None`
- fix wrong return type in `Downloader._install_requirements`
- show repo names correctly when updating all repos
- fix error when some requirement fails to install

BREAKING CHANGE:
- type of `Repo.available_modules` is now consistent (always `tuple`)

* tests: use same event loop policy as in Red's code

* enhance(downloader): fully handle ambiguous revisions

* build(deps): add pytest-mock dependency to tests extra

* fix(downloader): minor fixes

* feat(downloader): add tool for editing Downloader's test repo

This script aims to help update the human-readable version of repo
used for git integration tests in ``redbot/tests/downloader_testrepo.export``
by exporting/importing it in/from provided directory.

Note
----
Editing `downloader_git_test_repo.export` file manually is strongly discouraged,
especially editing any part of commit directives as that causes a change in the commit's hash.
Another problem devs could encounter when trying to manually edit that file
are editors that will use CRLF instead of LF for new line character(s) and therefore break it.

I also used `.gitattributes` to prevent autocrlf from breaking testrepo.

Also, if Git ever changes currently used SHA-1 to SHA-256 we will have to
update old hashes with new ones. But it's a small drawback,
when we can have human-readable version of repo.

Known limitations
-----------------
``git fast-export`` exports commits without GPG signs so this script disables it in repo's config.
This also means devs shouldn't use ``--gpg-sign`` flag in ``git commit`` within the test repo.

* tests(downloader): add git tests and test repo for them

Also added Markdown file that is even more clear than export file
on what the test repo contains.
This is manually created but can be automated on later date.

* test(downloader): add more tests related to RepoManager

These tests use expected output that is already guaranteed by git tests.

* chore(CODEOWNERS): add jack1142 to Downloader's folders

I know this doesn't actually give any benefit to people that don't have
write permission to the repo but I saw other big fella devs doing this,
so I think this might be advisable.

* enhance(downloader): allow easy schema updates in future

* enhance(downloader): more typing fixes, add comments for clarity

* feat(downloader): add python and bot version check to update process

follow-up on #2605, this commit fully fixes #1866

* chore(changelog): add towncrier entries

* fix(downloader): use `*args` instead of `commands.Greedy`

* fix(downloader): hot-reload issue - `InstallableType` now inherits from `IntEnum`

There's desync of `InstallableType` class types due to hot-reload
and `IntEnum` allows for equality check between different types

* enhance(downloader): ensure there's no cog with same name installed

should fix #2927

* fix(downloader): last few changes before marking as ready for review
This commit is contained in:
jack1142
2019-11-08 02:36:16 +01:00
committed by Michael H
parent d85fb260e7
commit e2c8b11008
40 changed files with 2922 additions and 452 deletions

View File

@@ -21,7 +21,7 @@ REPO_INSTALL_MSG = _(
_ = T_
async def do_install_agreement(ctx: commands.Context):
async def do_install_agreement(ctx: commands.Context) -> bool:
downloader = ctx.cog
if downloader is None or downloader.already_agreed:
return True

View File

@@ -1,14 +1,14 @@
import discord
from redbot.core import commands
from redbot.core.i18n import Translator
from .installable import Installable
from .installable import InstalledModule
_ = Translator("Koala", __file__)
class InstalledCog(Installable):
class InstalledCog(InstalledModule):
@classmethod
async def convert(cls, ctx: commands.Context, arg: str) -> Installable:
async def convert(cls, ctx: commands.Context, arg: str) -> InstalledModule:
downloader = ctx.bot.get_cog("Downloader")
if downloader is None:
raise commands.CommandError(_("No Downloader cog found."))

File diff suppressed because it is too large Load Diff

View File

@@ -1,7 +1,16 @@
from __future__ import annotations
from typing import List, TYPE_CHECKING
if TYPE_CHECKING:
from .repo_manager import Candidate
__all__ = [
"DownloaderException",
"GitException",
"InvalidRepoName",
"CopyingError",
"ExistingGitRepo",
"MissingGitRepo",
"CloningError",
@@ -10,6 +19,8 @@ __all__ = [
"UpdateError",
"GitDiffError",
"NoRemoteURL",
"UnknownRevision",
"AmbiguousRevision",
"PipError",
]
@@ -37,6 +48,15 @@ class InvalidRepoName(DownloaderException):
pass
class CopyingError(DownloaderException):
"""
Throw when there was an issue
during copying of module's files.
"""
pass
class ExistingGitRepo(DownloaderException):
"""
Thrown when trying to clone into a folder where a
@@ -105,6 +125,24 @@ class NoRemoteURL(GitException):
pass
class UnknownRevision(GitException):
"""
Thrown when specified revision cannot be found.
"""
pass
class AmbiguousRevision(GitException):
"""
Thrown when specified revision is ambiguous.
"""
def __init__(self, message: str, candidates: List[Candidate]) -> None:
super().__init__(message)
self.candidates = candidates
class PipError(DownloaderException):
"""
Thrown when pip returns a non-zero return code.

View File

@@ -1,9 +1,11 @@
from __future__ import annotations
import json
import distutils.dir_util
import shutil
from enum import Enum
from enum import IntEnum
from pathlib import Path
from typing import MutableMapping, Any, TYPE_CHECKING
from typing import MutableMapping, Any, TYPE_CHECKING, Optional, Dict, Union, Callable, Tuple, cast
from .log import log
from .json_mixins import RepoJSONMixin
@@ -11,10 +13,11 @@ from .json_mixins import RepoJSONMixin
from redbot.core import __version__, version_info as red_version_info, VersionInfo
if TYPE_CHECKING:
from .repo_manager import RepoManager
from .repo_manager import RepoManager, Repo
class InstallableType(Enum):
class InstallableType(IntEnum):
# using IntEnum, because hot-reload breaks its identity
UNKNOWN = 0
COG = 1
SHARED_LIBRARY = 2
@@ -34,6 +37,10 @@ class Installable(RepoJSONMixin):
----------
repo_name : `str`
Name of the repository which this package belongs to.
repo : Repo, optional
Repo object of the Installable, if repo is missing this will be `None`
commit : `str`, optional
Installable's commit. This is not the same as ``repo.commit``
author : `tuple` of `str`, optional
Name(s) of the author(s).
bot_version : `tuple` of `int`
@@ -58,30 +65,36 @@ class Installable(RepoJSONMixin):
"""
def __init__(self, location: Path):
def __init__(self, location: Path, repo: Optional[Repo] = None, commit: str = ""):
"""Base installable initializer.
Parameters
----------
location : pathlib.Path
Location (file or folder) to the installable.
repo : Repo, optional
Repo object of the Installable, if repo is missing this will be `None`
commit : str
Installable's commit. This is not the same as ``repo.commit``
"""
super().__init__(location)
self._location = location
self.repo = repo
self.repo_name = self._location.parent.stem
self.commit = commit
self.author = ()
self.author: Tuple[str, ...] = ()
self.min_bot_version = red_version_info
self.max_bot_version = red_version_info
self.min_python_version = (3, 5, 1)
self.hidden = False
self.disabled = False
self.required_cogs = {} # Cog name -> repo URL
self.requirements = ()
self.tags = ()
self.required_cogs: Dict[str, str] = {} # Cog name -> repo URL
self.requirements: Tuple[str, ...] = ()
self.tags: Tuple[str, ...] = ()
self.type = InstallableType.UNKNOWN
if self._info_file.exists():
@@ -90,15 +103,15 @@ class Installable(RepoJSONMixin):
if self._info == {}:
self.type = InstallableType.COG
def __eq__(self, other):
def __eq__(self, other: Any) -> bool:
# noinspection PyProtectedMember
return self._location == other._location
def __hash__(self):
def __hash__(self) -> int:
return hash(self._location)
@property
def name(self):
def name(self) -> str:
"""`str` : The name of this package."""
return self._location.stem
@@ -111,6 +124,7 @@ class Installable(RepoJSONMixin):
:return: Status of installation
:rtype: bool
"""
copy_func: Callable[..., Any]
if self._location.is_file():
copy_func = shutil.copy2
else:
@@ -121,18 +135,20 @@ class Installable(RepoJSONMixin):
# noinspection PyBroadException
try:
copy_func(src=str(self._location), dst=str(target_dir / self._location.stem))
except:
except: # noqa: E722
log.exception("Error occurred when copying path: {}".format(self._location))
return False
return True
def _read_info_file(self):
def _read_info_file(self) -> None:
super()._read_info_file()
if self._info_file.exists():
self._process_info_file()
def _process_info_file(self, info_file_path: Path = None) -> MutableMapping[str, Any]:
def _process_info_file(
self, info_file_path: Optional[Path] = None
) -> MutableMapping[str, Any]:
"""
Processes an information file. Loads dependencies among other
information into this object.
@@ -145,7 +161,7 @@ class Installable(RepoJSONMixin):
if info_file_path is None or not info_file_path.is_file():
raise ValueError("No valid information file path was found.")
info = {}
info: Dict[str, Any] = {}
with info_file_path.open(encoding="utf-8") as f:
try:
info = json.load(f)
@@ -174,7 +190,7 @@ class Installable(RepoJSONMixin):
self.max_bot_version = max_bot_version
try:
min_python_version = tuple(info.get("min_python_version", [3, 5, 1]))
min_python_version = tuple(info.get("min_python_version", (3, 5, 1)))
except ValueError:
min_python_version = self.min_python_version
self.min_python_version = min_python_version
@@ -212,14 +228,51 @@ class Installable(RepoJSONMixin):
return info
def to_json(self):
return {"repo_name": self.repo_name, "cog_name": self.name}
class InstalledModule(Installable):
"""Base class for installed modules,
this is basically instance of installed `Installable`
used by Downloader.
Attributes
----------
pinned : `bool`
Whether or not this cog is pinned, always `False` if module is not a cog.
"""
def __init__(
self,
location: Path,
repo: Optional[Repo] = None,
commit: str = "",
pinned: bool = False,
json_repo_name: str = "",
):
super().__init__(location=location, repo=repo, commit=commit)
self.pinned: bool = pinned if self.type == InstallableType.COG else False
# this is here so that Downloader could use real repo name instead of "MISSING_REPO"
self._json_repo_name = json_repo_name
def to_json(self) -> Dict[str, Union[str, bool]]:
module_json: Dict[str, Union[str, bool]] = {
"repo_name": self.repo_name,
"module_name": self.name,
"commit": self.commit,
}
if self.type == InstallableType.COG:
module_json["pinned"] = self.pinned
return module_json
@classmethod
def from_json(cls, data: dict, repo_mgr: "RepoManager"):
repo_name = data["repo_name"]
cog_name = data["cog_name"]
def from_json(
cls, data: Dict[str, Union[str, bool]], repo_mgr: RepoManager
) -> InstalledModule:
repo_name = cast(str, data["repo_name"])
cog_name = cast(str, data["module_name"])
commit = cast(str, data.get("commit", ""))
pinned = cast(bool, data.get("pinned", False))
# TypedDict, where are you :/
repo = repo_mgr.get_repo(repo_name)
if repo is not None:
repo_folder = repo.folder_path
@@ -228,4 +281,12 @@ class Installable(RepoJSONMixin):
location = repo_folder / cog_name
return cls(location=location)
return cls(
location=location, repo=repo, commit=commit, pinned=pinned, json_repo_name=repo_name
)
@classmethod
def from_installable(cls, module: Installable, *, pinned: bool = False) -> InstalledModule:
return cls(
location=module._location, repo=module.repo, commit=module.commit, pinned=pinned
)

View File

@@ -1,5 +1,6 @@
import json
from pathlib import Path
from typing import Optional, Tuple, Dict, Any
class RepoJSONMixin:
@@ -8,18 +9,18 @@ class RepoJSONMixin:
def __init__(self, repo_folder: Path):
self._repo_folder = repo_folder
self.author = None
self.install_msg = None
self.short = None
self.description = None
self.author: Optional[Tuple[str, ...]] = None
self.install_msg: Optional[str] = None
self.short: Optional[str] = None
self.description: Optional[str] = None
self._info_file = repo_folder / self.INFO_FILE_NAME
if self._info_file.exists():
self._read_info_file()
self._info = {}
self._info: Dict[str, Any] = {}
def _read_info_file(self):
def _read_info_file(self) -> None:
if not (self._info_file.exists() or self._info_file.is_file()):
return

View File

@@ -1,3 +1,5 @@
from __future__ import annotations
import asyncio
import functools
import os
@@ -7,23 +9,71 @@ import shutil
import re
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path
from subprocess import run as sp_run, PIPE
from subprocess import run as sp_run, PIPE, CompletedProcess
from string import Formatter
from sys import executable
from typing import List, Tuple, Iterable, MutableMapping, Union, Optional
from typing import (
Any,
AsyncContextManager,
Awaitable,
Dict,
Generator,
Iterable,
List,
NamedTuple,
Optional,
Tuple,
)
from redbot.core import data_manager, commands
import discord
from redbot.core import data_manager, commands, Config
from redbot.core.utils import safe_delete
from redbot.core.i18n import Translator
from . import errors
from .installable import Installable, InstallableType
from .installable import Installable, InstallableType, InstalledModule
from .json_mixins import RepoJSONMixin
from .log import log
_ = Translator("RepoManager", __file__)
class Candidate(NamedTuple):
rev: str
object_type: str
description: str
class _RepoCheckoutCtxManager(
Awaitable[None], AsyncContextManager[None]
): # pylint: disable=duplicate-bases
def __init__(
self,
repo: Repo,
rev: Optional[str],
exit_to_rev: Optional[str] = None,
force_checkout: bool = False,
):
self.repo = repo
self.rev = rev
if exit_to_rev is None:
self.exit_to_rev = self.repo.commit
else:
self.exit_to_rev = exit_to_rev
self.force_checkout = force_checkout
self.coro = repo._checkout(self.rev, force_checkout=self.force_checkout)
def __await__(self) -> Generator[Any, None, None]:
return self.coro.__await__()
async def __aenter__(self) -> None:
await self
async def __aexit__(self, exc_type, exc, tb) -> None:
if self.rev is not None:
await self.repo._checkout(self.exit_to_rev, force_checkout=self.force_checkout)
class ProcessFormatter(Formatter):
def vformat(self, format_string, args, kwargs):
return shlex.split(super().vformat(format_string, args, kwargs))
@@ -38,27 +88,49 @@ class ProcessFormatter(Formatter):
class Repo(RepoJSONMixin):
GIT_CLONE = "git clone --recurse-submodules -b {branch} {url} {folder}"
GIT_CLONE_NO_BRANCH = "git clone --recurse-submodules {url} {folder}"
GIT_CURRENT_BRANCH = "git -C {path} rev-parse --abbrev-ref HEAD"
GIT_CURRENT_BRANCH = "git -C {path} symbolic-ref --short HEAD"
GIT_CURRENT_COMMIT = "git -C {path} rev-parse HEAD"
GIT_LATEST_COMMIT = "git -C {path} rev-parse {branch}"
GIT_HARD_RESET = "git -C {path} reset --hard origin/{branch} -q"
GIT_PULL = "git -C {path} pull --recurse-submodules -q --ff-only"
GIT_DIFF_FILE_STATUS = "git -C {path} diff --no-commit-id --name-status {old_hash} {new_hash}"
GIT_LOG = "git -C {path} log --relative-date --reverse {old_hash}.. {relative_file_path}"
GIT_DIFF_FILE_STATUS = (
"git -C {path} diff-tree --no-commit-id --name-status"
" -r -z --line-prefix='\t' {old_rev} {new_rev}"
)
GIT_LOG = "git -C {path} log --relative-date --reverse {old_rev}.. {relative_file_path}"
GIT_DISCOVER_REMOTE_URL = "git -C {path} config --get remote.origin.url"
GIT_CHECKOUT = "git -C {path} checkout {rev}"
GIT_GET_FULL_SHA1 = "git -C {path} rev-parse --verify {rev}^{{commit}}"
GIT_IS_ANCESTOR = (
"git -C {path} merge-base --is-ancestor {maybe_ancestor_rev} {descendant_rev}"
)
GIT_CHECK_IF_MODULE_EXISTS = "git -C {path} cat-file -e {rev}:{module_name}/__init__.py"
# ↓ this gives a commit after last occurrence
GIT_GET_LAST_MODULE_OCCURRENCE_COMMIT = (
"git -C {path} log --diff-filter=D --pretty=format:%H -n 1 {descendant_rev}"
" -- {module_name}/__init__.py"
)
PIP_INSTALL = "{python} -m pip install -U -t {target_dir} {reqs}"
MODULE_FOLDER_REGEX = re.compile(r"(\w+)\/")
AMBIGUOUS_ERROR_REGEX = re.compile(
r"^hint: {3}(?P<rev>[A-Za-z0-9]+) (?P<type>commit|tag) (?P<desc>.+)$", re.MULTILINE
)
def __init__(
self,
name: str,
url: str,
branch: str,
branch: Optional[str],
commit: str,
folder_path: Path,
available_modules: Tuple[Installable] = (),
loop: asyncio.AbstractEventLoop = None,
available_modules: Tuple[Installable, ...] = (),
loop: Optional[asyncio.AbstractEventLoop] = None,
):
self.url = url
self.branch = branch
self.commit = commit
self.name = name
@@ -73,12 +145,10 @@ class Repo(RepoJSONMixin):
self._repo_lock = asyncio.Lock()
self._loop = loop
if self._loop is None:
self._loop = asyncio.get_event_loop()
self._loop = loop if loop is not None else asyncio.get_event_loop()
@classmethod
async def convert(cls, ctx: commands.Context, argument: str):
async def convert(cls, ctx: commands.Context, argument: str) -> Repo:
downloader_cog = ctx.bot.get_cog("Downloader")
if downloader_cog is None:
raise commands.CommandError(_("No Downloader cog found."))
@@ -92,26 +162,82 @@ class Repo(RepoJSONMixin):
)
return poss_repo
def _existing_git_repo(self) -> (bool, Path):
def _existing_git_repo(self) -> Tuple[bool, Path]:
git_path = self.folder_path / ".git"
return git_path.exists(), git_path
async def _get_file_update_statuses(
self, old_hash: str, new_hash: str
) -> MutableMapping[str, str]:
async def is_ancestor(self, maybe_ancestor_rev: str, descendant_rev: str) -> bool:
"""
Gets the file update status letters for each changed file between
the two hashes.
:param old_hash: Pre-update
:param new_hash: Post-update
:return: Mapping of filename -> status_letter
Check if the first is an ancestor of the second.
Parameters
----------
maybe_ancestor_rev : `str`
Revision to check if it is ancestor of :code:`descendant_rev`
descendant_rev : `str`
Descendant revision
Returns
-------
bool
`True` if :code:`maybe_ancestor_rev` is
ancestor of :code:`descendant_rev` or `False` otherwise
"""
valid_exit_codes = (0, 1)
p = await self._run(
ProcessFormatter().format(
self.GIT_DIFF_FILE_STATUS,
self.GIT_IS_ANCESTOR,
path=self.folder_path,
old_hash=old_hash,
new_hash=new_hash,
maybe_ancestor_rev=maybe_ancestor_rev,
descendant_rev=descendant_rev,
),
valid_exit_codes=valid_exit_codes,
)
if p.returncode in valid_exit_codes:
return not bool(p.returncode)
raise errors.GitException(
f"Git failed to determine if commit {maybe_ancestor_rev}"
f" is ancestor of {descendant_rev} for repo at path: {self.folder_path}"
)
async def is_on_branch(self) -> bool:
"""
Check if repo is currently on branch.
Returns
-------
bool
`True` if repo is on branch or `False` otherwise
"""
return await self.latest_commit() == self.commit
async def _get_file_update_statuses(
self, old_rev: str, new_rev: Optional[str] = None
) -> Dict[str, str]:
"""
Gets the file update status letters for each changed file between the two revisions.
Parameters
----------
old_rev : `str`
Pre-update revision
new_rev : `str`, optional
Post-update revision, defaults to repo's branch if not given
Returns
-------
Dict[str, str]
Mapping of filename -> status_letter
"""
if new_rev is None:
new_rev = self.branch
p = await self._run(
ProcessFormatter().format(
self.GIT_DIFF_FILE_STATUS, path=self.folder_path, old_rev=old_rev, new_rev=new_rev
)
)
@@ -120,21 +246,156 @@ class Repo(RepoJSONMixin):
"Git diff failed for repo at path: {}".format(self.folder_path)
)
stdout = p.stdout.strip().decode().split("\n")
stdout = p.stdout.strip(b"\t\n\x00 ").decode().split("\x00\t")
ret = {}
for filename in stdout:
# TODO: filter these filenames by ones in self.available_modules
status, _, filepath = filename.partition("\t")
status, __, filepath = filename.partition("\x00") # NUL character
ret[filepath] = status
return ret
async def _get_commit_notes(self, old_commit_hash: str, relative_file_path: str) -> str:
async def get_last_module_occurrence(
self, module_name: str, descendant_rev: Optional[str] = None
) -> Optional[Installable]:
"""
Gets module's `Installable` from last commit in which it still occurs.
Parameters
----------
module_name : str
Name of module to get.
descendant_rev : `str`, optional
Revision from which the module's commit must be
reachable (i.e. descendant commit),
defaults to repo's branch if not given.
Returns
-------
`Installable`
Module from last commit in which it still occurs
or `None` if it couldn't be found.
"""
if descendant_rev is None:
descendant_rev = self.branch
p = await self._run(
ProcessFormatter().format(
self.GIT_CHECK_IF_MODULE_EXISTS,
path=self.folder_path,
rev=descendant_rev,
module_name=module_name,
),
debug_only=True,
)
if p.returncode == 0:
async with self.checkout(descendant_rev):
return discord.utils.get(self.available_modules, name=module_name)
p = await self._run(
ProcessFormatter().format(
self.GIT_GET_LAST_MODULE_OCCURRENCE_COMMIT,
path=self.folder_path,
descendant_rev=descendant_rev,
module_name=module_name,
)
)
if p.returncode != 0:
raise errors.GitException(
"Git log failed for repo at path: {}".format(self.folder_path)
)
commit = p.stdout.decode().strip()
if commit:
async with self.checkout(f"{commit}~"):
return discord.utils.get(self.available_modules, name=module_name)
return None
async def _is_module_modified(self, module: Installable, other_hash: str) -> bool:
"""
Checks if given module was different in :code:`other_hash`.
Parameters
----------
module : `Installable`
Module to check.
other_hash : `str`
Hash to compare module to.
Returns
-------
bool
`True` if module was different, `False` otherwise.
"""
if module.commit == other_hash:
return False
for status in await self._get_file_update_statuses(other_hash, module.commit):
match = self.MODULE_FOLDER_REGEX.match(status)
if match is not None and match.group(1) == module.name:
return True
return False
async def get_modified_modules(
self, old_rev: str, new_rev: Optional[str] = None
) -> Tuple[Installable, ...]:
"""
Gets modified modules between the two revisions.
For every module that doesn't exist in :code:`new_rev`,
it will try to find last commit, where it still existed
Parameters
----------
old_rev : `str`
Pre-update revision, ancestor of :code:`new_rev`
new_rev : `str`, optional
Post-update revision, defaults to repo's branch if not given
Returns
-------
`tuple` of `Installable`
List of changed modules between the two revisions.
"""
if new_rev is None:
new_rev = self.branch
modified_modules = set()
# check differences
for status in await self._get_file_update_statuses(old_rev, new_rev):
match = self.MODULE_FOLDER_REGEX.match(status)
if match is not None:
modified_modules.add(match.group(1))
async with self.checkout(old_rev):
# save old modules
old_hash = self.commit
old_modules = self.available_modules
# save new modules
await self.checkout(new_rev)
modules = []
new_modules = self.available_modules
for old_module in old_modules:
if old_module.name not in modified_modules:
continue
try:
index = new_modules.index(old_module)
except ValueError:
# module doesn't exist in this revision, try finding previous occurrence
module = await self.get_last_module_occurrence(old_module.name, new_rev)
if module is not None and await self._is_module_modified(module, old_hash):
modules.append(module)
else:
modules.append(new_modules[index])
return tuple(modules)
async def _get_commit_notes(self, old_rev: str, relative_file_path: str) -> str:
"""
Gets the commit notes from git log.
:param old_commit_hash: Point in time to start getting messages
:param old_rev: Point in time to start getting messages
:param relative_file_path: Path relative to the repo folder of the file
to get messages for.
:return: Git commit note log
@@ -143,7 +404,7 @@ class Repo(RepoJSONMixin):
ProcessFormatter().format(
self.GIT_LOG,
path=self.folder_path,
old_hash=old_commit_hash,
old_rev=old_rev,
relative_file_path=relative_file_path,
)
)
@@ -156,7 +417,47 @@ class Repo(RepoJSONMixin):
return p.stdout.decode().strip()
def _update_available_modules(self) -> Tuple[str]:
async def get_full_sha1(self, rev: str) -> str:
"""
Gets full sha1 object name.
Parameters
----------
rev : str
Revision to search for full sha1 object name.
Raises
------
.UnknownRevision
When git cannot find provided revision.
.AmbiguousRevision
When git cannot resolve provided short sha1 to one commit.
Returns
-------
`str`
Full sha1 object name for provided revision.
"""
p = await self._run(
ProcessFormatter().format(self.GIT_GET_FULL_SHA1, path=self.folder_path, rev=rev)
)
if p.returncode != 0:
stderr = p.stderr.decode().strip()
ambiguous_error = f"error: short SHA1 {rev} is ambiguous\nhint: The candidates are:\n"
if not stderr.startswith(ambiguous_error):
raise errors.UnknownRevision(f"Revision {rev} cannot be found.")
candidates = []
for match in self.AMBIGUOUS_ERROR_REGEX.finditer(stderr, len(ambiguous_error)):
candidates.append(Candidate(match["rev"], match["type"], match["desc"]))
if candidates:
raise errors.AmbiguousRevision(f"Short SHA1 {rev} is ambiguous.", candidates)
raise errors.UnknownRevision(f"Revision {rev} cannot be found.")
return p.stdout.decode().strip()
def _update_available_modules(self) -> Tuple[Installable, ...]:
"""
Updates the available modules attribute for this repo.
:return: List of available modules.
@@ -175,22 +476,114 @@ class Repo(RepoJSONMixin):
"""
for file_finder, name, is_pkg in pkgutil.iter_modules(path=[str(self.folder_path)]):
if is_pkg:
curr_modules.append(Installable(location=self.folder_path / name))
self.available_modules = curr_modules
curr_modules.append(
Installable(location=self.folder_path / name, repo=self, commit=self.commit)
)
self.available_modules = tuple(curr_modules)
# noinspection PyTypeChecker
return tuple(self.available_modules)
return self.available_modules
async def _run(self, *args, **kwargs):
async def _run(
self,
*args: Any,
valid_exit_codes: Tuple[int, ...] = (0,),
debug_only: bool = False,
**kwargs: Any,
) -> CompletedProcess:
"""
Parameters
----------
valid_exit_codes : tuple
Specifies valid exit codes, used to determine
if stderr should be sent as debug or error level in logging.
When not provided, defaults to :code:`(0,)`
debug_only : bool
Specifies if stderr can be sent only as debug level in logging.
When not provided, defaults to `False`
"""
env = os.environ.copy()
env["GIT_TERMINAL_PROMPT"] = "0"
kwargs["env"] = env
async with self._repo_lock:
return await self._loop.run_in_executor(
self._executor, functools.partial(sp_run, *args, stdout=PIPE, **kwargs)
p: CompletedProcess = await self._loop.run_in_executor(
self._executor,
functools.partial(sp_run, *args, stdout=PIPE, stderr=PIPE, **kwargs),
)
stderr = p.stderr.decode().strip()
if stderr:
if debug_only or p.returncode in valid_exit_codes:
log.debug(stderr)
else:
log.error(stderr)
return p
async def _setup_repo(self) -> None:
self.commit = await self.current_commit()
self._read_info_file()
self._update_available_modules()
async def _checkout(self, rev: Optional[str] = None, force_checkout: bool = False) -> None:
if rev is None:
return
if not force_checkout and self.commit == rev:
return
exists, __ = self._existing_git_repo()
if not exists:
raise errors.MissingGitRepo(
"A git repo does not exist at path: {}".format(self.folder_path)
)
async def clone(self) -> Tuple[str]:
p = await self._run(
ProcessFormatter().format(self.GIT_CHECKOUT, path=self.folder_path, rev=rev)
)
if p.returncode != 0:
raise errors.UnknownRevision(
"Could not checkout to {}. This revision may not exist".format(rev)
)
await self._setup_repo()
def checkout(
self,
rev: Optional[str] = None,
*,
exit_to_rev: Optional[str] = None,
force_checkout: bool = False,
) -> _RepoCheckoutCtxManager:
"""
Checks out repository to provided revision.
The return value of this method can also be used as an asynchronous
context manager, i.e. with :code:`async with` syntax. This will
checkout repository to :code:`exit_to_rev` on exit of the context manager.
Parameters
----------
rev : str, optional
Revision to checkout to, when not provided, method won't do anything
exit_to_rev : str, optional
Revision to checkout to after exiting context manager,
when not provided, defaults to current commit
This will be ignored, when used with :code:`await` or when :code:`rev` is `None`.
force_checkout : bool
When `True` checkout will be done even
if :code:`self.commit` is the same as target hash
(applies to exiting context manager as well)
If provided revision isn't full sha1 hash,
checkout will be done no matter to this parameter.
Defaults to `False`.
Raises
------
.UnknownRevision
When git cannot checkout to provided revision.
"""
return _RepoCheckoutCtxManager(self, rev, exit_to_rev, force_checkout)
async def clone(self) -> Tuple[Installable, ...]:
"""Clone a new repo.
Returns
@@ -224,9 +617,9 @@ class Repo(RepoJSONMixin):
if self.branch is None:
self.branch = await self.current_branch()
self._read_info_file()
await self._setup_repo()
return self._update_available_modules()
return self.available_modules
async def current_branch(self) -> str:
"""Determine the current branch using git commands.
@@ -237,7 +630,7 @@ class Repo(RepoJSONMixin):
The current branch name.
"""
exists, _ = self._existing_git_repo()
exists, __ = self._existing_git_repo()
if not exists:
raise errors.MissingGitRepo(
"A git repo does not exist at path: {}".format(self.folder_path)
@@ -254,9 +647,33 @@ class Repo(RepoJSONMixin):
return p.stdout.decode().strip()
async def current_commit(self, branch: str = None) -> str:
async def current_commit(self) -> str:
"""Determine the current commit hash of the repo.
Returns
-------
str
The requested commit hash.
"""
exists, __ = self._existing_git_repo()
if not exists:
raise errors.MissingGitRepo(
"A git repo does not exist at path: {}".format(self.folder_path)
)
p = await self._run(
ProcessFormatter().format(self.GIT_CURRENT_COMMIT, path=self.folder_path)
)
if p.returncode != 0:
raise errors.CurrentHashError("Unable to determine commit hash.")
return p.stdout.decode().strip()
async def latest_commit(self, branch: Optional[str] = None) -> str:
"""Determine the latest commit hash of the repo.
Parameters
----------
branch : `str`, optional
@@ -271,7 +688,7 @@ class Repo(RepoJSONMixin):
if branch is None:
branch = self.branch
exists, _ = self._existing_git_repo()
exists, __ = self._existing_git_repo()
if not exists:
raise errors.MissingGitRepo(
"A git repo does not exist at path: {}".format(self.folder_path)
@@ -282,11 +699,11 @@ class Repo(RepoJSONMixin):
)
if p.returncode != 0:
raise errors.CurrentHashError("Unable to determine old commit hash.")
raise errors.CurrentHashError("Unable to determine latest commit hash.")
return p.stdout.decode().strip()
async def current_url(self, folder: Path = None) -> str:
async def current_url(self, folder: Optional[Path] = None) -> str:
"""
Discovers the FETCH URL for a Git repo.
@@ -316,7 +733,7 @@ class Repo(RepoJSONMixin):
return p.stdout.decode().strip()
async def hard_reset(self, branch: str = None) -> None:
async def hard_reset(self, branch: Optional[str] = None) -> None:
"""Perform a hard reset on the current repo.
Parameters
@@ -328,7 +745,8 @@ class Repo(RepoJSONMixin):
if branch is None:
branch = self.branch
exists, _ = self._existing_git_repo()
await self.checkout(branch)
exists, __ = self._existing_git_repo()
if not exists:
raise errors.MissingGitRepo(
"A git repo does not exist at path: {}".format(self.folder_path)
@@ -345,7 +763,7 @@ class Repo(RepoJSONMixin):
" the following path: {}".format(self.folder_path)
)
async def update(self) -> (str, str):
async def update(self) -> Tuple[str, str]:
"""Update the current branch of this repo.
Returns
@@ -354,10 +772,9 @@ class Repo(RepoJSONMixin):
:py:code`(old commit hash, new commit hash)`
"""
curr_branch = await self.current_branch()
old_commit = await self.current_commit(branch=curr_branch)
old_commit = await self.latest_commit()
await self.hard_reset(branch=curr_branch)
await self.hard_reset()
p = await self._run(ProcessFormatter().format(self.GIT_PULL, path=self.folder_path))
@@ -367,14 +784,11 @@ class Repo(RepoJSONMixin):
" for the repo located at path: {}".format(self.folder_path)
)
new_commit = await self.current_commit(branch=curr_branch)
await self._setup_repo()
self._update_available_modules()
self._read_info_file()
return old_commit, self.commit
return old_commit, new_commit
async def install_cog(self, cog: Installable, target_dir: Path) -> bool:
async def install_cog(self, cog: Installable, target_dir: Path) -> InstalledModule:
"""Install a cog to the target directory.
Parameters
@@ -386,8 +800,13 @@ class Repo(RepoJSONMixin):
Returns
-------
bool
The success of the installation.
`InstalledModule`
Cog instance.
Raises
------
.CopyingError
When cog couldn't be copied.
"""
if cog not in self.available_cogs:
@@ -399,11 +818,14 @@ class Repo(RepoJSONMixin):
if not target_dir.exists():
raise ValueError("That target directory does not exist.")
return await cog.copy_to(target_dir=target_dir)
if not await cog.copy_to(target_dir=target_dir):
raise errors.CopyingError("There was an issue during copying of cog's files")
return InstalledModule.from_installable(cog)
async def install_libraries(
self, target_dir: Path, req_target_dir: Path, libraries: Tuple[Installable] = ()
) -> bool:
self, target_dir: Path, req_target_dir: Path, libraries: Iterable[Installable] = ()
) -> Tuple[Tuple[InstalledModule, ...], Tuple[Installable, ...]]:
"""Install shared libraries to the target directory.
If :code:`libraries` is not specified, all shared libraries in the repo
@@ -420,26 +842,30 @@ class Repo(RepoJSONMixin):
Returns
-------
bool
The success of the installation.
tuple
2-tuple of installed and failed libraries.
"""
if len(libraries) > 0:
if libraries:
if not all([i in self.available_libraries for i in libraries]):
raise ValueError("Some given libraries are not available in this repo.")
else:
libraries = self.available_libraries
if len(libraries) > 0:
ret = True
if libraries:
installed = []
failed = []
for lib in libraries:
ret = (
ret
and await self.install_requirements(cog=lib, target_dir=req_target_dir)
if not (
await self.install_requirements(cog=lib, target_dir=req_target_dir)
and await lib.copy_to(target_dir=target_dir)
)
return ret
return True
):
failed.append(lib)
else:
installed.append(InstalledModule.from_installable(lib))
return (tuple(installed), tuple(failed))
return ((), ())
async def install_requirements(self, cog: Installable, target_dir: Path) -> bool:
"""Install a cog's requirements.
@@ -466,7 +892,9 @@ class Repo(RepoJSONMixin):
return await self.install_raw_requirements(cog.requirements, target_dir)
async def install_raw_requirements(self, requirements: Tuple[str], target_dir: Path) -> bool:
async def install_raw_requirements(
self, requirements: Iterable[str], target_dir: Path
) -> bool:
"""Install a list of requirements using pip.
Parameters
@@ -482,7 +910,7 @@ class Repo(RepoJSONMixin):
Success of the installation
"""
if len(requirements) == 0:
if not requirements:
return True
# TODO: Check and see if any of these modules are already available
@@ -503,7 +931,7 @@ class Repo(RepoJSONMixin):
return True
@property
def available_cogs(self) -> Tuple[Installable]:
def available_cogs(self) -> Tuple[Installable, ...]:
"""`tuple` of `installable` : All available cogs in this Repo.
This excludes hidden or shared packages.
@@ -514,7 +942,7 @@ class Repo(RepoJSONMixin):
)
@property
def available_libraries(self) -> Tuple[Installable]:
def available_libraries(self) -> Tuple[Installable, ...]:
"""`tuple` of `installable` : All available shared libraries in this
Repo.
"""
@@ -524,11 +952,14 @@ class Repo(RepoJSONMixin):
)
@classmethod
async def from_folder(cls, folder: Path):
repo = cls(name=folder.stem, branch="", url="", folder_path=folder)
repo.branch = await repo.current_branch()
async def from_folder(cls, folder: Path, branch: str = "") -> Repo:
repo = cls(name=folder.stem, url="", branch=branch, commit="", folder_path=folder)
repo.url = await repo.current_url()
repo._update_available_modules()
if branch == "":
repo.branch = await repo.current_branch()
repo._update_available_modules()
else:
await repo.checkout(repo.branch, force_checkout=True)
return repo
@@ -537,11 +968,13 @@ class RepoManager:
GITHUB_OR_GITLAB_RE = re.compile(r"https?://git(?:hub)|(?:lab)\.com/")
TREE_URL_RE = re.compile(r"(?P<tree>/tree)/(?P<branch>\S+)$")
def __init__(self):
self._repos = {}
def __init__(self) -> None:
self._repos: Dict[str, Repo] = {}
self.conf = Config.get_conf(self, identifier=170708480, force_registration=True)
self.conf.register_global(repos={})
async def initialize(self):
await self._load_repos(set=True)
async def initialize(self) -> None:
await self._load_repos(set_repos=True)
@property
def repos_folder(self) -> Path:
@@ -583,14 +1016,17 @@ class RepoManager:
url, branch = self._parse_url(url, branch)
# noinspection PyTypeChecker
r = Repo(url=url, name=name, branch=branch, folder_path=self.repos_folder / name)
r = Repo(
url=url, name=name, branch=branch, commit="", folder_path=self.repos_folder / name
)
await r.clone()
await self.conf.repos.set_raw(name, value=r.branch)
self._repos[name] = r
return r
def get_repo(self, name: str) -> Union[Repo, None]:
def get_repo(self, name: str) -> Optional[Repo]:
"""Get a Repo object for a repository.
Parameters
@@ -606,7 +1042,11 @@ class RepoManager:
"""
return self._repos.get(name, None)
def get_all_repo_names(self) -> Tuple[str]:
@property
def repos(self) -> Tuple[Repo, ...]:
return tuple(self._repos.values())
def get_all_repo_names(self) -> Tuple[str, ...]:
"""Get all repo names.
Returns
@@ -617,7 +1057,20 @@ class RepoManager:
# noinspection PyTypeChecker
return tuple(self._repos.keys())
async def delete_repo(self, name: str):
def get_all_cogs(self) -> Tuple[Installable, ...]:
"""Get all cogs.
Returns
-------
`tuple` of `Installable`
"""
all_cogs: List[Installable] = []
for repo in self._repos.values():
all_cogs += repo.available_cogs
return tuple(all_cogs)
async def delete_repo(self, name: str) -> None:
"""Delete a repository and its folders.
Parameters
@@ -637,41 +1090,59 @@ class RepoManager:
safe_delete(repo.folder_path)
await self.conf.repos.clear_raw(repo.name)
try:
del self._repos[name]
except KeyError:
pass
async def update_repo(self, repo_name: str) -> MutableMapping[Repo, Tuple[str, str]]:
async def update_repo(self, repo_name: str) -> Tuple[Repo, Tuple[str, str]]:
"""Update repo with provided name.
Parameters
----------
name : str
The name of the repository to update.
Returns
-------
Tuple[Repo, Tuple[str, str]]
A 2-`tuple` with Repo object and a 2-`tuple` of `str`
containing old and new commit hashes.
"""
repo = self._repos[repo_name]
old, new = await repo.update()
return {repo: (old, new)}
return (repo, (old, new))
async def update_all_repos(self) -> MutableMapping[Repo, Tuple[str, str]]:
async def update_all_repos(self) -> Dict[Repo, Tuple[str, str]]:
"""Call `Repo.update` on all repositories.
Returns
-------
dict
A mapping of `Repo` objects that received new commits to a `tuple`
of `str` containing old and new commit hashes.
Dict[Repo, Tuple[str, str]]
A mapping of `Repo` objects that received new commits to
a 2-`tuple` of `str` containing old and new commit hashes.
"""
ret = {}
for repo_name, _ in self._repos.items():
repo, (old, new) = (await self.update_repo(repo_name)).popitem()
for repo_name, __ in self._repos.items():
repo, (old, new) = await self.update_repo(repo_name)
if old != new:
ret[repo] = (old, new)
return ret
async def _load_repos(self, set=False) -> MutableMapping[str, Repo]:
async def _load_repos(self, set_repos: bool = False) -> Dict[str, Repo]:
ret = {}
self.repos_folder.mkdir(parents=True, exist_ok=True)
for folder in self.repos_folder.iterdir():
if not folder.is_dir():
continue
try:
ret[folder.stem] = await Repo.from_folder(folder)
branch = await self.conf.repos.get_raw(folder.stem, default="")
ret[folder.stem] = await Repo.from_folder(folder, branch)
if branch == "":
await self.conf.repos.set_raw(folder.stem, value=ret[folder.stem].branch)
except errors.NoRemoteURL:
log.warning("A remote URL does not exist for repo %s", folder.stem)
except errors.DownloaderException as err:
@@ -683,7 +1154,7 @@ class RepoManager:
),
)
if set:
if set_repos:
self._repos = ret
return ret