guix-patches
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[bug#68405] [PATCH v2] guix: download: Add support for git repositories.


From: Maxim Cournoyer
Subject: [bug#68405] [PATCH v2] guix: download: Add support for git repositories.
Date: Thu, 18 Jan 2024 23:16:42 -0500
User-agent: Gnus/5.13 (Gnus v5.13)

Hello,

Romain GARBAGE <romain.garbage@inria.fr> writes:

> Added `--recursive' option.

I still see a TODO about supporting recursive repos in the code.  Is
that still the case?

> Removed `pk' call.
>
> * guix/scripts/download.scm (git-download-to-store*): Add new variable.
>   (copy-recursively-without-dot-git): New variable.
>   (git-download-to-file): Add new variable.
>   (show-help): Add 'git', 'commit', 'branch' and 'recursive'options
>   help message.
>   (%default-options): Add default value for 'git-reference' and
>   'recursive' options.
>   (%options): Add 'git', 'commit', 'branch' and 'recursive' command
>   line options.
>   (guix-download) [hash]: Compute hash with 'file-hash*' instead of
>   'port-hash' from (gcrypt hash) module. This allows us to compute
>   hashes for directories.
> * doc/guix.texi (Invoking guix-download): Add @item entries for
>   `git', `commit', `branch' and `recursive' options. Add a paragraph in
>   the introduction.
> * tests/guix-download.sh: New tests.

This sounds good and is something that I'm many many of us have wanted
for some time.  Thank you for working on it!

Nitpick about the commit message: the convention seems to be to not use
a hanging indent when writing GNU ChangeLog messages.

> ---
>  doc/guix.texi             |  23 ++++++
>  guix/scripts/download.scm | 146 ++++++++++++++++++++++++++++++++++----
>  tests/guix-download.sh    |  42 +++++++++++
>  3 files changed, 199 insertions(+), 12 deletions(-)
>
> diff --git a/doc/guix.texi b/doc/guix.texi
> index 3002cdfa13..d3b40e878b 100644
> --- a/doc/guix.texi
> +++ b/doc/guix.texi
> @@ -13983,6 +13983,9 @@ the certificates of X.509 authorities from the 
> directory pointed to by
>  the @env{SSL_CERT_DIR} environment variable (@pxref{X.509
>  Certificates}), unless @option{--no-check-certificate} is used.
>
> +Alternatively, @command{guix download} can also retrieve a Git
> +repository, possibly a specific commit, tag, or branch.
> +
>  The following options are available:
>
>  @table @code
> @@ -14007,6 +14010,26 @@ URL, which makes you vulnerable to 
> ``man-in-the-middle'' attacks.
>  @itemx -o @var{file}
>  Save the downloaded file to @var{file} instead of adding it to the
>  store.
> +
> +@item --git
> +@itemx -g
> +Checkout the Git repository at the latest commit on the default branch.
> +
> +@item --commit=@var{commit-or-tag}
> +Checkout the Git repository at @var{commit-or-tag}.
> +
> +@var{commit-or-tag} can be either a tag or a commit defined in the Git
> +repository.
> +
> +@item --branch=@var{branch}
> +Checkout the Git repository at @var{branch}.
> +
> +The repository will be checked out at the latest commit of @var{branch},
> +which must be a valid branch of the Git repository.
> +
> +@item --recursive
> +@itemx -r
> +Recursively clone the Git repository.
>  @end table
>
>  @node Invoking guix hash
> diff --git a/guix/scripts/download.scm b/guix/scripts/download.scm
> index 19052d5652..50c9a43791 100644
> --- a/guix/scripts/download.scm
> +++ b/guix/scripts/download.scm
> @@ -22,17 +22,23 @@ (define-module (guix scripts download)
>    #:use-module (guix scripts)
>    #:use-module (guix store)
>    #:use-module (gcrypt hash)
> +  #:use-module (guix hash)
>    #:use-module (guix base16)
>    #:use-module (guix base32)
>    #:autoload   (guix base64) (base64-encode)
>    #:use-module ((guix download) #:hide (url-fetch))
> +  #:use-module ((guix git)
> +                #:select (latest-repository-commit
> +                          update-cached-checkout))
>    #:use-module ((guix build download)
>                  #:select (url-fetch))
> +  #:use-module (guix build utils)
>    #:use-module ((guix progress)
>                  #:select (current-terminal-columns))
>    #:use-module ((guix build syscalls)
>                  #:select (terminal-columns))
>    #:use-module (web uri)
> +  #:use-module (ice-9 ftw)
>    #:use-module (ice-9 match)
>    #:use-module (srfi srfi-1)
>    #:use-module (srfi srfi-26)
> @@ -54,6 +60,57 @@ (define (download-to-file url file)
>         (url-fetch url file #:mirrors %mirrors)))
>      file))
>
> +;; This is a simplified version of 'copy-recursively'.
> +;; It allows us to filter out the ".git" subfolder.
> +;; TODO: Remove when 'copy-recursively' supports '#:select?'.

Is a #:select? planned for copy-recursively?  (in the works?)

> +(define (copy-recursively-without-dot-git source destination)
> +  (define strip-source
> +    (let ((len (string-length source)))
> +      (lambda (file)
> +        (substring file len))))
> +
> +  (file-system-fold (lambda (file stat result) ; enter?
> +                      (not (string-suffix? "/.git" file)))
> +                    (lambda (file stat result) ; leaf
> +                      (let ((dest (string-append destination
> +                                                 (strip-source file))))
> +                        (case (stat:type stat)
> +                          ((symlink)
> +                           (let ((target (readlink file)))
> +                             (symlink target dest)))
> +                          (else
> +                           (copy-file file dest)))))
> +                    (lambda (dir stat result) ; down
> +                      (let ((target (string-append destination
> +                                                   (strip-source dir))))
> +                        (mkdir-p target)))
> +                    (const #t)          ; up
> +                    (const #t)          ; skip
> +                    (lambda (file stat errno result)
> +                      (format (current-error-port) "i/o error: ~a: ~a~%"
> +                              file (strerror errno))
> +                      #f)
> +                    #t
> +                    source))
> +
> +(define (git-download-to-file url file reference recursive?)
> +  "Download the git repo at URL to file, checked out at REFERENCE.
> +REFERENCE must be a pair argument as understood by 
> 'latest-repository-commit'.
> +Return FILE."
> +  ;; TODO: Support recursive repos.
> +  ;; 'libgit2' doesn't support the URL format generated by 'uri->string' so
> +  ;; we have to do a little fixup. Dropping completely the 'file:' protocol
> +  ;; part gives better performance.
>
> +  (let ((url* (cond ((string-prefix? "file://" url)
> +                     (string-drop url (string-length "file://")))
> +                    ((string-prefix? "file:" url)
> +                     (string-drop url (string-length "file:")))
> +                    (else url))))
> +    (copy-recursively-without-dot-git
> +     (update-cached-checkout url* #:ref reference #:recursive? recursive?)
> +     file))
> +  file)
> +
>  (define (ensure-valid-store-file-name name)
>    "Replace any character not allowed in a store name by an underscore."
>
> @@ -67,17 +124,36 @@ (define valid
>                name))
>
>
> -(define* (download-to-store* url #:key (verify-certificate? #t))
> +(define* (download-to-store* url #:key (verify-certificate? #t) 
> #:allow-other-keys)
>    (with-store store
>      (download-to-store store url
>                         (ensure-valid-store-file-name (basename url))
>                         #:verify-certificate? verify-certificate?)))
>
> +(define* (git-download-to-store* url reference recursive? #:key 
> (verify-certificate? #t))
> +  "Download the git repository at URL to the store, checked out at REFERENCE.
> +URL must specify a protocol (i.e https:// or file://), REFERENCE must be a
> +pair argument as understood by 'latest-repository-commit'."
> +  ;; Ensure the URL string is properly formatted  when using the 'file' 
> protocol:
> +  ;; URL is generated using 'uri->string', which returns 
> "file:/path/to/file" instead of
> +  ;; "file:///path/to/file", which in turn makes 'git-download-to-store' 
> fail.
> +  (let* ((file? (string-prefix? "file:" url))
> +         (url* (if (and file?
> +                        (not (string-prefix? "file:///" url)))
> +                   (string-append "file://" (string-replace url "" 0 
> (string-length "file:")))
> +                   url)))
> +    (with-store store
> +      ;; TODO: Support recursive repos.
> +      ;; TODO: Verify certificate support and deactivation.
> +      (latest-repository-commit store url* #:recursive? recursive? #:ref 
> reference))))
> +

Some lines look like > 80 chars here.  Please break long lines
accordingly.

>  (define %default-options
>    ;; Alist of default option values.
>    `((format . ,bytevector->nix-base32-string)
>      (hash-algorithm . ,(hash-algorithm sha256))
>      (verify-certificate? . #t)
> +    (git-reference . #f)
> +    (recursive? . #f)
>      (download-proc . ,download-to-store*)))
>
>  (define (show-help)
> @@ -97,6 +173,19 @@ (define (show-help)
>                           do not validate the certificate of HTTPS servers "))
>    (format #t (G_ "
>    -o, --output=FILE      download to FILE"))
> +  (format #t (G_ "
> +  -g, --git              download the default branch's latest commit of the
> +                         git repository at URL"))
> +  (format #t (G_ "
> +      --commit=COMMIT_OR_TAG
> +                         download the given commit or tag of the git
> +                         repository at URL"))
> +  (format #t (G_ "
> +      --branch=BRANCH    download the given branch of the git repository
> +                         at URL"))
> +  (format #t (G_ "
> +  -r, --recursive        download a git repository recursively"))
> +
>    (newline)
>    (display (G_ "
>    -h, --help             display this help and exit"))
> @@ -105,6 +194,13 @@ (define (show-help)
>    (newline)
>    (show-bug-report-information))
>
> +(define (add-git-download-option result)
> +  (alist-cons 'download-proc
> +              ;; XXX: #:verify-certificate? currently ignored.
> +              (lambda* (url #:key verify-certificate? ref recursive?)
> +                (git-download-to-store* url ref recursive?))
> +              (alist-delete 'download result)))
> +
>  (define %options
>    ;; Specifications of the command-line options.
>    (list (option '(#\f "format") #t #f
> @@ -136,11 +232,36 @@ (define fmt-proc
>                    (alist-cons 'verify-certificate? #f result)))
>          (option '(#\o "output") #t #f
>                  (lambda (opt name arg result)
> -                  (alist-cons 'download-proc
> -                              (lambda* (url #:key verify-certificate?)
> -                                (download-to-file url arg))
> -                              (alist-delete 'download result))))
> -
> +                  (let* ((git
> +                          (assoc-ref result 'git-reference)))
> +                    (if git
> +                        (alist-cons 'download-proc
> +                                    (lambda* (url #:key verify-certificate? 
> ref recursive?)
> +                                      (git-download-to-file url arg 
> (assoc-ref result 'git-reference) recursive?))
> +                                    (alist-delete 'download result))
> +                        (alist-cons 'download-proc
> +                                    (lambda* (url #:key verify-certificate? 
> #:allow-other-keys)
> +                                      (download-to-file url arg))
> +                                    (alist-delete 'download result))))))
> +        (option '(#\g "git") #f #f
> +                (lambda (opt name arg result)
> +                  ;; Ignore this option if 'commit' or 'branch' has
> +                  ;; already been provided
> +                  (if (assoc-ref result 'git-reference)
> +                      result
> +                      (alist-cons 'git-reference '()
> +                                  (add-git-download-option result)))))
> +        (option '("commit") #t #f
> +                (lambda (opt name arg result)
> +                  (alist-cons 'git-reference `(tag-or-commit . ,arg)
> +                              (add-git-download-option result))))
> +        (option '("branch") #t #f
> +                (lambda (opt name arg result)
> +                  (alist-cons 'git-reference `(branch . ,arg)
> +                              (alist-delete 'git-reference result))))
> +        (option '(#\r "recursive") #f #f
> +                (lambda (opt name arg result)
> +                  (alist-cons 'recursive? #t result)))
>          (option '(#\h "help") #f #f
>                  (lambda args
>                    (leave-on-EPIPE (show-help))
> @@ -183,12 +304,13 @@ (define (parse-options)
>                                    (terminal-columns)))
>                      (fetch (uri->string uri)
>                             #:verify-certificate?
> -                           (assq-ref opts 'verify-certificate?))))
> -           (hash  (call-with-input-file
> -                      (or path
> -                          (leave (G_ "~a: download failed~%")
> -                                 arg))
> -                    (cute port-hash (assoc-ref opts 'hash-algorithm) <>)))
> +                           (assq-ref opts 'verify-certificate?)
> +                           #:ref (assq-ref opts 'git-reference)
> +                           #:recursive? (assq-ref opts 'recursive?))))
> +           (hash  (let* ((path* (or path
> +                                  (leave (G_ "~a: download failed~%")
> +                                         arg))))
> +                   (file-hash* path* #:algorithm (assoc-ref opts 
> 'hash-algorithm))))

Here also there are some too long lines in the above hunks; please break
long lines so they fit within the 80 characters limit.

>             (fmt   (assq-ref opts 'format)))
>        (format #t "~a~%~a~%" path (fmt hash))
>        #t)))
> diff --git a/tests/guix-download.sh b/tests/guix-download.sh
> index f4cb335eef..3bf63c4b12 100644
> --- a/tests/guix-download.sh
> +++ b/tests/guix-download.sh
> @@ -45,4 +45,46 @@ cmp "$output" "$abs_top_srcdir/README"
>  # This one should fail.
>  guix download "file:///does-not-exist" "file://$abs_top_srcdir/README" && 
> false
>
> +# Test git support with local repository

Nitpick: please punctuate standalone comments (here, a missing period).

> +test_directory="$(mktemp -d)"
> +trap 'chmod -Rf +w "$test_directory"; rm -rf "$test_directory" ; rm -f 
> "$output"' EXIT

the 'chmod' doesn't seem to be useful; since we force removing with -f ?
And where did the $output variable come from?

> +
> +# Create a dummy git repo in the temporary directory
> +(
> +    cd $test_directory
> +    git init
> +    touch test
> +    git config user.name "User"
> +    git config user.email "user@domain"
> +    git add test
> +    git commit -m "Commit"
> +    git tag -a -m "v1" v1
> +)
> +
> +# Extract commit number
> +commit=$((cd $test_directory && git log) | head -n 1 | cut -f2 -d' ')
> +
> +# We expect that guix hash is working properly or at least that the output of
> +# 'guix download' is consistent with 'guix hash'
> +expected_hash=$(guix hash -rx $test_directory)
> +
> +# Test the different options
> +for option in "" "--commit=$commit" "--commit=v1" "--branch=master"
> +do
> +    command_output="$(guix download --git $option "file://$test_directory")"
> +    computed_hash="$(echo $command_output | cut -f2 -d' ')"
> +    store_path="$(echo $command_output | cut -f1 -d' ')"
> +    [ "$expected_hash" = "$computed_hash" ]
> +    diff -r -x ".git" $test_directory $store_path
> +done
> +
> +# Should fail
> +guix download --git --branch=non_existent "file://$test_directory" && false
> +
> +# Same but download to file instead of store
> +tmpdir="t-archive-dir-$$"
> +trap 'chmod -Rf +w "$test_directory"; rm -rf "$test_directory" ; rm -f 
> "$output" ; rm -rf "$tmpdir"' EXIT

It'd look nicer if there was a single global trap call at the top of
these tests.  Don't forget to  punctuate your comments :-).

Otherwise, it looks good to me, although I haven't tried it.

-- 
Thanks,
Maxim





reply via email to

[Prev in Thread] Current Thread [Next in Thread]