GNU bug report logs - #69794
[PATCH 0/2] Package some dependencies for Argos Translate

Previous Next

Package: guix-patches;

Reported by: Nguyễn Gia Phong <mcsinyx <at> disroot.org>

Date: Thu, 14 Mar 2024 08:32:02 UTC

Severity: normal

Tags: patch

To reply to this bug, email your comments to 69794 AT debbugs.gnu.org.

Toggle the display of automated, internal messages from the tracker.

View this report as an mbox folder, status mbox, maintainer mbox


Report forwarded to guix-patches <at> gnu.org:
bug#69794; Package guix-patches. (Thu, 14 Mar 2024 08:32:02 GMT) Full text and rfc822 format available.

Acknowledgement sent to Nguyễn Gia Phong <mcsinyx <at> disroot.org>:
New bug report received and forwarded. Copy sent to guix-patches <at> gnu.org. (Thu, 14 Mar 2024 08:32:02 GMT) Full text and rfc822 format available.

Message #5 received at submit <at> debbugs.gnu.org (full text, mbox):

From: Nguyễn Gia Phong <mcsinyx <at> disroot.org>
To: guix-patches <at> gnu.org
Cc: Nguyễn Gia Phong <mcsinyx <at> disroot.org>
Subject: [PATCH 0/2] Package some dependencies for Argos Translate
Date: Thu, 14 Mar 2024 17:29:17 +0900
Argos Translate <https://www.argosopentech.com>
is an offline translation library based on OpenNMT.

Below are some of its dependencies that are trivial to package.
The last one missing is CTranslate2 <https://opennmt.net/CTranslate2>.

Nguyễn Gia Phong (2):
  gnu: Add python-sacremoses.
  gnu: Add python-stanza.

 gnu/packages/machine-learning.scm | 30 +++++++++++++++++++++++++++
 gnu/packages/python-xyz.scm       | 34 +++++++++++++++++++++++++++++++
 2 files changed, 64 insertions(+)


base-commit: 76a3414a1bc500626a9feca013673f994eb51a34
-- 
2.41.0





Information forwarded to lars <at> 6xq.net, marius <at> gnu.org, me <at> bonfacemunyoki.com, sharlatanus <at> gmail.com, jgart <at> dismail.de, guix-patches <at> gnu.org:
bug#69794; Package guix-patches. (Thu, 14 Mar 2024 08:34:01 GMT) Full text and rfc822 format available.

Message #8 received at submit <at> debbugs.gnu.org (full text, mbox):

From: Nguyễn Gia Phong <mcsinyx <at> disroot.org>
To: guix-patches <at> gnu.org
Cc: Nguyễn Gia Phong <mcsinyx <at> disroot.org>
Subject: [PATCH 1/2] gnu: Add python-sacremoses.
Date: Thu, 14 Mar 2024 17:32:22 +0900
* gnu/packages/python-xyz.scm (python-sacremoses): New variable.

Change-Id: I2c2cd94c054d7e952ffb4b3afdedd2ee8ce905bf
---
 gnu/packages/python-xyz.scm | 34 ++++++++++++++++++++++++++++++++++
 1 file changed, 34 insertions(+)

diff --git a/gnu/packages/python-xyz.scm b/gnu/packages/python-xyz.scm
index 232b5d69993c..ad33d98db142 100644
--- a/gnu/packages/python-xyz.scm
+++ b/gnu/packages/python-xyz.scm
@@ -149,6 +149,7 @@
 ;;; Copyright © 2024 Timothee Mathieu <timothee.mathieu <at> inria.fr>
 ;;; Copyright © 2024 Ian Eure <ian <at> retrospec.tv>
 ;;; Copyright © 2024 Adriel Dumas--Jondeau <leirda <at> disroot.org>
+;;; Copyright © 2024 Nguyễn Gia Phong <mcsinyx <at> disroot.org>
 ;;;
 ;;; This file is part of GNU Guix.
 ;;;
@@ -21897,6 +21898,39 @@ (define-public python-nltk
      reasoning, wrappers for natural language processing libraries.")
     (license license:asl2.0)))
 
+(define-public python-sacremoses
+  (package
+    (name "python-sacremoses")
+    (version "0.1.0")
+    (source (origin
+              (method git-fetch)
+              (uri (git-reference
+                     (url "https://github.com/hplt-project/sacremoses")
+                     (commit version)))
+              (sha256
+                (base32
+                  "0g70vchfniknp65n4wnx7chg6g49d4xrz1wagv7f7ir2swdzyn9b"))))
+    (build-system python-build-system)
+    (arguments
+      '(#:phases
+         (modify-phases %standard-phases
+           (replace 'check
+             (lambda* (#:key tests? #:allow-other-keys)
+               (when tests?
+                 ;; Skip truecaser tests which fetch https://norvig.com/big.txt
+                 (invoke "python" "-m" "unittest"
+                         "sacremoses/test/test_corpus.py"
+                         "sacremoses/test/test_no_redos_has_numeric_only.py"
+                         "sacremoses/test/test_normalizer.py"
+                         "sacremoses/test/test_tokenizer.py")))))))
+    (propagated-inputs
+      (list python-click-7 python-joblib python-regex python-tqdm))
+    (home-page "https://github.com/hplt-project/sacremoses")
+    (synopsis "Natural language tokenizer, truecaser and normalizer")
+    (description "SacreMoses is a Python port of Moses'
+tokenizer, detokenizer, truecaser and punctuation normalizer.")
+    (license license:expat)))
+
 (define-public python-pymongo
   (package
     (name "python-pymongo")
-- 
2.41.0





Information forwarded to guix-patches <at> gnu.org:
bug#69794; Package guix-patches. (Thu, 14 Mar 2024 08:34:02 GMT) Full text and rfc822 format available.

Message #11 received at submit <at> debbugs.gnu.org (full text, mbox):

From: Nguyễn Gia Phong <mcsinyx <at> disroot.org>
To: guix-patches <at> gnu.org
Cc: Nguyễn Gia Phong <mcsinyx <at> disroot.org>
Subject: [PATCH 2/2] gnu: Add python-stanza.
Date: Thu, 14 Mar 2024 17:32:23 +0900
* gnu/packages/machine-learning.scm (python-stanza): New variable.

Change-Id: Ibde67dcb8a015b91554f6a1e36dbf5eef0b73f36
---
 gnu/packages/machine-learning.scm | 30 ++++++++++++++++++++++++++++++
 1 file changed, 30 insertions(+)

diff --git a/gnu/packages/machine-learning.scm b/gnu/packages/machine-learning.scm
index 5c18a2e9d57d..5e403d905c49 100644
--- a/gnu/packages/machine-learning.scm
+++ b/gnu/packages/machine-learning.scm
@@ -27,6 +27,7 @@
 ;;; Copyright © 2024 David Pflug <david <at> pflug.io>
 ;;; Copyright © 2024 Timothee Mathieu <timothee.mathieu <at> inria.fr>
 ;;; Copyright © 2024 Spencer King <spencer.king <at> geneoscopy.com>
+;;; Copyright © 2024 Nguyễn Gia Phong <mcsinyx <at> disroot.org>
 ;;;
 ;;; This file is part of GNU Guix.
 ;;;
@@ -1127,6 +1128,35 @@ (define-public python-spacy
 model packaging, deployment and workflow management.")
     (license license:expat)))
 
+(define-public python-stanza
+  (package
+    (name "python-stanza")
+    (version "1.8.1")
+    (source
+      (origin
+        (method url-fetch)
+        (uri (pypi-uri "stanza" version))
+        (sha256
+          (base32 "1drq9wyafisnf44jgby1sh45svp0pj2svb01v397i9h0bczc5i08"))))
+    (build-system python-build-system)
+    (propagated-inputs (list python-emoji
+                             python-numpy
+                             python-protobuf
+                             python-requests
+                             python-networkx
+                             python-toml
+                             python-pytorch
+                             python-tqdm))
+    ;; Tests require downloading of datasets.
+    (arguments (list #:tests? #false))
+    (home-page "https://stanfordnlp.github.io/stanza")
+    (synopsis "Stanford NLP Python library for many human languages")
+    (description "Stanza is a collection of accurate and efficient tools
+for the linguistic analysis of many human languages.  Starting from raw text,
+Stanza divides it into sentences and words, and then can recognize
+parts of speech and entities, do syntactic analysis, and more.")
+    (license license:asl2.0)))
+
 (define-public shogun
   (package
     (name "shogun")
-- 
2.41.0





Information forwarded to guix-patches <at> gnu.org:
bug#69794; Package guix-patches. (Thu, 14 Mar 2024 08:35:01 GMT) Full text and rfc822 format available.

Information forwarded to lars <at> 6xq.net, marius <at> gnu.org, me <at> bonfacemunyoki.com, sharlatanus <at> gmail.com, jgart <at> dismail.de, guix-patches <at> gnu.org:
bug#69794; Package guix-patches. (Thu, 14 Mar 2024 08:36:02 GMT) Full text and rfc822 format available.

This bug report was last modified 51 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.