GNU bug report logs - #59355
[PATCH] gnu: Add dataparksearch.


Package: guix-patches;

Reported by: Adam Faiz <adam.faiz <at> disroot.org>

Date: Fri, 18 Nov 2022 11:18:02 UTC

Severity: normal

Tags: patch

Done: Nicolas Goaziou <mail <at> nicolasgoaziou.fr>

Bug is archived. No further changes may be made.

To add a comment to this bug, you must first unarchive it, by sending
a message to control AT debbugs.gnu.org, with unarchive 59355 in the body.
You can then email your comments to 59355 AT debbugs.gnu.org in the normal way.



Report forwarded to guix-patches <at> gnu.org:
bug#59355; Package guix-patches. (Fri, 18 Nov 2022 11:18:02 GMT) Full text and rfc822 format available.

Acknowledgement sent to Adam Faiz <adam.faiz <at> disroot.org>:
New bug report received and forwarded. Copy sent to guix-patches <at> gnu.org. (Fri, 18 Nov 2022 11:18:02 GMT) Full text and rfc822 format available.

Message #5 received at submit <at> debbugs.gnu.org (full text, mbox):

From: Adam Faiz <adam.faiz <at> disroot.org>
To: guix-patches <at> gnu.org
Subject: [PATCH] gnu: Add dataparksearch.
Date: Fri, 18 Nov 2022 19:17:27 +0800
From 39b9fb1e69fb61acdadb697690becb908d99c607 Mon Sep 17 00:00:00 2001
From: AwesomeAdam54321 <adam.faiz <at> disroot.org>
Date: Fri, 18 Nov 2022 08:55:02 +0800
Subject: [PATCH] gnu: Add dataparksearch.

* gnu/packages/search.scm (dataparksearch): New variable.
---
 gnu/packages/search.scm | 61 +++++++++++++++++++++++++++++++++++++++++
 1 file changed, 61 insertions(+)

diff --git a/gnu/packages/search.scm b/gnu/packages/search.scm
index 27dc9cd02e..db3ca59137 100644
--- a/gnu/packages/search.scm
+++ b/gnu/packages/search.scm
@@ -40,6 +40,7 @@ (define-module (gnu packages search)
   #:use-module (guix build-system python)
   #:use-module (guix build-system meson)
   #:use-module (gnu packages)
+  #:use-module (gnu packages adns)
   #:use-module (gnu packages aspell)
   #:use-module (gnu packages autotools)
   #:use-module (gnu packages base)
@@ -51,6 +52,7 @@ (define-module (gnu packages search)
   #:use-module (gnu packages gawk)
   #:use-module (gnu packages gettext)
   #:use-module (gnu packages glib)
+  #:use-module (gnu packages gnunet)
   #:use-module (gnu packages groff)
   #:use-module (gnu packages gtk)
   #:use-module (gnu packages icu4c)
@@ -67,6 +69,7 @@ (define-module (gnu packages search)
   #:use-module (gnu packages qt)
   #:use-module (gnu packages sphinx)
   #:use-module (gnu packages time)
+  #:use-module (gnu packages tls)
   #:use-module (gnu packages web)
   #:use-module (gnu packages xdisorg)
   #:use-module (gnu packages xml)
@@ -729,4 +732,62 @@ (define-public ugrep
 formats.")
     (license license:bsd-3)))

+(define-public dataparksearch
+  (package
+    (name "dataparksearch")
+    (version "4.54-2016-12-03")
+    (source (origin
+              (method git-fetch)
+              (uri (git-reference
+                    (url "https://github.com/Maxime2/dataparksearch")
+                    (commit version)))
+              (sha256
+               (base32 "1g5kxw2d8jbc1h9yyy0xpnd3gkscj5a32k6hk3brvdwcbsnjbgyg"))
+              (modules '((guix build utils)))
+              (snippet
+               #~(begin
+                   (for-each delete-file-recursively '("config.sub"
+                                                       "config.guess"
+                                                       "configure"
+                                                       "Makefile.in"
+                                                       "missing"
+                                                       "depcomp"
+                                                       "ltmain.sh"
+                                                       "compile"
+                                                       ))))
+              (file-name (git-file-name name version))))
+    (build-system gnu-build-system)
+    (native-inputs
+     (list pkg-config automake autoconf libtool openjade))
+    (inputs
+     (list mbedtls-apache zlib postgresql aspell c-ares libextractor))
+    (arguments
+     (list #:configure-flags
+           #~(list "--disable-syslog"
+                   "--with-gnu-ld"
+                   "--with-extra-charsets=all"
+                   (string-append "--with-pgsql=" #$(this-package-input "postgresql")))
+           #:make-flags
+           #~(list "DPS_TEST_DBADDR=postgresql://localhost/tmp/postgresql/")
+           #:phases
+           #~(modify-phases %standard-phases
+               (add-before 'bootstrap 'fix-configure
+                 (lambda _
+                   (substitute* "configure.ac"
+                     (("MY_DIRS=\"/usr/local/include")
+                      (string-append "MY_DIRS=\"" #$(this-package-input "aspell") "/include"))
+                     (("MY_DIRS=\"/usr/lib")
+                      (string-append "MY_DIRS=\"" #$(this-package-input "aspell") "/lib"))))))))
+    (synopsis "Libre, feature rich search engine")
+    (description
+     "Dataparksearch is a full featured web search engine.
+It has support for http, https, ftp (passive mode), nntp and news URL schemes, and
+other URL schemes with external parsers.
+It can tweak URLs with session IDs and other weird formats,
+including some JavaScript link decoding.
+Options to query with all words, all words near to each others, any words, or Boolean queries.
+A subset of VQL (Verity Query Language) is supported.")
+    (home-page "https://www.dataparksearch.org/ ")
+    (license license:gpl3+)))
+
 ;;; search.scm ends here
-- 
2.38.0




Information forwarded to guix-patches <at> gnu.org:
bug#59355; Package guix-patches. (Sat, 18 Feb 2023 01:45:01 GMT) Full text and rfc822 format available.

Message #8 received at submit <at> debbugs.gnu.org (full text, mbox):

From: Tobias Geerinckx-Rice <me <at> tobias.gr>
To: Adam Faiz <adam.faiz <at> disroot.org>
Cc: 59355 <at> debbugs.gnu.org, guix-patches <at> gnu.org
Subject: Re: [bug#59355] [PATCH] gnu: Add dataparksearch.
Date: Sat, 18 Feb 2023 01:13:24 +0100
Hi Adam,

Adam Faiz via Guix-patches wrote:
> * gnu/packages/search.scm (dataparksearch): New variable.

Thanks!  I've applied the patch locally but stopped short of 
pushing.  I have several questions and [notes]:

> +(define-public dataparksearch
> +  (package
> +    (name "dataparksearch")
> +    (version "4.54-2016-12-03")

I don't think this tag is anything special compared to other 
commits.  Is it?

There are many more recent commits, up to 2022.  If the argument 
is ‘4.53 is ancient’ (it is!), should we just package the latest 
commit?

Would the latest release (4.53) instead still be useful today?

> +    (source (origin
> +              (method git-fetch)
> +              (uri (git-reference
> +                    (url "https://github.com/Maxime2/dataparksearch")
> +                    (commit version)))
> +              (sha256
> +               (base32 "1g5kxw2d8jbc1h9yyy0xpnd3gkscj5a32k6hk3brvdwcbsnjbgyg"))
> +              (modules '((guix build utils)))
> +              (snippet
> +               #~(begin
> +                   (for-each delete-file-recursively '("config.sub"
> +                                                       "config.guess"
> +                                                       "configure"

[All checked into Git.  Nice.  This snippet is ‘ugly’ because it 
addresses an ugly problem.  I think I'll keep it but add an 
apologetic comment.]

> +                                                       "Makefile.in"
> +                                                       "missing"
> +                                                       "depcomp"
> +                                                       "ltmain.sh"
> +                                                       "compile"
> +                                                       ))))

[We don't dangle brackets like this but keep them on the previous 
line.]

> +              (file-name (git-file-name name version))))
> +    (build-system gnu-build-system)
> +    (native-inputs
> +     (list pkg-config automake autoconf libtool openjade))
> +    (inputs
> +     (list mbedtls-apache zlib postgresql aspell c-ares libextractor))

[For consistency with other packages, I've moved the …inputs below 
the arguments field.]

> +     (list #:configure-flags
> +           #~(list "--disable-syslog"

Is this not useful?  If not, could you provide a short comment 
explaining why?

> +                   "--with-gnu-ld"

And here?

> +               (add-before 'bootstrap 'fix-configure
> +                 (lambda _
> +                   (substitute* "configure.ac"
> +                     (("MY_DIRS=\"/usr/local/include")
> +                      (string-append "MY_DIRS=\"" #$(this-package-input "aspell") "/include"))
> +                     (("MY_DIRS=\"/usr/lib")
> +                      (string-append "MY_DIRS=\"" #$(this-package-input "aspell") "/lib"))))))))

Why not use --with-aspell=… as you did for Postgres?

> +    (synopsis "Libre, feature rich search engine")

[We don't need to include ‘libre’ (or ‘Free’, ‘open-source’, …) in 
descriptions: there's no other software in Guix.  \o/ ]

> +    (description
> +     "Dataparksearch is a full featured web search engine.
> +It has support for http, https, ftp (passive mode), nntp and news URL schemes, and
> +other URL schemes with external parsers.
> +It can tweak URLs with session IDs and other weird formats,
> +including some JavaScript link decoding.
> +Options to query with all words, all words near to each others, any words, or Boolean queries.
> +A subset of VQL (Verity Query Language) is supported.")
> +    (home-page "https://www.dataparksearch.org/ ")

[Extra trailing ‘ ’ removed.]

> +    (license license:gpl3+)))

Why GPL3+?

[I was unable to get this package to build reproducibly, although 
I tried only disabling #:parallel-build?.]

Kind regards,

T G-R
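
The two formatting notes on the source snippet above (an explanatory comment, and closing brackets kept on the last line) can be illustrated with a minimal sketch. It assumes the `(guix gexp)` module is loadable, for example under `guix repl`. The variable name `delete-generated-files` and the comment wording are hypothetical and are not taken from the commit that was eventually pushed.

```scheme
(use-modules (guix gexp))

;; Hypothetical name; in the package this gexp would be the value of the
;; origin's `snippet' field rather than a top-level definition.
(define delete-generated-files
  ;; Upstream checks Autotools-generated files into Git.  Delete them so
  ;; that they are regenerated from configure.ac at build time.
  #~(begin
      (for-each delete-file
                '("config.sub"
                  "config.guess"
                  "configure"
                  "Makefile.in"
                  "missing"
                  "depcomp"
                  "ltmain.sh"
                  "compile"))))   ; closing brackets stay on the last line
```

The sketch uses `delete-file` on the assumption that every entry is a regular file; the original patch used `delete-file-recursively`.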

Information forwarded to guix-patches <at> gnu.org:
bug#59355; Package guix-patches. (Sat, 18 Feb 2023 01:45:02 GMT) Full text and rfc822 format available.

Information forwarded to guix-patches <at> gnu.org:
bug#59355; Package guix-patches. (Sat, 18 Feb 2023 11:38:02 GMT) Full text and rfc822 format available.

Message #14 received at 59355 <at> debbugs.gnu.org (full text, mbox):

From: Adam Faiz <adam.faiz <at> disroot.org>
To: Tobias Geerinckx-Rice <me <at> tobias.gr>
Cc: 59355 <at> debbugs.gnu.org
Subject: Re: [bug#59355] [PATCH] gnu: Add dataparksearch.
Date: Sat, 18 Feb 2023 19:37:27 +0800
On 2/18/23 08:13, Tobias Geerinckx-Rice wrote:
> Hi Adam,
> 
> Adam Faiz via Guix-patches wrote:
>> * gnu/packages/search.scm (dataparksearch): New variable.
> 
> Thanks!  I've applied the patch locally but stopped short of pushing.  I 
> have several questions and [notes]:
> 
>> +(define-public dataparksearch
>> +  (package
>> +    (name "dataparksearch")
>> +    (version "4.54-2016-12-03")
> 
> I don't think this tag is anything special compared to other commits.  
> Is it?
> 
> There are many more recent commits, up to 2022.  If the argument is 
> ‘4.53 is ancient’ (it is!), should we just package the latest commit?
> Would the latest release (4.53) instead still be useful today?
Not really, I think it's best to package the latest commit.

>> +    (source (origin
>> +              (method git-fetch)
>> +              (uri (git-reference
>> +                    (url "https://github.com/Maxime2/dataparksearch")
>> +                    (commit version)))
>> +              (sha256
>> +               (base32 "1g5kxw2d8jbc1h9yyy0xpnd3gkscj5a32k6hk3brvdwcbsnjbgyg"))
>> +              (modules '((guix build utils)))
>> +              (snippet
>> +               #~(begin
>> +                   (for-each delete-file-recursively '("config.sub"
>> +                                                       "config.guess"
>> +                                                       "configure"
> 
> [All checked into Git.  Nice.  This snippet is ‘ugly’ because it 
> addresses an ugly problem.  I think I'll keep it but add an apologetic 
> comment.]
Sure.

>> +                                                       "Makefile.in"
>> +                                                       "missing"
>> +                                                       "depcomp"
>> +                                                       "ltmain.sh"
>> +                                                       "compile"
>> +                                                       ))))
> 
> [We don't dangle brackets like this but keep them on the previous line.]
I dangled the brackets because I couldn't keep track of them while I
was working on the package. It's about time I tried paredit or some
other solution that colours each bracket.

>> +              (file-name (git-file-name name version))))
>> +    (build-system gnu-build-system)
>> +    (native-inputs
>> +     (list pkg-config automake autoconf libtool openjade))
>> +    (inputs
>> +     (list mbedtls-apache zlib postgresql aspell c-ares libextractor))

> [For consistency with other packages, I've moved the …inputs below the 
> arguments field.]
Alright, my logic was to specify the inputs used before running the 
build commands since that's the normal flow for a packager.

> 
>> +     (list #:configure-flags
>> +           #~(list "--disable-syslog"
> 
> Is this not useful?  If not, could you provide a short comment 
> explaining why?
It's useful for people who use syslog, but syslog itself is suboptimal 
for its purpose. More info can be found here:
http://skarnet.org/software/s6/s6-log.html#diesyslogdiedie
Since the Shepherd (and other init systems) can keep the log output of 
the services it supervises, I was hoping that could be used instead.

>> +                   "--with-gnu-ld"
> 
> And here?
That's unnecessary, I just wanted to try it out for fun.

> 
>> +               (add-before 'bootstrap 'fix-configure
>> +                 (lambda _
>> +                   (substitute* "configure.ac"
>> +                     (("MY_DIRS=\"/usr/local/include")
>> +                      (string-append "MY_DIRS=\"" #$(this-package-input "aspell") "/include"))
>> +                     (("MY_DIRS=\"/usr/lib")
>> +                      (string-append "MY_DIRS=\"" #$(this-package-input "aspell") "/lib"))))))))
> 
> Why not use --with-aspell=… as you did for Postgres?
It's not available in 4.53, which is another reason why the latest 
commit should be used.

>> +    (license license:gpl3+)))
> 
> Why GPL3+?
It was a mistake.

> [I was unable to get this package to build reproducibly, although I 
> tried only disabling #:parallel-build?.]
I'll send an updated patch which fixes the issues you mentioned.

> Kind regards,
> 
> T G-R





Information forwarded to guix-patches <at> gnu.org:
bug#59355; Package guix-patches. (Sun, 19 Feb 2023 00:30:02 GMT) Full text and rfc822 format available.

Message #17 received at 59355 <at> debbugs.gnu.org (full text, mbox):

From: Adam Faiz <adam.faiz <at> disroot.org>
To: Tobias Geerinckx-Rice <me <at> tobias.gr>
Cc: 59355 <at> debbugs.gnu.org, guix-patches <at> gnu.org
Subject: [PATCH v1] gnu: Add dataparksearch.
Date: Sun, 19 Feb 2023 08:29:44 +0800
From 10067c9599d06ea2518b0adb561eaf2f825ae1a5 Mon Sep 17 00:00:00 2001
Message-Id: <10067c9599d06ea2518b0adb561eaf2f825ae1a5.1676766217.git.adam.faiz <at> disroot.org>
From: AwesomeAdam54321 <adam.faiz <at> disroot.org>
Date: Fri, 18 Nov 2022 08:55:02 +0800
Subject: [PATCH 1/6] gnu: Add dataparksearch.

* gnu/packages/search.scm (dataparksearch): New variable.
---
 gnu/packages/search.scm | 55 +++++++++++++++++++++++++++++++++++++++++
 1 file changed, 55 insertions(+)

diff --git a/gnu/packages/search.scm b/gnu/packages/search.scm
index fe69fd3775..262f63821d 100644
--- a/gnu/packages/search.scm
+++ b/gnu/packages/search.scm
@@ -40,6 +40,7 @@ (define-module (gnu packages search)
   #:use-module (guix build-system python)
   #:use-module (guix build-system meson)
   #:use-module (gnu packages)
+  #:use-module (gnu packages adns)
   #:use-module (gnu packages aspell)
   #:use-module (gnu packages autotools)
   #:use-module (gnu packages base)
@@ -51,6 +52,7 @@ (define-module (gnu packages search)
   #:use-module (gnu packages gawk)
   #:use-module (gnu packages gettext)
   #:use-module (gnu packages glib)
+  #:use-module (gnu packages gnunet)
   #:use-module (gnu packages groff)
   #:use-module (gnu packages gtk)
   #:use-module (gnu packages icu4c)
@@ -67,6 +69,7 @@ (define-module (gnu packages search)
   #:use-module (gnu packages qt)
   #:use-module (gnu packages sphinx)
   #:use-module (gnu packages time)
+  #:use-module (gnu packages tls)
   #:use-module (gnu packages web)
   #:use-module (gnu packages xdisorg)
   #:use-module (gnu packages xml)
@@ -328,6 +331,58 @@ (define-public bool
 for parsing HTML files.")
     (license license:gpl3+)))

+(define-public dataparksearch
+  (let ((commit "8efa28f31ce1273c0556fd5c7e06abe955197a69")
+             (revision "0"))
+  (package
+    (name "dataparksearch")
+    (version (string-append "4.54" revision "." (string-take commit 7) ))
+    (source (origin
+              (method git-fetch)
+              (uri (git-reference
+                    (url "https://github.com/Maxime2/dataparksearch")
+                    (commit commit)))
+              (sha256
+               (base32 "01z7s3ws5px2p9brzrq9j41jbdh1cvj8n8y3ghx45gfv1n319ipg"))
+              (modules '((guix build utils)))
+              (snippet
+               #~(begin
+                   (for-each delete-file '("config.sub"
+                                           "config.guess"
+                                           "configure"
+                                           "Makefile.in"
+                                           "missing"
+                                           "depcomp"
+                                           "ltmain.sh"
+                                           "compile"))))
+              (file-name (git-file-name name version))))
+    (build-system gnu-build-system)
+    (arguments
+     (list #:configure-flags
+           #~(list "--disable-syslog"
+                   "--with-extra-charsets=all"
+                   (string-append "--with-aspell="
+                                  #$(this-package-input "aspell"))
+                   (string-append "--with-pgsql="
+                                  #$(this-package-input "postgresql")))
+           #:make-flags
+           #~(list "DPS_TEST_DBADDR=postgresql://localhost/tmp/postgresql/")))
+    (native-inputs
+     (list pkg-config automake autoconf libtool openjade))
+    (inputs
+     (list mbedtls-apache zlib postgresql aspell c-ares libextractor))
+    (synopsis "Feature rich search engine")
+    (description
+     "Dataparksearch is a full featured web search engine.
+It has support for http, https, ftp (passive mode), nntp and news URL schemes, and
+other URL schemes with external parsers.
+It can tweak URLs with session IDs and other weird formats,
+including some JavaScript link decoding.
+Options to query with all words, all words near to each others, any words, or Boolean queries.
+A subset of VQL (Verity Query Language) is supported.")
+    (home-page "https://www.dataparksearch.org/")
+    (license license:gpl2+))))
+
 (define-public fsearch
   (package
     (name "fsearch")

base-commit: 08edbd2535ae622d319a51e6f877d23d75dc24f3
prerequisite-patch-id: 66a677452cec5da7e4db37327b344d02556968f6
prerequisite-patch-id: e13c3b4b4ed06ce521ff679ec64e3cd816183388
prerequisite-patch-id: 1eb4d567a14802495169b3e400e02de0b4941729
-- 
2.39.1
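
For reference, the version string in the patch above is assembled by hand with `string-append`. The following minimal sketch shows how that expression evaluates, next to the `git-version` helper from `(guix git-download)` that Guix packages commonly use for Git snapshots. It assumes a Guix checkout or `guix repl` so that the module can be loaded, and it does not claim to show what was eventually committed.

```scheme
(use-modules (guix git-download)   ; provides git-version
             (srfi srfi-13))       ; provides string-take

(define commit "8efa28f31ce1273c0556fd5c7e06abe955197a69")
(define revision "0")

;; The expression from the patch: there is no separator between the base
;; version "4.54" and the revision.
(display (string-append "4.54" revision "." (string-take commit 7)))
(newline)                          ; prints 4.540.8efa28f

;; git-version puts "-" after the base version and truncates the commit.
(display (git-version "4.54" revision commit))
(newline)                          ; prints 4.54-0.8efa28f
```

With the helper, the base version and the snapshot revision stay visually distinct, which makes the string easier to compare against upstream release numbers.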




Information forwarded to guix-patches <at> gnu.org:
bug#59355; Package guix-patches. (Sun, 19 Feb 2023 00:30:03 GMT) Full text and rfc822 format available.

Information forwarded to guix-patches <at> gnu.org:
bug#59355; Package guix-patches. (Sat, 22 Apr 2023 10:04:02 GMT) Full text and rfc822 format available.

Message #23 received at submit <at> debbugs.gnu.org (full text, mbox):

From: Nicolas Goaziou <mail <at> nicolasgoaziou.fr>
To: Adam Faiz via Guix-patches via <guix-patches <at> gnu.org>
Cc: 59355-done <at> debbugs.gnu.org, Tobias Geerinckx-Rice <me <at> tobias.gr>,
 Adam Faiz <adam.faiz <at> disroot.org>
Subject: Re: [bug#59355] [PATCH v1] gnu: Add dataparksearch.
Date: Sat, 22 Apr 2023 12:03:41 +0200
Hello,

Adam Faiz via Guix-patches via <guix-patches <at> gnu.org> writes:

>  From 10067c9599d06ea2518b0adb561eaf2f825ae1a5 Mon Sep 17 00:00:00 2001
> Message-Id: 
> <10067c9599d06ea2518b0adb561eaf2f825ae1a5.1676766217.git.adam.faiz <at> disroot.org>
> From: AwesomeAdam54321 <adam.faiz <at> disroot.org>
> Date: Fri, 18 Nov 2022 08:55:02 +0800
> Subject: [PATCH 1/6] gnu: Add dataparksearch.

Thank you.

I removed "--disable-syslog" since there was no clear consensus about
it (we can always add it again later, of course), reordered inputs
alphabetically, fixed indentation, added a copyright line for you in the
module and applied your patch.

Regards,
-- 
Nicolas Goaziou




Reply sent to Nicolas Goaziou <mail <at> nicolasgoaziou.fr>:
You have taken responsibility. (Sat, 22 Apr 2023 10:04:02 GMT) Full text and rfc822 format available.

Notification sent to Adam Faiz <adam.faiz <at> disroot.org>:
bug acknowledged by developer. (Sat, 22 Apr 2023 10:04:03 GMT) Full text and rfc822 format available.

bug archived. Request was from Debbugs Internal Request <help-debbugs <at> gnu.org> to internal_control <at> debbugs.gnu.org. (Sat, 20 May 2023 11:24:10 GMT) Full text and rfc822 format available.

