GNU bug report logs - #48211
28.0.50; eww strips whitespace between <mark> elements

Please note: This is a static page, with minimal formatting, updated once a day.
Click here to see this page with the latest information and nicer formatting.

Package: emacs; Reported by: Stefan Kangas <stefan@HIDDEN>; dated Mon, 3 May 2021 23:17:02 UTC; Maintainer for emacs is bug-gnu-emacs@HIDDEN.

Message received at 48211 <at> debbugs.gnu.org:


Received: (at 48211) by debbugs.gnu.org; 4 May 2021 00:51:15 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Mon May 03 20:51:15 2021
Received: from localhost ([127.0.0.1]:49703 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1ldjHP-000162-CY
	for submit <at> debbugs.gnu.org; Mon, 03 May 2021 20:51:15 -0400
Received: from mail-pl1-f178.google.com ([209.85.214.178]:46702)
 by debbugs.gnu.org with esmtp (Exim 4.84_2)
 (envelope-from <stefankangas@HIDDEN>) id 1ldjHN-00015u-74
 for 48211 <at> debbugs.gnu.org; Mon, 03 May 2021 20:51:13 -0400
Received: by mail-pl1-f178.google.com with SMTP id s20so3857412plr.13
 for <48211 <at> debbugs.gnu.org>; Mon, 03 May 2021 17:51:13 -0700 (PDT)
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
 d=1e100.net; s=20161025;
 h=x-gm-message-state:from:in-reply-to:references:mime-version:date
 :message-id:subject:to:cc:content-transfer-encoding;
 bh=yF4ftS2VNtPYLPylv7b6y7fLPuHnJAlC8xbcdjCog3s=;
 b=Rkkyxa9yK8+gLcZYJgY0wvdgxfaijw3WzAaNtCF/vvkBoYzD9lm3AoX/28TSxTa7zd
 oobTdfF4FkfFQ/OWuYdXMBj8DEjZl8JlVAjzGzYDWa3k4DVoJR0Rn/Lnk5I0E4K7BA0n
 87jW1dOca8p5OPhpUPzeuHjGMMkPus6NbNuXZY5SKz4wNi0cQW3FXV8dBvGRgjMZJWm9
 0EHhgQ8Zgh+bXWPDR4/tPNYFM047zTCs94CS5U2xUt3uWaeMO5XXg4vNtRW0S5MzX5B1
 98ePo27PjWBBK2N8n7NkotyKBELpx12ewJN1FNOKSBnuf5pVQGomKNlfsBpn+GnxKRhn
 E9uA==
X-Gm-Message-State: AOAM530d355bOTeuhyZHKjHhUWLGdbTi65SayvTJrezMsUy3+bdc8HVd
 MI2xTPKygTP1zMttrckhksZiyNUPz8xjVAP5u1E=
X-Google-Smtp-Source: ABdhPJymChpB0l3tQs1BlJltnQzemjIujJe1lXEwXb11RugSWraMT39rVQ+kwzjJYHfvoi3YKGq5JmCLU3U5wtPvGeE=
X-Received: by 2002:a17:902:24d:b029:ee:df5b:286 with SMTP id
 71-20020a170902024db02900eedf5b0286mr5324720plc.39.1620089467404; Mon, 03 May
 2021 17:51:07 -0700 (PDT)
Received: from 753933720722 named unknown by gmailapi.google.com with
 HTTPREST; Mon, 3 May 2021 19:51:06 -0500
From: Stefan Kangas <stefan@HIDDEN>
In-Reply-To: <CADwFkmkKmqjbLuEQNJD+Z7Q8CFgc2Oia+YUEkPXgSZ7rFx5_ag@HIDDEN>
References: <CADwFkm=vB2qufHVC0Q+_RMeDDC=YbgTBbs474jKO0ECY888YTA@HIDDEN>
 <87y2cvl6eg.fsf@HIDDEN>
 <CADwFkmkKmqjbLuEQNJD+Z7Q8CFgc2Oia+YUEkPXgSZ7rFx5_ag@HIDDEN>
MIME-Version: 1.0
Date: Mon, 3 May 2021 19:51:06 -0500
Message-ID: <CADwFkm=DB958HSh1qsG66bYAFH7wP-u316rz=adT6rNz2vB_Sg@HIDDEN>
Subject: Re: bug#48211: 28.0.50; eww strips whitespace between <mark> elements
To: "Basil L. Contovounesios" <contovob@HIDDEN>
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Spam-Score: 0.5 (/)
X-Debbugs-Envelope-To: 48211
Cc: Lars Ingebrigtsen <larsi@HIDDEN>, 48211 <at> debbugs.gnu.org
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -0.5 (/)

Stefan Kangas <stefan@HIDDEN> writes:

> FWIW, the below diff works around this bug for me.
>
> diff --git a/lisp/net/shr.el b/lisp/net/shr.el
> index cbdeb65ba8..3eb3a5bc49 100644
> --- a/lisp/net/shr.el
> +++ b/lisp/net/shr.el
> @@ -1485,6 +1485,12 @@ shr-tag-tt
>    ;; The `tt' tag is deprecated in favor of `code'.
>    (shr-tag-code dom))
>
> +(defun shr-tag-mark (dom)
> +  (shr-generic dom)
> +  ;; Hack to work around bug in libxml2 (Bug#48211):
> +  ;; https://gitlab.gnome.org/GNOME/libxml2/-/issues/247
> +  (insert " "))
> +
>  (defun shr-tag-ins (cont)
>    (let* ((start (point))
>           (color "green")

Well, I should moderate that statement.

It doesn't exactly fix the bug as I'm now getting this instead:

    1. f. Unidad ling=C3=BC=C3=ADstica , dotada generalmente de significado=
 , que
    se separa de las dem=C3=A1s mediante pausas potenciales en la
    pronunciaci=C3=B3n y blancos en la escritura .

    2. f. Representaci=C3=B3n gr=C3=A1fica de la palabra hablada .

    3. f. Facultad de hablar .

IOW, whitespace is added even if the following character is
punctuation...




Information forwarded to bug-gnu-emacs@HIDDEN:
bug#48211; Package emacs. Full text available.

Message received at 48211 <at> debbugs.gnu.org:


Received: (at 48211) by debbugs.gnu.org; 4 May 2021 00:35:43 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Mon May 03 20:35:43 2021
Received: from localhost ([127.0.0.1]:49617 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1ldj2N-0000wo-Jc
	for submit <at> debbugs.gnu.org; Mon, 03 May 2021 20:35:43 -0400
Received: from mail-pg1-f173.google.com ([209.85.215.173]:34324)
 by debbugs.gnu.org with esmtp (Exim 4.84_2)
 (envelope-from <stefankangas@HIDDEN>) id 1ldj2M-0000wh-8y
 for 48211 <at> debbugs.gnu.org; Mon, 03 May 2021 20:35:42 -0400
Received: by mail-pg1-f173.google.com with SMTP id z16so5153580pga.1
 for <48211 <at> debbugs.gnu.org>; Mon, 03 May 2021 17:35:42 -0700 (PDT)
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
 d=1e100.net; s=20161025;
 h=x-gm-message-state:from:in-reply-to:references:user-agent
 :mime-version:date:message-id:subject:to:cc;
 bh=mlFBIx3IaBhe7uQixLd2j9mDE6F85fKP2vS/PPkedtQ=;
 b=aa+6FLYaLSdSoZ3xjFN8nNWiJ3QpEQevvKM1HAUj7BV8NG0APpCVcaVhU9e+O8Xy64
 7gVWnY6HGa6A8TfNLXnCja54O4HEnqGTL+0bwuUqHKsCDiZY9u5b/+WQOw5nXqXFPQn/
 wkUwh6o4sTcXrtvy+6aaNxcVi1s+ohtMSG0AZOK8bzBMizt/yPkHIAnCEtw++jEiFde1
 eBykucpj5YG7viX8xIyJkQTsiBL9XLmwkJoovP/YFxhP7J+xWc7Kt+QCw1jfEqothJgh
 7mlt3vS7qO57DSK+ue6V3kZ8tv5zKPFubmQDD7/n9C8UCQdvHBubdcD6uTBijDJ/ZZla
 /JXw==
X-Gm-Message-State: AOAM532XEwro65Sqx7YqUA/VbF2A9aOtuGf0gsCR/MplWNvjTyb/DBQO
 Zef27l2T6wziBJFJg87Rpz+o48q2NLq1Mz8YBfM=
X-Google-Smtp-Source: ABdhPJzrRunWXyTuiSAGfMceq9zGy2dnFw/5pIDRwiA1mR1BAWECxah6Xxq+kMwkqo/hI8Biunm5xCeTrFpGaFhX2ro=
X-Received: by 2002:aa7:8113:0:b029:278:dfa6:8ac with SMTP id
 b19-20020aa781130000b0290278dfa608acmr21184109pfi.57.1620088536477; Mon, 03
 May 2021 17:35:36 -0700 (PDT)
Received: from 753933720722 named unknown by gmailapi.google.com with
 HTTPREST; Mon, 3 May 2021 19:35:35 -0500
From: Stefan Kangas <stefan@HIDDEN>
In-Reply-To: <87y2cvl6eg.fsf@HIDDEN> (Basil L. Contovounesios's message of
 "Tue, 04 May 2021 00:55:03 +0100")
References: <CADwFkm=vB2qufHVC0Q+_RMeDDC=YbgTBbs474jKO0ECY888YTA@HIDDEN>
 <87y2cvl6eg.fsf@HIDDEN>
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.0.50 (gnu/linux)
MIME-Version: 1.0
Date: Mon, 3 May 2021 19:35:35 -0500
Message-ID: <CADwFkmkKmqjbLuEQNJD+Z7Q8CFgc2Oia+YUEkPXgSZ7rFx5_ag@HIDDEN>
Subject: Re: bug#48211: 28.0.50; eww strips whitespace between <mark> elements
To: "Basil L. Contovounesios" <contovob@HIDDEN>
Content-Type: text/plain; charset="UTF-8"
X-Spam-Score: 0.5 (/)
X-Debbugs-Envelope-To: 48211
Cc: Lars Ingebrigtsen <larsi@HIDDEN>, 48211 <at> debbugs.gnu.org
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -0.5 (/)

"Basil L. Contovounesios" <contovob@HIDDEN> writes:

> I think this is because libxml-parse-html-region specifies
> HTML_PARSE_NOBLANKS:
>
> Return CDATA sections (like <style>foo</style>) as text nodes.
> 3c2317e891 2010-12-06 17:59:52 +0100
> https://git.sv.gnu.org/cgit/emacs.git/commit/?id=3c2317e89100833812a7194c0d9d39ae0f52cb33

Hmm, okay.  For now, I'm seeing this issue with basically any tag that
libxml2 does not already know about, e.g. "<summary>" or "<bdi>".

This is what I came up with before reading Basil's reply:

(with-temp-buffer
  (insert "<p><tt>foo</tt> <tt>bar</tt></p>")
  (libxml-parse-html-region (point-min) (point-max)))

=> (html nil (body nil (p nil (tt nil "foo") " " (tt nil "bar"))))

(with-temp-buffer
  (insert "<p><mark>foo</mark> <mark>bar</mark></p>")
  (libxml-parse-html-region (point-min) (point-max)))

=> (html nil (body nil (p nil (mark nil "foo") (mark nil "bar"))))

I guess this is a bug in libxml2, so I reported it here:

    https://gitlab.gnome.org/GNOME/libxml2/-/issues/247

FWIW, the below diff works around this bug for me.

diff --git a/lisp/net/shr.el b/lisp/net/shr.el
index cbdeb65ba8..3eb3a5bc49 100644
--- a/lisp/net/shr.el
+++ b/lisp/net/shr.el
@@ -1485,6 +1485,12 @@ shr-tag-tt
   ;; The `tt' tag is deprecated in favor of `code'.
   (shr-tag-code dom))

+(defun shr-tag-mark (dom)
+  (shr-generic dom)
+  ;; Hack to work around bug in libxml2 (Bug#48211):
+  ;; https://gitlab.gnome.org/GNOME/libxml2/-/issues/247
+  (insert " "))
+
 (defun shr-tag-ins (cont)
   (let* ((start (point))
          (color "green")




Information forwarded to bug-gnu-emacs@HIDDEN:
bug#48211; Package emacs. Full text available.
bug Marked as found in versions 24.1. Request was from "Basil L. Contovounesios" <contovob@HIDDEN> to control <at> debbugs.gnu.org. Full text available.

Message received at 48211 <at> debbugs.gnu.org:


Received: (at 48211) by debbugs.gnu.org; 3 May 2021 23:55:13 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Mon May 03 19:55:13 2021
Received: from localhost ([127.0.0.1]:49450 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1ldiPB-0006sL-IM
	for submit <at> debbugs.gnu.org; Mon, 03 May 2021 19:55:13 -0400
Received: from mail-wm1-f46.google.com ([209.85.128.46]:41672)
 by debbugs.gnu.org with esmtp (Exim 4.84_2)
 (envelope-from <contovob@HIDDEN>) id 1ldiP9-0006s3-OG
 for 48211 <at> debbugs.gnu.org; Mon, 03 May 2021 19:55:12 -0400
Received: by mail-wm1-f46.google.com with SMTP id
 t11-20020a05600c198bb02901476e13296aso309404wmq.0
 for <48211 <at> debbugs.gnu.org>; Mon, 03 May 2021 16:55:11 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=tcd.ie; s=google21;
 h=from:to:cc:subject:references:date:in-reply-to:message-id
 :user-agent:mime-version;
 bh=ygF/jluHMwhbX2enFnBIuiuUMQRt1NyunNSNOXI9ojE=;
 b=G/UW60gD/nwTQYsxgW6jOmT3BH4nYgawtd6yfIL6Gkv3rbqBX3bLIGRhIUM3wzbt8P
 eAiIXFpONdibND1NPRzZeijqaf4NWFvy0YB25OjaSBiC7XHm3GRCLegdYicVFA7QbZU7
 XIoczsJwsdbUHiYzHnbOrGHuhnzYQvhUIyMrAuUL4Tl1r1fHyTnYJP4ArhFpPTpuZrEa
 jjixaoRB1l+dLO0XUpHFnIRMvuAk16Br67QcMUM7KRVE0ko50Dppx7q6gTPWMa6g+ZCH
 jkmk1s1aXXCGQy4o5cbGU2pvNC2d3C3sTyqe2toTAKWEa70JBcSArCJHMSGqrWcQW/Yc
 K1HQ==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
 d=1e100.net; s=20161025;
 h=x-gm-message-state:from:to:cc:subject:references:date:in-reply-to
 :message-id:user-agent:mime-version;
 bh=ygF/jluHMwhbX2enFnBIuiuUMQRt1NyunNSNOXI9ojE=;
 b=nasB2baPoH/WyrINfcY12+NBxMFzZ1TOAaTxDU+tJ2w3K9GUyO6QlNI5eaygV7CGmw
 8hNoeeTk2xn4DPfHZYln5N0+s/Ni7ASq+V1Wi64Y8XeB7A29lryPbvG8moG0Oa3N+xit
 yh89E9sqw+rA9OuFUQpu3rTVgqqBk0VSQ3+mG2XuGwQw6Kbwv9ddinCW2wv69VEqOnCz
 64pPKofrgMAYr8fG7SGCwwYClaUrMdfXjEff94fXFCQMzr0XzLPkYzJqNYUhZUmgLxvF
 Jb4A4gi2PjKbBT3OF+gg8XLMPWHntt7SvlHazaY4UnEyEE0TC8OvJqSKA06WwyPkMiSu
 m9Uw==
X-Gm-Message-State: AOAM533Kb3oArMlXlh9EnTrM7FNSBqOLv+sMdvO/QcOoUXdJotnELFFV
 JWBt1cZlCOf5g51acoswoMdNgA==
X-Google-Smtp-Source: ABdhPJwOgFcaAM1SqJAGPDVFQHTHhzSQvoeXR7d5zK50qM/4oGA1rq1uhxul/Uqe9SHPgzfknrF7cg==
X-Received: by 2002:a7b:c38d:: with SMTP id s13mr7305541wmj.115.1620086105824; 
 Mon, 03 May 2021 16:55:05 -0700 (PDT)
Received: from localhost ([64.43.159.102])
 by smtp.gmail.com with ESMTPSA id f4sm14215911wrz.33.2021.05.03.16.55.05
 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256);
 Mon, 03 May 2021 16:55:05 -0700 (PDT)
From: "Basil L. Contovounesios" <contovob@HIDDEN>
To: Stefan Kangas <stefan@HIDDEN>
Subject: Re: bug#48211: 28.0.50; eww strips whitespace between <mark> elements
References: <CADwFkm=vB2qufHVC0Q+_RMeDDC=YbgTBbs474jKO0ECY888YTA@HIDDEN>
Date: Tue, 04 May 2021 00:55:03 +0100
In-Reply-To: <CADwFkm=vB2qufHVC0Q+_RMeDDC=YbgTBbs474jKO0ECY888YTA@HIDDEN>
 (Stefan Kangas's message of "Mon, 3 May 2021 18:16:06 -0500")
Message-ID: <87y2cvl6eg.fsf@HIDDEN>
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.0.50 (gnu/linux)
MIME-Version: 1.0
Content-Type: text/plain
X-Spam-Score: -0.0 (/)
X-Debbugs-Envelope-To: 48211
Cc: Lars Ingebrigtsen <larsi@HIDDEN>, 48211 <at> debbugs.gnu.org
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -1.0 (-)

found 48211 24.1
quit

Stefan Kangas <stefan@HIDDEN> writes:

> Opening a HTML file in eww with <mark> elements strips whitespace
> between elements.

I think this is because libxml-parse-html-region specifies
HTML_PARSE_NOBLANKS:

Return CDATA sections (like <style>foo</style>) as text nodes.
3c2317e891 2010-12-06 17:59:52 +0100
https://git.sv.gnu.org/cgit/emacs.git/commit/?id=3c2317e89100833812a7194c0d9d39ae0f52cb33

-- 
Basil




Information forwarded to bug-gnu-emacs@HIDDEN:
bug#48211; Package emacs. Full text available.

Message received at submit <at> debbugs.gnu.org:


Received: (at submit) by debbugs.gnu.org; 3 May 2021 23:16:18 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Mon May 03 19:16:18 2021
Received: from localhost ([127.0.0.1]:49290 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1ldhnW-0006W4-BH
	for submit <at> debbugs.gnu.org; Mon, 03 May 2021 19:16:18 -0400
Received: from lists.gnu.org ([209.51.188.17]:56234)
 by debbugs.gnu.org with esmtp (Exim 4.84_2)
 (envelope-from <stefankangas@HIDDEN>) id 1ldhnU-0006Vy-To
 for submit <at> debbugs.gnu.org; Mon, 03 May 2021 19:16:17 -0400
Received: from eggs.gnu.org ([2001:470:142:3::10]:37472)
 by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256)
 (Exim 4.90_1) (envelope-from <stefankangas@HIDDEN>)
 id 1ldhnS-00085r-Hp
 for bug-gnu-emacs@HIDDEN; Mon, 03 May 2021 19:16:16 -0400
Received: from mail-pg1-f182.google.com ([209.85.215.182]:43759)
 by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128)
 (Exim 4.90_1) (envelope-from <stefankangas@HIDDEN>)
 id 1ldhnN-0008Cv-CG
 for bug-gnu-emacs@HIDDEN; Mon, 03 May 2021 19:16:11 -0400
Received: by mail-pg1-f182.google.com with SMTP id p12so4962118pgj.10
 for <bug-gnu-emacs@HIDDEN>; Mon, 03 May 2021 16:16:08 -0700 (PDT)
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
 d=1e100.net; s=20161025;
 h=x-gm-message-state:from:mime-version:date:message-id:subject:to
 :content-transfer-encoding;
 bh=/doHh2CKegbzQT595Nm5vpoZhuU/9f+Xrb3SSzyb5gY=;
 b=h7MsXgi467sbvPsDdSA+jEjcT5AMZ/cpD6YWNBO0Rtn6IVt64u+WrJxKSLwuZgg1ZE
 QRU5My8VzKCBA096XueC/6Q7RQ5JNSq44SIJqY9qu5FkKHqkipLPXLs3EXEvIXvV1U0r
 N0nujVakm1t23c8EsJyLVFWh12N9c2+wsGYMVqnR0x1MfehGot7XOFc7ZtNV67yblF2i
 Qyfr4luO8yOAQvw5XfSdKqYvuzWj7UA7Ih9pweTqSgcCP1yvRd/7JcVBv7WbtHPBvXcZ
 sBpyxeKAsLq4CuKQ+Qsw/sgRm6Ed9eGkLQvoMe/4y70PuoXmwOUkgs6YJ5Sq5//CUWWk
 fVEA==
X-Gm-Message-State: AOAM532a1+La7bdhehjcO9dO1b4KYdwfYxKWK30fJ3IuhV4IwedpZMeA
 XNdzN3rEKQupmG1mNvip4OpC9zG/LU98JKNVBr1Rj3Ou
X-Google-Smtp-Source: ABdhPJy82PlmXySLB/aa4tSidUctvZJV69erIXGLg2iwtpFOcrZgD2zS1qIpHGzIyysV8aKyfjZO9eQ6qXoaaiNTpaM=
X-Received: by 2002:aa7:828f:0:b029:200:6e27:8c8f with SMTP id
 s15-20020aa7828f0000b02902006e278c8fmr20898511pfm.44.1620083767085; Mon, 03
 May 2021 16:16:07 -0700 (PDT)
Received: from 753933720722 named unknown by gmailapi.google.com with
 HTTPREST; Mon, 3 May 2021 18:16:06 -0500
From: Stefan Kangas <stefan@HIDDEN>
X-Debbugs-CC: Lars Ingebrigtsen <larsi@HIDDEN>
MIME-Version: 1.0
Date: Mon, 3 May 2021 18:16:06 -0500
Message-ID: <CADwFkm=vB2qufHVC0Q+_RMeDDC=YbgTBbs474jKO0ECY888YTA@HIDDEN>
Subject: 28.0.50; eww strips whitespace between <mark> elements
To: bug-gnu-emacs@HIDDEN
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
Received-SPF: pass client-ip=209.85.215.182;
 envelope-from=stefankangas@HIDDEN; helo=mail-pg1-f182.google.com
X-Spam_score_int: 0
X-Spam_score: 0.0
X-Spam_bar: /
X-Spam_report: (0.0 / 5.0 requ) BAYES_05=-0.5, FREEMAIL_FORGED_FROMDOMAIN=0.249,
 FREEMAIL_FROM=0.001, HEADER_FROM_DIFFERENT_DOMAINS=0.25,
 RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=0.001, RCVD_IN_MSPIKE_WL=0.001,
 SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=no autolearn_force=no
X-Spam_action: no action
X-Spam-Score: -0.8 (/)
X-Debbugs-Envelope-To: submit
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -1.8 (-)

Opening a HTML file in eww with <mark> elements strips whitespace
between elements.

Steps to reproduce:

0. echo "<p><mark>foo</mark> <mark>bar</mark></p>" > /tmp/foo.html
1. emacs -Q
2. M-x eww RET file:///tmp/foo.html RET

Result is that I see, in the eww buffer:

    "foobar"

Expected result is:

    "foo bar"

For a real world example where this matters, see:

    https://dle.rae.es/palabra

In eww, I get:

  1. f. Unidadling=C3=BC=C3=ADstica, dotadageneralmentedesignificado,
  queseseparadelasdem=C3=A1smediantepausaspotencialesenlapronunciaci=C3=B3n=
yblancosenlaescritura.

In Firefox, I get:

  1. f. Unidad ling=C3=BC=C3=ADstica, dotada generalmente de significado, q=
ue se
  separa de las dem=C3=A1s mediante pausas potenciales en la pronunciaci=C3=
=B3n y
  blancos en la escritura.




Acknowledgement sent to Stefan Kangas <stefan@HIDDEN>:
New bug report received and forwarded. Copy sent to larsi@HIDDEN, bug-gnu-emacs@HIDDEN. Full text available.
Report forwarded to larsi@HIDDEN, bug-gnu-emacs@HIDDEN:
bug#48211; Package emacs. Full text available.
Please note: This is a static page, with minimal formatting, updated once a day.
Click here to see this page with the latest information and nicer formatting.
Last modified: Tue, 4 May 2021 01:00:02 UTC

GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997 nCipher Corporation Ltd, 1994-97 Ian Jackson.