Received: (at 48211) by debbugs.gnu.org; 22 Apr 2025 03:58:08 +0000 From debbugs-submit-bounces <at> debbugs.gnu.org Mon Apr 21 23:58:08 2025 Received: from localhost ([127.0.0.1]:42632 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>) id 1u74ls-0003jJ-04 for submit <at> debbugs.gnu.org; Mon, 21 Apr 2025 23:58:08 -0400 Received: from mail-ed1-x531.google.com ([2a00:1450:4864:20::531]:57536) by debbugs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.84_2) (envelope-from <stefankangas@HIDDEN>) id 1u74lo-0003iI-Ku for 48211 <at> debbugs.gnu.org; Mon, 21 Apr 2025 23:58:05 -0400 Received: by mail-ed1-x531.google.com with SMTP id 4fb4d7f45d1cf-5e677f59438so6604095a12.2 for <48211 <at> debbugs.gnu.org>; Mon, 21 Apr 2025 20:58:04 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1745294278; x=1745899078; darn=debbugs.gnu.org; h=cc:to:subject:message-id:date:mime-version:references:in-reply-to :from:from:to:cc:subject:date:message-id:reply-to; bh=Mgv6bFg5Arp8OLeU3bldb31yZZW6fDrS2i6HHc2zVcM=; b=QxT5KdDov1YhldgK5nYxCcIXodX9Nsu4vUoJK6a19Gz4RfVDhPP4/WNPaJJignmpUP M5/DsIfw5E5WIemUi7xuN8IZ668j7FFvc1ulwewlhIW4Gy1/sERc+ozw9SdMzrwTLmMn cUrYUa1mL5dsuQNH1BbP+sdQX+NQKx137toKF47CXvtqhZR8br83oiXUtt9FMcn6PlB5 V8WlfqU1SOjdOUZA7Sq3CizFm3+yP5P0/3WGQFSDbOs5AKUmYSwD/+KKNvgqZ1ug47j4 wM+8FHOsKHAwDkcZdG4TGOhVMUkjL1EOk9/9oiV38mECW0zMlVCE+FwPlFLC5dhXW5oo uHQw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1745294278; x=1745899078; h=cc:to:subject:message-id:date:mime-version:references:in-reply-to :from:x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=Mgv6bFg5Arp8OLeU3bldb31yZZW6fDrS2i6HHc2zVcM=; b=Y9lu+gJn16CUISFz3WqR5Vwo6D7oCCP/2sUwryzffRCDDtyt7Q8PIVj0wXvL7jIfLP 87uCKF8k4kx4JBqnzlcoSGG4YXE65/wY4YNnyUtOzf7+qPcQbARuQCyPVWUFxzu7UpJJ xBZjSBrMODPfeUJ/wZnokfXh2776Iobkl3cuf5peykcQJ1OTPmjGD+82lo0W/01+3vao HyPcHfzq+RZ9+jAtwZ9XeHZiuMg/CQmglU3yBxEDKg7hKQqRnOWCcqbslELhpEk5nwVZ jV0U0wQtuxCcQWXPCBcZ/BMpinfxGTBnMWWuU2Ow1u+3q535JM2R7Aa3U9Or71nWNwu9 Q/rw== X-Forwarded-Encrypted: i=1; AJvYcCUFSWakghLH+VuYeDNS44leJeDPF/po7JakkjJJQsKiV+ojTzMu4o8PchFWNkOjoyn6YPZJ7w==@debbugs.gnu.org X-Gm-Message-State: AOJu0Yzb8j36pKRyn3wuFPr541Jr9gwxUsqk0vXZrXvR5kP1X0h9o1C6 3DTF8CUgfBIBBXVYdWUZl3mVkQ+/m1mUEjIM1wTYMMjC8f/J6iQmsAxhUNotkv1NXOG/rf6JG4I Xaa19zABJ4ZzGt/D1kH4D3fymN2UcVYA0 X-Gm-Gg: ASbGncsra9ISq42nHSaOT3piVs/UD91ns9+uj60VOkYKYZ71Janr/AR/yJtx64U8egQ FGBVJ5Y+Vcwq6XlU35o2dpTaszJxqGm/aqRZYH4i7zdlpg7oGhtfB7RsE9uDTGbuzvuX67xGJjF H4Vh0lkPhaNwZoXnZBV1eVxrh49mN1qT3TSGI= X-Google-Smtp-Source: AGHT+IFY1CLvqrZx0GH2Nh4jvR1s26RnTmxRgM/rJa74FtWfiQ7AwRmhRBfuYjQPn1rkI2JHY7vFDOh53g6SabPZQ20= X-Received: by 2002:a05:6402:3547:b0:5e7:110a:c55 with SMTP id 4fb4d7f45d1cf-5f62856df81mr11254439a12.18.1745294278142; Mon, 21 Apr 2025 20:57:58 -0700 (PDT) Received: from 753933720722 named unknown by gmailapi.google.com with HTTPREST; Mon, 21 Apr 2025 23:57:57 -0400 Received: from 753933720722 named unknown by gmailapi.google.com with HTTPREST; Mon, 21 Apr 2025 23:57:57 -0400 From: Stefan Kangas <stefankangas@HIDDEN> In-Reply-To: <CADwFkmkKmqjbLuEQNJD+Z7Q8CFgc2Oia+YUEkPXgSZ7rFx5_ag@HIDDEN> References: <CADwFkm=vB2qufHVC0Q+_RMeDDC=YbgTBbs474jKO0ECY888YTA@HIDDEN> <87y2cvl6eg.fsf@HIDDEN> <CADwFkmkKmqjbLuEQNJD+Z7Q8CFgc2Oia+YUEkPXgSZ7rFx5_ag@HIDDEN> MIME-Version: 1.0 Date: Mon, 21 Apr 2025 23:57:57 -0400 X-Gm-Features: ATxdqUEKilRocVB5pdDzFXdGkHFV0ztER2JKaymOcdCfXVCmqVpYlGZBaaS9Z90 Message-ID: <CADwFkmmYCQGjzt0PNNa5TLfrhh4LvU78Qqke9+e1LUM=Fbaqww@HIDDEN> Subject: Re: bug#48211: 28.0.50; eww strips whitespace between <mark> elements To: "Basil L. Contovounesios" <contovob@HIDDEN> Content-Type: text/plain; charset="UTF-8" X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: 48211 Cc: Lars Ingebrigtsen <larsi@HIDDEN>, 48211 <at> debbugs.gnu.org X-BeenThere: debbugs-submit <at> debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: <debbugs-submit.debbugs.gnu.org> List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe> List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/> List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org> List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help> List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe> Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org> X-Spam-Score: -1.0 (-) > I guess this is a bug in libxml2, so I reported it here: > > https://gitlab.gnome.org/GNOME/libxml2/-/issues/247 Upstream says that this is fixed in 2.14: https://gitlab.gnome.org/GNOME/libxml2/-/issues/247#note_2418422
bug-gnu-emacs@HIDDEN
:bug#48211
; Package emacs
.
Full text available.Stefan Kangas <stefankangas@HIDDEN>
to control <at> debbugs.gnu.org
.
Full text available.Debbugs Internal Request <help-debbugs@HIDDEN>
to internal_control <at> debbugs.gnu.org
.
Full text available.Lars Ingebrigtsen <larsi@HIDDEN>
to control <at> debbugs.gnu.org
.
Full text available.Received: (at 48211) by debbugs.gnu.org; 1 Jul 2022 11:46:52 +0000 From debbugs-submit-bounces <at> debbugs.gnu.org Fri Jul 01 07:46:52 2022 Received: from localhost ([127.0.0.1]:36478 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>) id 1o7F6m-0006tB-RZ for submit <at> debbugs.gnu.org; Fri, 01 Jul 2022 07:46:52 -0400 Received: from quimby.gnus.org ([95.216.78.240]:54132) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from <larsi@HIDDEN>) id 1o7F6k-0006na-Vv for 48211 <at> debbugs.gnu.org; Fri, 01 Jul 2022 07:46:47 -0400 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=gnus.org; s=20200322; h=Content-Type:MIME-Version:Message-ID:Date:References: In-Reply-To:Subject:Cc:To:From:Sender:Reply-To:Content-Transfer-Encoding: Content-ID:Content-Description:Resent-Date:Resent-From:Resent-Sender: Resent-To:Resent-Cc:Resent-Message-ID:List-Id:List-Help:List-Unsubscribe: List-Subscribe:List-Post:List-Owner:List-Archive; bh=tRb3UOeqckL9i6OthShx7IRgFOmoM0uiwvrlFJl7tow=; b=R3s+W1tqhYqhDqTEQQj1hIDd55 8PvsZ+JUwSaIfHcr6f/yiNb8Fkm+Zd659C8fadr+biwP6Jm+1hLQT5PfyBH57qQTpkWC2eCzM4nlp +6Nu+i71gGF9fOjP9LniDbuYUrryRAzg1hcC83YU9urN2rH2fVderB23qBVaJHAG9v8A=; Received: from [84.212.220.105] (helo=joga) by quimby.gnus.org with esmtpsa (TLS1.3:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from <larsi@HIDDEN>) id 1o7F6a-0006bN-OD; Fri, 01 Jul 2022 13:46:39 +0200 From: Lars Ingebrigtsen <larsi@HIDDEN> To: Stefan Kangas <stefan@HIDDEN> Subject: Re: bug#48211: 28.0.50; eww strips whitespace between <mark> elements In-Reply-To: <CADwFkmkKmqjbLuEQNJD+Z7Q8CFgc2Oia+YUEkPXgSZ7rFx5_ag@HIDDEN> (Stefan Kangas's message of "Mon, 3 May 2021 19:35:35 -0500") References: <CADwFkm=vB2qufHVC0Q+_RMeDDC=YbgTBbs474jKO0ECY888YTA@HIDDEN> <87y2cvl6eg.fsf@HIDDEN> <CADwFkmkKmqjbLuEQNJD+Z7Q8CFgc2Oia+YUEkPXgSZ7rFx5_ag@HIDDEN> X-Now-Playing: Moby's _Reprise: Remixes_: "Go (Anfisa Letyago Remix)" Date: Fri, 01 Jul 2022 13:46:33 +0200 Message-ID: <87sfnl5a3q.fsf@HIDDEN> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/29.0.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-Spam-Report: Spam detection software, running on the system "quimby.gnus.org", has NOT identified this incoming email as spam. The original message has been attached to this so you can view it or label similar future email. If you have any questions, see @@CONTACT_ADDRESS@@ for details. Content preview: Stefan Kangas <stefan@HIDDEN> writes: > I guess this is a bug in libxml2, so I reported it here: > > https://gitlab.gnome.org/GNOME/libxml2/-/issues/247 [...] Content analysis details: (-2.9 points, 5.0 required) pts rule name description ---- ---------------------- -------------------------------------------------- -1.0 ALL_TRUSTED Passed through trusted hosts only via SMTP -1.9 BAYES_00 BODY: Bayes spam probability is 0 to 1% [score: 0.0000] X-Spam-Score: -2.3 (--) X-Debbugs-Envelope-To: 48211 Cc: "Basil L. Contovounesios" <contovob@HIDDEN>, 48211 <at> debbugs.gnu.org X-BeenThere: debbugs-submit <at> debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: <debbugs-submit.debbugs.gnu.org> List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe> List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/> List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org> List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help> List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe> Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org> X-Spam-Score: -3.3 (---) Stefan Kangas <stefan@HIDDEN> writes: > I guess this is a bug in libxml2, so I reported it here: > > https://gitlab.gnome.org/GNOME/libxml2/-/issues/247 [...] > +(defun shr-tag-mark (dom) > + (shr-generic dom) > + ;; Hack to work around bug in libxml2 (Bug#48211): > + ;; https://gitlab.gnome.org/GNOME/libxml2/-/issues/247 > + (insert " ")) I've now pushed a variation of this to Emacs 29, and included a face and stuff, as https://www.w3schools.com/tags/tag_mark.asp recommends. -- (domestic pets only, the antidote for overdose, milk.) bloggy blog: http://lars.ingebrigtsen.no
bug-gnu-emacs@HIDDEN
:bug#48211
; Package emacs
.
Full text available.Received: (at 48211) by debbugs.gnu.org; 4 May 2021 00:51:15 +0000 From debbugs-submit-bounces <at> debbugs.gnu.org Mon May 03 20:51:15 2021 Received: from localhost ([127.0.0.1]:49703 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>) id 1ldjHP-000162-CY for submit <at> debbugs.gnu.org; Mon, 03 May 2021 20:51:15 -0400 Received: from mail-pl1-f178.google.com ([209.85.214.178]:46702) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from <stefankangas@HIDDEN>) id 1ldjHN-00015u-74 for 48211 <at> debbugs.gnu.org; Mon, 03 May 2021 20:51:13 -0400 Received: by mail-pl1-f178.google.com with SMTP id s20so3857412plr.13 for <48211 <at> debbugs.gnu.org>; Mon, 03 May 2021 17:51:13 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:in-reply-to:references:mime-version:date :message-id:subject:to:cc:content-transfer-encoding; bh=yF4ftS2VNtPYLPylv7b6y7fLPuHnJAlC8xbcdjCog3s=; b=Rkkyxa9yK8+gLcZYJgY0wvdgxfaijw3WzAaNtCF/vvkBoYzD9lm3AoX/28TSxTa7zd oobTdfF4FkfFQ/OWuYdXMBj8DEjZl8JlVAjzGzYDWa3k4DVoJR0Rn/Lnk5I0E4K7BA0n 87jW1dOca8p5OPhpUPzeuHjGMMkPus6NbNuXZY5SKz4wNi0cQW3FXV8dBvGRgjMZJWm9 0EHhgQ8Zgh+bXWPDR4/tPNYFM047zTCs94CS5U2xUt3uWaeMO5XXg4vNtRW0S5MzX5B1 98ePo27PjWBBK2N8n7NkotyKBELpx12ewJN1FNOKSBnuf5pVQGomKNlfsBpn+GnxKRhn E9uA== X-Gm-Message-State: AOAM530d355bOTeuhyZHKjHhUWLGdbTi65SayvTJrezMsUy3+bdc8HVd MI2xTPKygTP1zMttrckhksZiyNUPz8xjVAP5u1E= X-Google-Smtp-Source: ABdhPJymChpB0l3tQs1BlJltnQzemjIujJe1lXEwXb11RugSWraMT39rVQ+kwzjJYHfvoi3YKGq5JmCLU3U5wtPvGeE= X-Received: by 2002:a17:902:24d:b029:ee:df5b:286 with SMTP id 71-20020a170902024db02900eedf5b0286mr5324720plc.39.1620089467404; Mon, 03 May 2021 17:51:07 -0700 (PDT) Received: from 753933720722 named unknown by gmailapi.google.com with HTTPREST; Mon, 3 May 2021 19:51:06 -0500 From: Stefan Kangas <stefan@HIDDEN> In-Reply-To: <CADwFkmkKmqjbLuEQNJD+Z7Q8CFgc2Oia+YUEkPXgSZ7rFx5_ag@HIDDEN> References: <CADwFkm=vB2qufHVC0Q+_RMeDDC=YbgTBbs474jKO0ECY888YTA@HIDDEN> <87y2cvl6eg.fsf@HIDDEN> <CADwFkmkKmqjbLuEQNJD+Z7Q8CFgc2Oia+YUEkPXgSZ7rFx5_ag@HIDDEN> MIME-Version: 1.0 Date: Mon, 3 May 2021 19:51:06 -0500 Message-ID: <CADwFkm=DB958HSh1qsG66bYAFH7wP-u316rz=adT6rNz2vB_Sg@HIDDEN> Subject: Re: bug#48211: 28.0.50; eww strips whitespace between <mark> elements To: "Basil L. Contovounesios" <contovob@HIDDEN> Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Spam-Score: 0.5 (/) X-Debbugs-Envelope-To: 48211 Cc: Lars Ingebrigtsen <larsi@HIDDEN>, 48211 <at> debbugs.gnu.org X-BeenThere: debbugs-submit <at> debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: <debbugs-submit.debbugs.gnu.org> List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe> List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/> List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org> List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help> List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe> Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org> X-Spam-Score: -0.5 (/) Stefan Kangas <stefan@HIDDEN> writes: > FWIW, the below diff works around this bug for me. > > diff --git a/lisp/net/shr.el b/lisp/net/shr.el > index cbdeb65ba8..3eb3a5bc49 100644 > --- a/lisp/net/shr.el > +++ b/lisp/net/shr.el > @@ -1485,6 +1485,12 @@ shr-tag-tt > ;; The `tt' tag is deprecated in favor of `code'. > (shr-tag-code dom)) > > +(defun shr-tag-mark (dom) > + (shr-generic dom) > + ;; Hack to work around bug in libxml2 (Bug#48211): > + ;; https://gitlab.gnome.org/GNOME/libxml2/-/issues/247 > + (insert " ")) > + > (defun shr-tag-ins (cont) > (let* ((start (point)) > (color "green") Well, I should moderate that statement. It doesn't exactly fix the bug as I'm now getting this instead: 1. f. Unidad ling=C3=BC=C3=ADstica , dotada generalmente de significado= , que se separa de las dem=C3=A1s mediante pausas potenciales en la pronunciaci=C3=B3n y blancos en la escritura . 2. f. Representaci=C3=B3n gr=C3=A1fica de la palabra hablada . 3. f. Facultad de hablar . IOW, whitespace is added even if the following character is punctuation...
bug-gnu-emacs@HIDDEN
:bug#48211
; Package emacs
.
Full text available.Received: (at 48211) by debbugs.gnu.org; 4 May 2021 00:35:43 +0000 From debbugs-submit-bounces <at> debbugs.gnu.org Mon May 03 20:35:43 2021 Received: from localhost ([127.0.0.1]:49617 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>) id 1ldj2N-0000wo-Jc for submit <at> debbugs.gnu.org; Mon, 03 May 2021 20:35:43 -0400 Received: from mail-pg1-f173.google.com ([209.85.215.173]:34324) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from <stefankangas@HIDDEN>) id 1ldj2M-0000wh-8y for 48211 <at> debbugs.gnu.org; Mon, 03 May 2021 20:35:42 -0400 Received: by mail-pg1-f173.google.com with SMTP id z16so5153580pga.1 for <48211 <at> debbugs.gnu.org>; Mon, 03 May 2021 17:35:42 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:in-reply-to:references:user-agent :mime-version:date:message-id:subject:to:cc; bh=mlFBIx3IaBhe7uQixLd2j9mDE6F85fKP2vS/PPkedtQ=; b=aa+6FLYaLSdSoZ3xjFN8nNWiJ3QpEQevvKM1HAUj7BV8NG0APpCVcaVhU9e+O8Xy64 7gVWnY6HGa6A8TfNLXnCja54O4HEnqGTL+0bwuUqHKsCDiZY9u5b/+WQOw5nXqXFPQn/ wkUwh6o4sTcXrtvy+6aaNxcVi1s+ohtMSG0AZOK8bzBMizt/yPkHIAnCEtw++jEiFde1 eBykucpj5YG7viX8xIyJkQTsiBL9XLmwkJoovP/YFxhP7J+xWc7Kt+QCw1jfEqothJgh 7mlt3vS7qO57DSK+ue6V3kZ8tv5zKPFubmQDD7/n9C8UCQdvHBubdcD6uTBijDJ/ZZla /JXw== X-Gm-Message-State: AOAM532XEwro65Sqx7YqUA/VbF2A9aOtuGf0gsCR/MplWNvjTyb/DBQO Zef27l2T6wziBJFJg87Rpz+o48q2NLq1Mz8YBfM= X-Google-Smtp-Source: ABdhPJzrRunWXyTuiSAGfMceq9zGy2dnFw/5pIDRwiA1mR1BAWECxah6Xxq+kMwkqo/hI8Biunm5xCeTrFpGaFhX2ro= X-Received: by 2002:aa7:8113:0:b029:278:dfa6:8ac with SMTP id b19-20020aa781130000b0290278dfa608acmr21184109pfi.57.1620088536477; Mon, 03 May 2021 17:35:36 -0700 (PDT) Received: from 753933720722 named unknown by gmailapi.google.com with HTTPREST; Mon, 3 May 2021 19:35:35 -0500 From: Stefan Kangas <stefan@HIDDEN> In-Reply-To: <87y2cvl6eg.fsf@HIDDEN> (Basil L. Contovounesios's message of "Tue, 04 May 2021 00:55:03 +0100") References: <CADwFkm=vB2qufHVC0Q+_RMeDDC=YbgTBbs474jKO0ECY888YTA@HIDDEN> <87y2cvl6eg.fsf@HIDDEN> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.0.50 (gnu/linux) MIME-Version: 1.0 Date: Mon, 3 May 2021 19:35:35 -0500 Message-ID: <CADwFkmkKmqjbLuEQNJD+Z7Q8CFgc2Oia+YUEkPXgSZ7rFx5_ag@HIDDEN> Subject: Re: bug#48211: 28.0.50; eww strips whitespace between <mark> elements To: "Basil L. Contovounesios" <contovob@HIDDEN> Content-Type: text/plain; charset="UTF-8" X-Spam-Score: 0.5 (/) X-Debbugs-Envelope-To: 48211 Cc: Lars Ingebrigtsen <larsi@HIDDEN>, 48211 <at> debbugs.gnu.org X-BeenThere: debbugs-submit <at> debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: <debbugs-submit.debbugs.gnu.org> List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe> List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/> List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org> List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help> List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe> Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org> X-Spam-Score: -0.5 (/) "Basil L. Contovounesios" <contovob@HIDDEN> writes: > I think this is because libxml-parse-html-region specifies > HTML_PARSE_NOBLANKS: > > Return CDATA sections (like <style>foo</style>) as text nodes. > 3c2317e891 2010-12-06 17:59:52 +0100 > https://git.sv.gnu.org/cgit/emacs.git/commit/?id=3c2317e89100833812a7194c0d9d39ae0f52cb33 Hmm, okay. For now, I'm seeing this issue with basically any tag that libxml2 does not already know about, e.g. "<summary>" or "<bdi>". This is what I came up with before reading Basil's reply: (with-temp-buffer (insert "<p><tt>foo</tt> <tt>bar</tt></p>") (libxml-parse-html-region (point-min) (point-max))) => (html nil (body nil (p nil (tt nil "foo") " " (tt nil "bar")))) (with-temp-buffer (insert "<p><mark>foo</mark> <mark>bar</mark></p>") (libxml-parse-html-region (point-min) (point-max))) => (html nil (body nil (p nil (mark nil "foo") (mark nil "bar")))) I guess this is a bug in libxml2, so I reported it here: https://gitlab.gnome.org/GNOME/libxml2/-/issues/247 FWIW, the below diff works around this bug for me. diff --git a/lisp/net/shr.el b/lisp/net/shr.el index cbdeb65ba8..3eb3a5bc49 100644 --- a/lisp/net/shr.el +++ b/lisp/net/shr.el @@ -1485,6 +1485,12 @@ shr-tag-tt ;; The `tt' tag is deprecated in favor of `code'. (shr-tag-code dom)) +(defun shr-tag-mark (dom) + (shr-generic dom) + ;; Hack to work around bug in libxml2 (Bug#48211): + ;; https://gitlab.gnome.org/GNOME/libxml2/-/issues/247 + (insert " ")) + (defun shr-tag-ins (cont) (let* ((start (point)) (color "green")
bug-gnu-emacs@HIDDEN
:bug#48211
; Package emacs
.
Full text available."Basil L. Contovounesios" <contovob@HIDDEN>
to control <at> debbugs.gnu.org
.
Full text available.Received: (at 48211) by debbugs.gnu.org; 3 May 2021 23:55:13 +0000 From debbugs-submit-bounces <at> debbugs.gnu.org Mon May 03 19:55:13 2021 Received: from localhost ([127.0.0.1]:49450 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>) id 1ldiPB-0006sL-IM for submit <at> debbugs.gnu.org; Mon, 03 May 2021 19:55:13 -0400 Received: from mail-wm1-f46.google.com ([209.85.128.46]:41672) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from <contovob@HIDDEN>) id 1ldiP9-0006s3-OG for 48211 <at> debbugs.gnu.org; Mon, 03 May 2021 19:55:12 -0400 Received: by mail-wm1-f46.google.com with SMTP id t11-20020a05600c198bb02901476e13296aso309404wmq.0 for <48211 <at> debbugs.gnu.org>; Mon, 03 May 2021 16:55:11 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=tcd.ie; s=google21; h=from:to:cc:subject:references:date:in-reply-to:message-id :user-agent:mime-version; bh=ygF/jluHMwhbX2enFnBIuiuUMQRt1NyunNSNOXI9ojE=; b=G/UW60gD/nwTQYsxgW6jOmT3BH4nYgawtd6yfIL6Gkv3rbqBX3bLIGRhIUM3wzbt8P eAiIXFpONdibND1NPRzZeijqaf4NWFvy0YB25OjaSBiC7XHm3GRCLegdYicVFA7QbZU7 XIoczsJwsdbUHiYzHnbOrGHuhnzYQvhUIyMrAuUL4Tl1r1fHyTnYJP4ArhFpPTpuZrEa jjixaoRB1l+dLO0XUpHFnIRMvuAk16Br67QcMUM7KRVE0ko50Dppx7q6gTPWMa6g+ZCH jkmk1s1aXXCGQy4o5cbGU2pvNC2d3C3sTyqe2toTAKWEa70JBcSArCJHMSGqrWcQW/Yc K1HQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:to:cc:subject:references:date:in-reply-to :message-id:user-agent:mime-version; bh=ygF/jluHMwhbX2enFnBIuiuUMQRt1NyunNSNOXI9ojE=; b=nasB2baPoH/WyrINfcY12+NBxMFzZ1TOAaTxDU+tJ2w3K9GUyO6QlNI5eaygV7CGmw 8hNoeeTk2xn4DPfHZYln5N0+s/Ni7ASq+V1Wi64Y8XeB7A29lryPbvG8moG0Oa3N+xit yh89E9sqw+rA9OuFUQpu3rTVgqqBk0VSQ3+mG2XuGwQw6Kbwv9ddinCW2wv69VEqOnCz 64pPKofrgMAYr8fG7SGCwwYClaUrMdfXjEff94fXFCQMzr0XzLPkYzJqNYUhZUmgLxvF Jb4A4gi2PjKbBT3OF+gg8XLMPWHntt7SvlHazaY4UnEyEE0TC8OvJqSKA06WwyPkMiSu m9Uw== X-Gm-Message-State: AOAM533Kb3oArMlXlh9EnTrM7FNSBqOLv+sMdvO/QcOoUXdJotnELFFV JWBt1cZlCOf5g51acoswoMdNgA== X-Google-Smtp-Source: ABdhPJwOgFcaAM1SqJAGPDVFQHTHhzSQvoeXR7d5zK50qM/4oGA1rq1uhxul/Uqe9SHPgzfknrF7cg== X-Received: by 2002:a7b:c38d:: with SMTP id s13mr7305541wmj.115.1620086105824; Mon, 03 May 2021 16:55:05 -0700 (PDT) Received: from localhost ([64.43.159.102]) by smtp.gmail.com with ESMTPSA id f4sm14215911wrz.33.2021.05.03.16.55.05 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 03 May 2021 16:55:05 -0700 (PDT) From: "Basil L. Contovounesios" <contovob@HIDDEN> To: Stefan Kangas <stefan@HIDDEN> Subject: Re: bug#48211: 28.0.50; eww strips whitespace between <mark> elements References: <CADwFkm=vB2qufHVC0Q+_RMeDDC=YbgTBbs474jKO0ECY888YTA@HIDDEN> Date: Tue, 04 May 2021 00:55:03 +0100 In-Reply-To: <CADwFkm=vB2qufHVC0Q+_RMeDDC=YbgTBbs474jKO0ECY888YTA@HIDDEN> (Stefan Kangas's message of "Mon, 3 May 2021 18:16:06 -0500") Message-ID: <87y2cvl6eg.fsf@HIDDEN> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.0.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-Spam-Score: -0.0 (/) X-Debbugs-Envelope-To: 48211 Cc: Lars Ingebrigtsen <larsi@HIDDEN>, 48211 <at> debbugs.gnu.org X-BeenThere: debbugs-submit <at> debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: <debbugs-submit.debbugs.gnu.org> List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe> List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/> List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org> List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help> List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe> Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org> X-Spam-Score: -1.0 (-) found 48211 24.1 quit Stefan Kangas <stefan@HIDDEN> writes: > Opening a HTML file in eww with <mark> elements strips whitespace > between elements. I think this is because libxml-parse-html-region specifies HTML_PARSE_NOBLANKS: Return CDATA sections (like <style>foo</style>) as text nodes. 3c2317e891 2010-12-06 17:59:52 +0100 https://git.sv.gnu.org/cgit/emacs.git/commit/?id=3c2317e89100833812a7194c0d9d39ae0f52cb33 -- Basil
bug-gnu-emacs@HIDDEN
:bug#48211
; Package emacs
.
Full text available.Received: (at submit) by debbugs.gnu.org; 3 May 2021 23:16:18 +0000 From debbugs-submit-bounces <at> debbugs.gnu.org Mon May 03 19:16:18 2021 Received: from localhost ([127.0.0.1]:49290 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>) id 1ldhnW-0006W4-BH for submit <at> debbugs.gnu.org; Mon, 03 May 2021 19:16:18 -0400 Received: from lists.gnu.org ([209.51.188.17]:56234) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from <stefankangas@HIDDEN>) id 1ldhnU-0006Vy-To for submit <at> debbugs.gnu.org; Mon, 03 May 2021 19:16:17 -0400 Received: from eggs.gnu.org ([2001:470:142:3::10]:37472) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from <stefankangas@HIDDEN>) id 1ldhnS-00085r-Hp for bug-gnu-emacs@HIDDEN; Mon, 03 May 2021 19:16:16 -0400 Received: from mail-pg1-f182.google.com ([209.85.215.182]:43759) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from <stefankangas@HIDDEN>) id 1ldhnN-0008Cv-CG for bug-gnu-emacs@HIDDEN; Mon, 03 May 2021 19:16:11 -0400 Received: by mail-pg1-f182.google.com with SMTP id p12so4962118pgj.10 for <bug-gnu-emacs@HIDDEN>; Mon, 03 May 2021 16:16:08 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:mime-version:date:message-id:subject:to :content-transfer-encoding; bh=/doHh2CKegbzQT595Nm5vpoZhuU/9f+Xrb3SSzyb5gY=; b=h7MsXgi467sbvPsDdSA+jEjcT5AMZ/cpD6YWNBO0Rtn6IVt64u+WrJxKSLwuZgg1ZE QRU5My8VzKCBA096XueC/6Q7RQ5JNSq44SIJqY9qu5FkKHqkipLPXLs3EXEvIXvV1U0r N0nujVakm1t23c8EsJyLVFWh12N9c2+wsGYMVqnR0x1MfehGot7XOFc7ZtNV67yblF2i Qyfr4luO8yOAQvw5XfSdKqYvuzWj7UA7Ih9pweTqSgcCP1yvRd/7JcVBv7WbtHPBvXcZ sBpyxeKAsLq4CuKQ+Qsw/sgRm6Ed9eGkLQvoMe/4y70PuoXmwOUkgs6YJ5Sq5//CUWWk fVEA== X-Gm-Message-State: AOAM532a1+La7bdhehjcO9dO1b4KYdwfYxKWK30fJ3IuhV4IwedpZMeA XNdzN3rEKQupmG1mNvip4OpC9zG/LU98JKNVBr1Rj3Ou X-Google-Smtp-Source: ABdhPJy82PlmXySLB/aa4tSidUctvZJV69erIXGLg2iwtpFOcrZgD2zS1qIpHGzIyysV8aKyfjZO9eQ6qXoaaiNTpaM= X-Received: by 2002:aa7:828f:0:b029:200:6e27:8c8f with SMTP id s15-20020aa7828f0000b02902006e278c8fmr20898511pfm.44.1620083767085; Mon, 03 May 2021 16:16:07 -0700 (PDT) Received: from 753933720722 named unknown by gmailapi.google.com with HTTPREST; Mon, 3 May 2021 18:16:06 -0500 From: Stefan Kangas <stefan@HIDDEN> X-Debbugs-CC: Lars Ingebrigtsen <larsi@HIDDEN> MIME-Version: 1.0 Date: Mon, 3 May 2021 18:16:06 -0500 Message-ID: <CADwFkm=vB2qufHVC0Q+_RMeDDC=YbgTBbs474jKO0ECY888YTA@HIDDEN> Subject: 28.0.50; eww strips whitespace between <mark> elements To: bug-gnu-emacs@HIDDEN Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Received-SPF: pass client-ip=209.85.215.182; envelope-from=stefankangas@HIDDEN; helo=mail-pg1-f182.google.com X-Spam_score_int: 0 X-Spam_score: 0.0 X-Spam_bar: / X-Spam_report: (0.0 / 5.0 requ) BAYES_05=-0.5, FREEMAIL_FORGED_FROMDOMAIN=0.249, FREEMAIL_FROM=0.001, HEADER_FROM_DIFFERENT_DOMAINS=0.25, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=0.001, RCVD_IN_MSPIKE_WL=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=no autolearn_force=no X-Spam_action: no action X-Spam-Score: -0.8 (/) X-Debbugs-Envelope-To: submit X-BeenThere: debbugs-submit <at> debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: <debbugs-submit.debbugs.gnu.org> List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe> List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/> List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org> List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help> List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe> Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org> X-Spam-Score: -1.8 (-) Opening a HTML file in eww with <mark> elements strips whitespace between elements. Steps to reproduce: 0. echo "<p><mark>foo</mark> <mark>bar</mark></p>" > /tmp/foo.html 1. emacs -Q 2. M-x eww RET file:///tmp/foo.html RET Result is that I see, in the eww buffer: "foobar" Expected result is: "foo bar" For a real world example where this matters, see: https://dle.rae.es/palabra In eww, I get: 1. f. Unidadling=C3=BC=C3=ADstica, dotadageneralmentedesignificado, queseseparadelasdem=C3=A1smediantepausaspotencialesenlapronunciaci=C3=B3n= yblancosenlaescritura. In Firefox, I get: 1. f. Unidad ling=C3=BC=C3=ADstica, dotada generalmente de significado, q= ue se separa de las dem=C3=A1s mediante pausas potenciales en la pronunciaci=C3= =B3n y blancos en la escritura.
Stefan Kangas <stefan@HIDDEN>
:larsi@HIDDEN, bug-gnu-emacs@HIDDEN
.
Full text available.larsi@HIDDEN, bug-gnu-emacs@HIDDEN
:bug#48211
; Package emacs
.
Full text available.
GNU bug tracking system
Copyright (C) 1999 Darren O. Benham,
1997 nCipher Corporation Ltd,
1994-97 Ian Jackson.