GNU bug report logs - #36070
27; feature request '(Describe Char Unidata List) to include 'kDefinition' value

Please note: This is a static page, with minimal formatting, updated once a day.
Click here to see this page with the latest information and nicer formatting.

Package: emacs; Severity: wishlist; Reported by: Van L <van@HIDDEN>; dated Mon, 3 Jun 2019 12:14:01 UTC; Maintainer for emacs is bug-gnu-emacs@HIDDEN.

Message received at 36070 <at> debbugs.gnu.org:


Received: (at 36070) by debbugs.gnu.org; 10 Jun 2019 06:17:02 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Mon Jun 10 02:17:02 2019
Received: from localhost ([127.0.0.1]:55842 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1haDc6-0002Ax-Fx
	for submit <at> debbugs.gnu.org; Mon, 10 Jun 2019 02:17:02 -0400
Received: from relay3-d.mail.gandi.net ([217.70.183.195]:42013)
 by debbugs.gnu.org with esmtp (Exim 4.84_2)
 (envelope-from <van@HIDDEN>) id 1haDc3-0002AR-BA
 for 36070 <at> debbugs.gnu.org; Mon, 10 Jun 2019 02:17:01 -0400
X-Originating-IP: 110.174.240.215
Received: from epi.local (110-174-240-215.static.tpgi.com.au [110.174.240.215])
 (Authenticated sender: van@HIDDEN)
 by relay3-d.mail.gandi.net (Postfix) with ESMTPSA id 78E9960005;
 Mon, 10 Jun 2019 06:16:49 +0000 (UTC)
Content-Type: text/plain; charset=utf-8
Mime-Version: 1.0 (Mac OS X Mail 9.3 \(3124\))
Subject: Re: bug#36070: 27;
 feature request '(Describe Char Unidata List) to include
 'kDefinition' value
From: Van L <van@HIDDEN>
In-Reply-To: <83tvd6u2rr.fsf@HIDDEN>
Date: Mon, 10 Jun 2019 16:16:45 +1000
Content-Transfer-Encoding: quoted-printable
Message-Id: <A5AF45DA-C46D-483C-8557-A4EA675ABDB6@HIDDEN>
References: <8C3E021E-FB25-4948-8E5F-1395590BAA66@HIDDEN>
 <83tvd6u2rr.fsf@HIDDEN>
To: Eli Zaretskii <eliz@HIDDEN>
X-Mailer: Apple Mail (2.3124)
X-Spam-Score: -0.2 (/)
X-Debbugs-Envelope-To: 36070
Cc: 36070 <at> debbugs.gnu.org
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -0.2 (/)


> On 4 Jun 2019, at 01:06, Eli Zaretskii <eliz@HIDDEN> wrote:
>=20
>> --8<---------------cut here---------------start------------->8---
>> Character code properties: customize what to show
>>  name: CJK IDEOGRAPH-5165
>>  general-category: Lo (Letter, Other)
>>  decomposition: (20837) ('=E5=85=A5')
>> --8<---------------cut here---------------end--------------->8---
>=20
> This comes from UnicodeData.txt, our source for the Unicode properties
> of all the characters.  We parse it into uni-*.el files as part of the
> build.
>=20
>> The Readings table, in particular, is nice to have for the =
'kDefinition'.
>>=20
>> --8<---------------cut here---------------start------------->8---
>> | Data type   | Value                    |
>> |-------------+--------------------------|
>> | kDefinition | enter, come in(to), join |
>> |             |                          |
>> --8<---------------cut here---------------end--------------->8---
>=20
> This comes from Unihan_Reading.txt, a different file that is part of
> the Unihan database.
>=20
> We don't currently have a property where to put this value, so we need
> first to extend the properties.  And then we will need to parse the
> above file and populate the property.  Patches welcome.  Bonus points
> for reviewing other properties of the Unihan DB and adding whatever is
> useful.  See UAX#38 (http://www.unicode.org/reports/tr38/), for the
> description of the properties.

Thanks for pointing this out. I definitely want to know more about the =
Unihan DB and extend the handling of this information.

-- Van





Information forwarded to bug-gnu-emacs@HIDDEN:
bug#36070; Package emacs. Full text available.

Message received at 36070 <at> debbugs.gnu.org:


Received: (at 36070) by debbugs.gnu.org; 3 Jun 2019 15:07:00 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Mon Jun 03 11:07:00 2019
Received: from localhost ([127.0.0.1]:42694 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1hXoY8-0000zT-0E
	for submit <at> debbugs.gnu.org; Mon, 03 Jun 2019 11:07:00 -0400
Received: from eggs.gnu.org ([209.51.188.92]:54805)
 by debbugs.gnu.org with esmtp (Exim 4.84_2)
 (envelope-from <eliz@HIDDEN>) id 1hXoY4-0000zC-8C
 for 36070 <at> debbugs.gnu.org; Mon, 03 Jun 2019 11:06:57 -0400
Received: from fencepost.gnu.org ([2001:470:142:3::e]:46157)
 by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from <eliz@HIDDEN>)
 id 1hXoXv-0005HA-O3; Mon, 03 Jun 2019 11:06:47 -0400
Received: from [176.228.60.248] (port=3783 helo=home-c4e4a596f7)
 by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256)
 (Exim 4.82) (envelope-from <eliz@HIDDEN>)
 id 1hXoXq-0003RK-Ee; Mon, 03 Jun 2019 11:06:43 -0400
Date: Mon, 03 Jun 2019 18:06:32 +0300
Message-Id: <83tvd6u2rr.fsf@HIDDEN>
From: Eli Zaretskii <eliz@HIDDEN>
To: Van L <van@HIDDEN>
In-reply-to: <8C3E021E-FB25-4948-8E5F-1395590BAA66@HIDDEN> (message
 from Van L on Mon, 3 Jun 2019 22:00:30 +1000)
Subject: Re: bug#36070: 27;
 feature request '(Describe Char Unidata List) to include
 'kDefinition' value
References: <8C3E021E-FB25-4948-8E5F-1395590BAA66@HIDDEN>
MIME-version: 1.0
Content-type: text/plain; charset=utf-8
Content-Transfer-Encoding: 8bit
X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic]
X-Spam-Score: -2.3 (--)
X-Debbugs-Envelope-To: 36070
Cc: 36070 <at> debbugs.gnu.org
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -3.3 (---)

> From: Van L <van@HIDDEN>
> Date: Mon, 3 Jun 2019 22:00:30 +1000
> 
> The details retrieved by 'M-x describe-char' on '入' show the following
> 
> --8<---------------cut here---------------start------------->8---
> Character code properties: customize what to show
>   name: CJK IDEOGRAPH-5165
>   general-category: Lo (Letter, Other)
>   decomposition: (20837) ('入')
> --8<---------------cut here---------------end--------------->8---

This comes from UnicodeData.txt, our source for the Unicode properties
of all the characters.  We parse it into uni-*.el files as part of the
build.

> Following the customize link to 'Describe Char Unidata List' 
> I find more information can be had from [1] .
> 
> The Readings table, in particular, is nice to have for the 'kDefinition'.
> 
> --8<---------------cut here---------------start------------->8---
> | Data type   | Value                    |
> |-------------+--------------------------|
> | kDefinition | enter, come in(to), join |
> |             |                          |
> --8<---------------cut here---------------end--------------->8---

This comes from Unihan_Reading.txt, a different file that is part of
the Unihan database.

We don't currently have a property where to put this value, so we need
first to extend the properties.  And then we will need to parse the
above file and populate the property.  Patches welcome.  Bonus points
for reviewing other properties of the Unihan DB and adding whatever is
useful.  See UAX#38 (http://www.unicode.org/reports/tr38/), for the
description of the properties.

Thanks.




Information forwarded to bug-gnu-emacs@HIDDEN:
bug#36070; Package emacs. Full text available.

Message received at submit <at> debbugs.gnu.org:


Received: (at submit) by debbugs.gnu.org; 3 Jun 2019 12:13:53 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Mon Jun 03 08:13:53 2019
Received: from localhost ([127.0.0.1]:41766 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1hXlqb-0000pd-78
	for submit <at> debbugs.gnu.org; Mon, 03 Jun 2019 08:13:53 -0400
Received: from eggs.gnu.org ([209.51.188.92]:45807)
 by debbugs.gnu.org with esmtp (Exim 4.84_2)
 (envelope-from <van@HIDDEN>) id 1hXlqY-0000pO-7X
 for submit <at> debbugs.gnu.org; Mon, 03 Jun 2019 08:13:51 -0400
Received: from lists.gnu.org ([209.51.188.17]:41654)
 by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_256_CBC_SHA1:32)
 (Exim 4.71) (envelope-from <van@HIDDEN>) id 1hXlqT-0002PX-4I
 for submit <at> debbugs.gnu.org; Mon, 03 Jun 2019 08:13:45 -0400
Received: from eggs.gnu.org ([209.51.188.92]:59696)
 by lists.gnu.org with esmtp (Exim 4.71)
 (envelope-from <van@HIDDEN>) id 1hXlqR-00063n-M8
 for bug-gnu-emacs@HIDDEN; Mon, 03 Jun 2019 08:13:44 -0400
X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on eggs.gnu.org
X-Spam-Level: 
X-Spam-Status: No, score=0.1 required=5.0 tests=BAYES_50,RCVD_IN_DNSWL_LOW,
 URIBL_BLOCKED autolearn=disabled version=3.3.2
Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71)
 (envelope-from <van@HIDDEN>) id 1hXldu-0002oo-W7
 for bug-gnu-emacs@HIDDEN; Mon, 03 Jun 2019 08:00:48 -0400
Received: from relay7-d.mail.gandi.net ([217.70.183.200]:59601)
 by eggs.gnu.org with esmtps (TLS1.0:DHE_RSA_AES_256_CBC_SHA1:32)
 (Exim 4.71) (envelope-from <van@HIDDEN>) id 1hXldu-0002hC-PX
 for bug-gnu-emacs@HIDDEN; Mon, 03 Jun 2019 08:00:46 -0400
X-Originating-IP: 60.242.3.49
Received: from epi.local (60-242-3-49.tpgi.com.au [60.242.3.49])
 (Authenticated sender: van@HIDDEN)
 by relay7-d.mail.gandi.net (Postfix) with ESMTPSA id 32BE72000D
 for <bug-gnu-emacs@HIDDEN>; Mon,  3 Jun 2019 12:00:35 +0000 (UTC)
From: Van L <van@HIDDEN>
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: quoted-printable
Subject: 27; feature request '(Describe Char Unidata List) to include
 'kDefinition' value
Message-Id: <8C3E021E-FB25-4948-8E5F-1395590BAA66@HIDDEN>
Date: Mon, 3 Jun 2019 22:00:30 +1000
To: bug-gnu-emacs@HIDDEN
Mime-Version: 1.0 (Mac OS X Mail 9.3 \(3124\))
X-Mailer: Apple Mail (2.3124)
X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic]
X-Received-From: 217.70.183.200
X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6.x
X-Spam-Score: -1.1 (-)
X-Debbugs-Envelope-To: submit
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -2.1 (--)

Hello Emacs,

The details retrieved by 'M-x describe-char' on '=E5=85=A5' show the =
following

--8<---------------cut here---------------start------------->8---
Character code properties: customize what to show
  name: CJK IDEOGRAPH-5165
  general-category: Lo (Letter, Other)
  decomposition: (20837) ('=E5=85=A5')
--8<---------------cut here---------------end--------------->8---

Following the customize link to 'Describe Char Unidata List'=20
I find more information can be had from [1] .

The Readings table, in particular, is nice to have for the =
'kDefinition'.

--8<---------------cut here---------------start------------->8---
| Data type   | Value                    |
|-------------+--------------------------|
| kDefinition | enter, come in(to), join |
|             |                          |
--8<---------------cut here---------------end--------------->8---

WDYT? Thanks in advance.

[1] https://www.unicode.org/cgi-bin/GetUnihanData.pl?codepoint=3D%E5%85%A5=






Acknowledgement sent to Van L <van@HIDDEN>:
New bug report received and forwarded. Copy sent to bug-gnu-emacs@HIDDEN. Full text available.
Report forwarded to bug-gnu-emacs@HIDDEN:
bug#36070; Package emacs. Full text available.
Please note: This is a static page, with minimal formatting, updated once a day.
Click here to see this page with the latest information and nicer formatting.
Last modified: Mon, 25 Nov 2019 12:00:02 UTC

GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997 nCipher Corporation Ltd, 1994-97 Ian Jackson.