GNU bug report logs - #26477
what-cursor-position should mention "U+"

Package: emacs

Reported by: 積丹尼 Dan Jacobson <jidanni <at> jidanni.org>

Date: Thu, 13 Apr 2017 11:48:01 UTC

Severity: wishlist

Tags: wontfix

Merged with 33445, 35704

Done: Lars Ingebrigtsen <larsi <at> gnus.org>

Bug is archived. No further changes may be made.

To add a comment to this bug, you must first unarchive it, by sending
a message to control AT debbugs.gnu.org, with unarchive 26477 in the body.
You can then email your comments to 26477 AT debbugs.gnu.org in the normal way.



Report forwarded to bug-gnu-emacs <at> gnu.org:
bug#26477; Package emacs. (Thu, 13 Apr 2017 11:48:01 GMT)

Acknowledgement sent to 積丹尼 Dan Jacobson <jidanni <at> jidanni.org>:
New bug report received and forwarded. Copy sent to bug-gnu-emacs <at> gnu.org. (Thu, 13 Apr 2017 11:48:02 GMT)

Message #5 received at submit <at> debbugs.gnu.org:

From: 積丹尼 Dan Jacobson <jidanni <at> jidanni.org>
To: bug-gnu-emacs <bug-gnu-emacs <at> gnu.org>
Subject: what-cursor-position should mention "U+"
Date: Thu, 13 Apr 2017 18:45:10 +0800
C-x = (translated from <return>) runs the command what-cursor-position
(found in global-map), which is an interactive compiled Lisp function
in ‘simple.el’.

says:

             position: 120538 of 121236 (99%), column: 0
            character: ○ (displayed as ○) (codepoint 9675, #o22713, #x25cb)
    preferred charset: chinese-big5-1 (Frequently used part (A141-C67E) of Big5 (Chinese traditional))
code point in charset: 0x2172
               script: symbol
               syntax: _ 	which means: symbol
             category: .:Base, c:Chinese, h:Korean, j:Japanese
             to input: type "C-x 8 RET 25cb" or "C-x 8 RET WHITE CIRCLE"
          buffer code: #xE2 #x97 #x8B
            file code: #xE2 #x97 #x8B (encoded by coding system utf-8-unix)
              display: by this font (glyph code)
    x:-eten-fixed-medium-r-normal--16-150-75-75-c-160-big5.eten-0 (#xA1B3)

Character code properties: customize what to show
  name: WHITE CIRCLE
  general-category: So (Symbol, Other)
  decomposition: (9675) ('○')

Why can't it say U+25CB WHITE CIRCLE
anywhere (except fragmented all over the place)?

$ unicode ○
U+25CB WHITE CIRCLE
UTF-8: e2 97 8b UTF-16BE: 25cb Decimal: &#9675; Octal: \022713
○
Category: So (Symbol, Other)
Unicode block: 25A0..25FF; Geometric Shapes
Bidi: ON (Other Neutrals)
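
The what-cursor-position listing and the `unicode` output above give the same character in several notations. As a minimal sketch (Python is used here only for illustration; it is not part of Emacs or of the `unicode` tool, and the helper name `notations` is hypothetical), the following checks that the decimal, octal, hexadecimal, and UTF-8 values quoted in this report all describe one code point:

```python
def notations(ch: str) -> dict:
    """Return common numeric notations for one character (hypothetical helper)."""
    cp = ord(ch)  # the Unicode code point as an integer
    return {
        "decimal": cp,                 # "codepoint 9675" in the Emacs listing
        "octal": format(cp, "o"),      # "#o22713"
        "hex": format(cp, "x"),        # "#x25cb"
        # UTF-8 bytes, as in the "buffer code" and "file code" lines
        "utf8": " ".join(f"{b:02x}" for b in ch.encode("utf-8")),
    }


info = notations("\u25cb")  # WHITE CIRCLE, the character from this report
print(info)
# {'decimal': 9675, 'octal': '22713', 'hex': '25cb', 'utf8': 'e2 97 8b'}
```

All four values are derived from the single integer 9675; the "U+25CB" spelling is just the hexadecimal form with a conventional prefix.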

P.S. The "translated from <return>" note appeared because the command
was invoked via C-x <escape> <escape> [repeat-complex-command].




Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#26477; Package emacs. (Thu, 13 Apr 2017 11:59:01 GMT)

Message #8 received at 26477 <at> debbugs.gnu.org:

From: npostavs <at> users.sourceforge.net
To: 積丹尼 Dan Jacobson <jidanni <at> jidanni.org>
Cc: 26477 <at> debbugs.gnu.org
Subject: Re: bug#26477: what-cursor-position should mention "U+"
Date: Thu, 13 Apr 2017 07:59:39 -0400
severity 26477 wishlist
quit

積丹尼 Dan Jacobson <jidanni <at> jidanni.org> writes:

>
> Why can't it say U+25CB WHITE CIRCLE
> anywhere (except fragmented all over the place)?

What is "U+", and why should we mention it?





Severity set to 'wishlist' from 'normal' Request was from npostavs <at> users.sourceforge.net to control <at> debbugs.gnu.org. (Thu, 13 Apr 2017 11:59:02 GMT)

Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#26477; Package emacs. (Thu, 13 Apr 2017 12:11:03 GMT)

Message #13 received at 26477 <at> debbugs.gnu.org:

From: 積丹尼 Dan Jacobson <jidanni <at> jidanni.org>
To: npostavs <at> users.sourceforge.net
Cc: 26477 <at> debbugs.gnu.org
Subject: Re: bug#26477: what-cursor-position should mention "U+"
Date: Thu, 13 Apr 2017 20:10:05 +0800
http://stackoverflow.com/questions/1273693/why-is-u-used-to-designate-a-unicode-code-point




Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#26477; Package emacs. (Thu, 20 Apr 2017 10:14:02 GMT)

Message #16 received at 26477 <at> debbugs.gnu.org:

From: Philipp Stephani <p.stephani2 <at> gmail.com>
To: 積丹尼 Dan Jacobson <jidanni <at> jidanni.org>, 
 26477 <at> debbugs.gnu.org
Subject: Re: bug#26477: what-cursor-position should mention "U+"
Date: Thu, 20 Apr 2017 10:12:55 +0000
積丹尼 Dan Jacobson <jidanni <at> jidanni.org> wrote on Thu, 13 Apr 2017 at 13:48:

> C-x = (translated from <return>) runs the command what-cursor-position
> (found in global-map), which is an interactive compiled Lisp function
> in ‘simple.el’.
>
> says:
>
>              position: 120538 of 121236 (99%), column: 0
>             character: ○ (displayed as ○) (codepoint 9675, #o22713, #x25cb)
>     preferred charset: chinese-big5-1 (Frequently used part (A141-C67E) of Big5 (Chinese traditional))
> code point in charset: 0x2172
>                script: symbol
>                syntax: _        which means: symbol
>              category: .:Base, c:Chinese, h:Korean, j:Japanese
>              to input: type "C-x 8 RET 25cb" or "C-x 8 RET WHITE CIRCLE"
>           buffer code: #xE2 #x97 #x8B
>             file code: #xE2 #x97 #x8B (encoded by coding system utf-8-unix)
>               display: by this font (glyph code)
>     x:-eten-fixed-medium-r-normal--16-150-75-75-c-160-big5.eten-0 (#xA1B3)
>
> Character code properties: customize what to show
>   name: WHITE CIRCLE
>   general-category: So (Symbol, Other)
>   decomposition: (9675) ('○')
>
> Why can't it say U+25CB WHITE CIRCLE
> anywhere (except fragmented all over the place)?
>

If you ask for the reason, I guess it's just legacy. I assume
`describe-char' is much older than the Unicode support in Emacs, and it was
never thoroughly redesigned (the Unicode properties are all at the bottom).
My suggestion would be to replace the "(codepoint ...)" part with the
standard code point description "(U+NNNN character name)", and either get
rid of most of the non-Unicode properties (preferred charset, code point in
charset, buffer code, file code) or move them further down. That should be
a relatively simple change in the code of `describe-char'.
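
The "(U+NNNN character name)" form suggested above can be sketched as follows. This is written in Python rather than Emacs Lisp and is not the code of `describe-char'; `code_point_label` is a hypothetical helper that only illustrates the notation: "U+", the code point in uppercase hexadecimal padded to at least four digits, then the Unicode character name.

```python
import unicodedata


def code_point_label(ch: str) -> str:
    """Format one character as 'U+NNNN NAME' (hypothetical helper)."""
    # Some characters, such as control characters, have no name in the
    # Unicode database, so a placeholder is supplied as the default.
    name = unicodedata.name(ch, "<no name>")
    # :04X pads to four hex digits; longer code points keep all their digits.
    return f"U+{ord(ch):04X} {name}"


print(code_point_label("\u25cb"))  # U+25CB WHITE CIRCLE
```

The result matches the first line printed by the `unicode' command quoted in the original report.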

Merged 26477 33445. Request was from Noam Postavsky <npostavs <at> gmail.com> to control <at> debbugs.gnu.org. (Wed, 21 Nov 2018 14:20:02 GMT)

Merged 26477 33445 35704. Request was from Glenn Morris <rgm <at> gnu.org> to control <at> debbugs.gnu.org. (Mon, 13 May 2019 00:31:02 GMT)

bug closed, send any further explanations to 26477 <at> debbugs.gnu.org and 積丹尼 Dan Jacobson <jidanni <at> jidanni.org> Request was from Lars Ingebrigtsen <larsi <at> gnus.org> to control <at> debbugs.gnu.org. (Fri, 23 Aug 2019 05:34:02 GMT)

bug archived. Request was from Debbugs Internal Request <help-debbugs <at> gnu.org> to internal_control <at> debbugs.gnu.org. (Fri, 20 Sep 2019 11:24:03 GMT)

This bug report was last modified 4 years and 191 days ago.

GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.