GNU bug report logs - #26477: what-cursor-position should mention "U+"
Report forwarded to bug-gnu-emacs <at> gnu.org: bug#26477; Package emacs. (Thu, 13 Apr 2017 11:48:01 GMT)
Acknowledgement sent to 積丹尼 Dan Jacobson <jidanni <at> jidanni.org>: New bug report received and forwarded. Copy sent to bug-gnu-emacs <at> gnu.org. (Thu, 13 Apr 2017 11:48:02 GMT)
Message #5 received at submit <at> debbugs.gnu.org (full text, mbox):
C-x = (translated from <return>) runs the command what-cursor-position
(found in global-map), which is an interactive compiled Lisp function
in ‘simple.el’.
says:
position: 120538 of 121236 (99%), column: 0
character: ○ (displayed as ○) (codepoint 9675, #o22713, #x25cb)
preferred charset: chinese-big5-1 (Frequently used part (A141-C67E) of Big5 (Chinese traditional))
code point in charset: 0x2172
script: symbol
syntax: _ which means: symbol
category: .:Base, c:Chinese, h:Korean, j:Japanese
to input: type "C-x 8 RET 25cb" or "C-x 8 RET WHITE CIRCLE"
buffer code: #xE2 #x97 #x8B
file code: #xE2 #x97 #x8B (encoded by coding system utf-8-unix)
display: by this font (glyph code)
x:-eten-fixed-medium-r-normal--16-150-75-75-c-160-big5.eten-0 (#xA1B3)
Character code properties: customize what to show
name: WHITE CIRCLE
general-category: So (Symbol, Other)
decomposition: (9675) ('○')
Why can't it say U+25CB WHITE CIRCLE
anywhere (except fragmented all over the place)?
$ unicode ○
U+25CB WHITE CIRCLE
UTF-8: e2 97 8b UTF-16BE: 25cb Decimal: ○ Octal: \022713
○
Category: So (Symbol, Other)
Unicode block: 25A0..25FF; Geometric Shapes
Bidi: ON (Other Neutrals)
P.S., "translated from return"
was due to C-x <escape> <escape> [repeat-complex-command].
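
The values spread across the C-x = output above (decimal, octal, and hex code point, UTF-8 bytes, name, general category) all describe the same character. As a rough illustration only, here is a Python sketch, not Emacs code, that recomputes them from the character itself. The function name `scattered_forms` and the dictionary keys are made up for this example.

```python
import unicodedata


def scattered_forms(ch: str) -> dict:
    """Recompute the values that the report shows on separate lines.

    Hypothetical helper for illustration; it is not part of Emacs.
    """
    cp = ord(ch)
    return {
        "decimal": cp,                      # "codepoint 9675"
        "octal": f"#o{cp:o}",               # Emacs-style octal notation
        "hex": f"#x{cp:x}",                 # Emacs-style hex notation
        # UTF-8 bytes, written like the "buffer code" line
        "utf8": " ".join(f"#x{b:02X}" for b in ch.encode("utf-8")),
        "name": unicodedata.name(ch, ""),   # empty string if unnamed
        "general_category": unicodedata.category(ch),
    }


if __name__ == "__main__":
    for key, value in scattered_forms("\u25cb").items():
        print(f"{key}: {value}")
```

For U+25CB this reproduces the figures quoted in the report: 9675, #o22713, #x25cb, and the bytes #xE2 #x97 #x8B.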
Information forwarded to bug-gnu-emacs <at> gnu.org: bug#26477; Package emacs. (Thu, 13 Apr 2017 11:59:01 GMT)
Message #8 received at 26477 <at> debbugs.gnu.org (full text, mbox):
severity 26477 wishlist
quit
積丹尼 Dan Jacobson <jidanni <at> jidanni.org> writes:
>
> Why can't it say U+25CB WHITE CIRCLE
> anywhere (except fragmented all over the place)?
What is "U+", and why should we mention it?
Severity set to 'wishlist' from 'normal'. Request was from npostavs <at> users.sourceforge.net to control <at> debbugs.gnu.org. (Thu, 13 Apr 2017 11:59:02 GMT)
Information forwarded to bug-gnu-emacs <at> gnu.org: bug#26477; Package emacs. (Thu, 13 Apr 2017 12:11:03 GMT)
Message #13 received at 26477 <at> debbugs.gnu.org (full text, mbox):
http://stackoverflow.com/questions/1273693/why-is-u-used-to-designate-a-unicode-code-point
Information forwarded to bug-gnu-emacs <at> gnu.org: bug#26477; Package emacs. (Thu, 20 Apr 2017 10:14:02 GMT)
Message #16 received at 26477 <at> debbugs.gnu.org (full text, mbox):
積丹尼 Dan Jacobson <jidanni <at> jidanni.org> wrote on Thu, 13 Apr 2017 at 13:48:
> C-x = (translated from <return>) runs the command what-cursor-position
> (found in global-map), which is an interactive compiled Lisp function
> in ‘simple.el’.
>
> says:
>
> position: 120538 of 121236 (99%), column: 0
> character: ○ (displayed as ○) (codepoint 9675, #o22713, #x25cb)
> preferred charset: chinese-big5-1 (Frequently used part (A141-C67E) of
> Big5 (Chinese traditional))
> code point in charset: 0x2172
> script: symbol
> syntax: _ which means: symbol
> category: .:Base, c:Chinese, h:Korean, j:Japanese
> to input: type "C-x 8 RET 25cb" or "C-x 8 RET WHITE CIRCLE"
> buffer code: #xE2 #x97 #x8B
> file code: #xE2 #x97 #x8B (encoded by coding system utf-8-unix)
> display: by this font (glyph code)
> x:-eten-fixed-medium-r-normal--16-150-75-75-c-160-big5.eten-0 (#xA1B3)
>
> Character code properties: customize what to show
> name: WHITE CIRCLE
> general-category: So (Symbol, Other)
> decomposition: (9675) ('○')
>
> Why can't it say U+25CB WHITE CIRCLE
> anywhere (except fragmented all over the place)?
>
If you ask for the reason, I guess it's just legacy. I assume
`describe-char' is much older than the Unicode support in Emacs, and it was
never thoroughly redesigned (the Unicode properties are all at the bottom).
My suggestion would be to replace the "(codepoint ...)" part with the
standard code point description "(U+NNNN character name)", and either get
rid of most of the non-Unicode properties (preferred charset, code point in
charset, buffer code, file code) or move them further down. That should be
a relatively simple change in the code of `describe-char'.
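
To make the proposed "(U+NNNN character name)" form concrete, here is a minimal Python sketch of the formatting. It is an illustration under stated assumptions, not the actual `describe-char' code: the helper name `u_plus` and the `<no name>` placeholder for unnamed characters are invented for this example.

```python
import unicodedata


def u_plus(ch: str) -> str:
    """Format one character as 'U+NNNN NAME'.

    Hypothetical helper; the hex part is upper case and padded to at
    least four digits, so code points above U+FFFF get five or six.
    """
    cp = ord(ch)
    # Characters without a Unicode name (e.g. control characters)
    # get a placeholder chosen for this sketch.
    name = unicodedata.name(ch, "<no name>")
    return f"U+{cp:04X} {name}"


if __name__ == "__main__":
    print(u_plus("\u25cb"))
```

With the character from the report, this yields the same string that the `unicode` command-line tool prints on its first line, "U+25CB WHITE CIRCLE".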
Merged 26477 33445. Request was from Noam Postavsky <npostavs <at> gmail.com> to control <at> debbugs.gnu.org. (Wed, 21 Nov 2018 14:20:02 GMT)
Bug closed; send any further explanations to 26477 <at> debbugs.gnu.org and 積丹尼 Dan Jacobson <jidanni <at> jidanni.org>. Request was from Lars Ingebrigtsen <larsi <at> gnus.org> to control <at> debbugs.gnu.org. (Fri, 23 Aug 2019 05:34:02 GMT)
Bug archived. Request was from Debbugs Internal Request <help-debbugs <at> gnu.org> to internal_control <at> debbugs.gnu.org. (Fri, 20 Sep 2019 11:24:03 GMT)
This bug report was last modified 4 years and 191 days ago.
GNU bug tracking system
Copyright (C) 1999 Darren O. Benham,
1997,2003 nCipher Corporation Ltd,
1994-97 Ian Jackson.