GNU bug report logs - #10576
Subject: 23.4; char class [:lower:] misses latin small letter sharp s

Previous Next

Package: emacs;

Reported by: Andreas Röhler <andreas.roehler <at> easy-emacs.de>

Date: Sun, 22 Jan 2012 13:26:02 UTC

Severity: normal

Merged with 16731

Found in version 24.3.50

Fixed in version 28.1

Done: Lars Ingebrigtsen <larsi <at> gnus.org>

Bug is archived. No further changes may be made.

To add a comment to this bug, you must first unarchive it, by sending
a message to control AT debbugs.gnu.org, with unarchive 10576 in the body.
You can then email your comments to 10576 AT debbugs.gnu.org in the normal way.

Toggle the display of automated, internal messages from the tracker.

View this report as an mbox folder, status mbox, maintainer mbox


Report forwarded to bug-gnu-emacs <at> gnu.org:
bug#10576; Package emacs. (Sun, 22 Jan 2012 13:26:02 GMT) Full text and rfc822 format available.

Acknowledgement sent to Andreas Röhler <andreas.roehler <at> easy-emacs.de>:
New bug report received and forwarded. Copy sent to bug-gnu-emacs <at> gnu.org. (Sun, 22 Jan 2012 13:26:02 GMT) Full text and rfc822 format available.

Message #5 received at submit <at> debbugs.gnu.org (full text, mbox):

From: Andreas Röhler <andreas.roehler <at> easy-emacs.de>
To: bug-gnu-emacs <at> gnu.org
Subject: Subject: 23.4; char class [:lower:] misses latin small letter sharp s
Date: Sun, 22 Jan 2012 14:24:54 +0100
Hi,

in scratch-buffer, emacs -Q,

cursor at end of line at LATIN SMALL LETTER SHARP S,

evaluating the form below results in "0", i.e. no move

(skip-chars-forward "[:lower:]")ß

AFAIU [:lower:] doesn't recognise that char

C-u C-x =
says:

        character: ß (223, #o337, #xdf)
preferred charset: iso-8859-1 (Latin-1 (ISO/IEC 8859-1))
       code point: 0xDF
           syntax: w 	which means: word
         category: .:Base, h:Korean, j:Japanese, l:Latin
      buffer code: #xC3 #x9F
        file code: #xC3 #x9F (encoded by coding system utf-8-unix)
          display: by this font (glyph code)
    xft:-unknown-DejaVu Sans 
Mono-normal-normal-normal-*-12-*-*-*-m-0-iso10646-1 (#xA1)

Character code properties: customize what to show
  name: LATIN SMALL LETTER SHARP S
  general-category: Ll (Letter, Lowercase)

There are text properties here:
  fontified            t

;;;;;;;;;;;

thanks all,

Andreas

In GNU Emacs 23.4.1 (i686-pc-linux-gnu, X toolkit, Xaw3d scroll bars)
 of 2012-01-20
Windowing system distributor `The X.Org Foundation', version 11.0.10903000
configured using `configure  'CFLAGS=-g -O''

Important settings:
  value of $LC_ALL: nil
  value of $LC_COLLATE: nil
  value of $LC_CTYPE: nil
  value of $LC_MESSAGES: nil
  value of $LC_MONETARY: nil
  value of $LC_NUMERIC: nil
  value of $LC_TIME: nil
  value of $LANG: de_DE.UTF-8
  value of $XMODIFIERS: @im=local
  locale-coding-system: utf-8-unix
  default enable-multibyte-characters: t

Major mode: Lisp Interaction

Minor modes in effect:
  tooltip-mode: t
  mouse-wheel-mode: t
  tool-bar-mode: t
  menu-bar-mode: t
  file-name-shadow-mode: t
  global-font-lock-mode: t
  font-lock-mode: t
  blink-cursor-mode: t
  auto-encryption-mode: t
  auto-compression-mode: t
  line-number-mode: t
  transient-mark-mode: t

Recent input:
( s k i p - c h a r s - f o r w a r d SPC " [ [ : l
o w e r : ] ] " ) a s d f <left> <left> <left> <left>
C-x C-e <left> <left> <left> <left> <left> <left> <left>
<left> <left> <left> <left> <left> <left> <left> <left>
<left> <left> C-d <right> <right> <right> <right> <right>
<right> <right> <right> <right> C-d <right> <right>
C-x C-e <left> <right> <backspace> <backspace> <backspace>
<backspace> ? <left> C-d ß <left> C-x C-e C-u C-c C-r
C-u C-x C-e <backspace> <right> <return> <return> <up>
<up> <up> <return> e v a l u a t i n g SPC t h e SPC
f o r m SPC b e l w o <backspace> <backspace> o w SPC
r e s u l t s SPC i n SPC " 0 " , SPC i . e . SPC <return>
<left> n o SPC m o v e <down> <down> <return> <return>
<up> <left> <left> C-x C-e M-x r e p o r t - e a c
<backspace> <backspace> m a c s - b u <tab> <retur
n>

Recent messages:
For information about GNU Emacs and the GNU system, type C-h C-a.
4 [2 times]
0
0

Load-path shadows:
None found.

Features:
(shadow sort mail-extr message sendmail regexp-opt ecomplete rfc822 mml
easymenu mml-sec password-cache mm-decode mm-bodies mm-encode mailcap
mail-parse rfc2231 rfc2047 rfc2045 qp ietf-drums mailabbrev nnheader
gnus-util netrc time-date mm-util mail-prsvr gmm-utils wid-edit
mailheader canlock sha1 hex-util hashcash mail-utils emacsbug tooltip
ediff-hook vc-hooks lisp-float-type mwheel x-win x-dnd font-setting
tool-bar dnd fontset image fringe lisp-mode register page menu-bar
rfn-eshadow timer select scroll-bar mldrag mouse jit-lock font-lock
syntax facemenu font-core frame cham georgian utf-8-lang misc-lang
vietnamese tibetan thai tai-viet lao korean japanese hebrew greek
romanian slovak czech european ethiopic indian cyrillic chinese
case-table epa-hook jka-cmpr-hook help simple abbrev loaddefs button
minibuffer faces cus-face files text-properties overlay md5 base64
format env code-pages mule custom widget hashtable-print-readable
backquote make-network-process dbusbind system-font-setting
font-render-setting x-toolkit x multi-tty emacs)




Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#10576; Package emacs. (Sun, 22 Jan 2012 14:05:02 GMT) Full text and rfc822 format available.

Message #8 received at 10576 <at> debbugs.gnu.org (full text, mbox):

From: Andreas Schwab <schwab <at> linux-m68k.org>
To: Andreas Röhler <andreas.roehler <at> easy-emacs.de>
Cc: 10576 <at> debbugs.gnu.org
Subject: Re: bug#10576: Subject: 23.4;
	char class [:lower:] misses latin small letter sharp s
Date: Sun, 22 Jan 2012 15:04:38 +0100
Andreas Röhler <andreas.roehler <at> easy-emacs.de> writes:

> evaluating the form below results in "0", i.e. no move
>
> (skip-chars-forward "[:lower:]")ß
>
> AFAIU [:lower:] doesn't recognise that char

That's because ?ß has no upper case, so lowercasep in src/buffer.h
return 0.

(Actually in Unicode, ?ß does have a capital variant, ?ẞ, but while
lowercasing ?ẞ maps to ?ß, uppercasing ?ß actually maps to "SS".)

Andreas.

-- 
Andreas Schwab, schwab <at> linux-m68k.org
GPG Key fingerprint = 58CA 54C7 6D53 942B 1756  01D3 44D5 214B 8276 4ED5
"And now for something completely different."




Merged 10576 16731. Request was from Glenn Morris <rgm <at> gnu.org> to control <at> debbugs.gnu.org. (Wed, 12 Feb 2014 17:56:02 GMT) Full text and rfc822 format available.

Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#10576; Package emacs. (Fri, 16 Jul 2021 12:33:01 GMT) Full text and rfc822 format available.

Message #13 received at 10576 <at> debbugs.gnu.org (full text, mbox):

From: Lars Ingebrigtsen <larsi <at> gnus.org>
To: Jorgen Schaefer <forcer <at> forcix.cx>
Cc: 10576 <at> debbugs.gnu.org, 16731 <at> debbugs.gnu.org
Subject: Re: bug#10576: Subject: 23.4; char class [:lower:] misses latin
 small letter sharp s
Date: Fri, 16 Jul 2021 14:32:41 +0200
Jorgen Schaefer <forcer <at> forcix.cx> writes:

> The following seems like a bug:
>
> (string-match "[[:lower:]]" "ß") => nil

This has been fixed in Emacs 28.

-- 
(domestic pets only, the antidote for overdose, milk.)
   bloggy blog: http://lars.ingebrigtsen.no




bug marked as fixed in version 28.1, send any further explanations to 10576 <at> debbugs.gnu.org and Andreas Röhler <andreas.roehler <at> easy-emacs.de> Request was from Lars Ingebrigtsen <larsi <at> gnus.org> to control <at> debbugs.gnu.org. (Fri, 16 Jul 2021 12:33:02 GMT) Full text and rfc822 format available.

bug archived. Request was from Debbugs Internal Request <help-debbugs <at> gnu.org> to internal_control <at> debbugs.gnu.org. (Sat, 14 Aug 2021 11:24:05 GMT) Full text and rfc822 format available.

This bug report was last modified 2 years and 264 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.