Re: [code] [textadept] Defining chars above 127 as non word_chars

From: Mitchell <>
Date: Wed, 19 Aug 2015 09:55:59 -0400 (EDT)

Hi Giovanni,

On Tue, 18 Aug 2015, Giovanni Salmeri wrote:

> Hello everybody.
> I am stuck with a little problem. In its default configuration,
> Scintilla/Textadept considers all the characters above 127 as word
> characters. Obviously this is not really the case: in particular the
> quotes « and » (normal in many languages) should be treated as
> punctuation characters. The problem is that the presence or absence of
> chars above 127 seems to be ignored in buffer.word_chars. If I delete
> from it the character "a", "a" becomes a non-word character; but if I
> delete string.char(171) and string.char(187) (that is « and »), nothing
> seems to change. Perhaps there is something related to encoding that I
> should know? Also the documentation of Scintilla does not explain
> more. Can anybody help me?

I would guess those characters are being inserted as UTF-8, and thus are
not actually represented as a single byte characters. Unfortunately
Scintilla/Textadept does not have the capability to specify individual
multi-byte UTF-8 characters as word or punctuation characters.


You are subscribed to
To change subscription settings, send an e-mail to
To unsubscribe, send an e-mail to
Received on Wed 19 Aug 2015 - 09:55:59 EDT

This archive was generated by hypermail 2.2.0 : Thu 20 Aug 2015 - 06:50:25 EDT