Re: [code] [textadept] Encoding and display

From: Michal Kottman <k0mpjut0r.att.gmail.com>
Date: Thu, 25 Jul 2019 08:45:26 +0200

On Thu, Jul 25, 2019, 3:38 AM Mitchell <m.att.foicica.com> wrote:

> Encoding detection is a very tricky thing that is hard to get right. Sure,
> Textadept could employ the help of a multi-megabyte library whose sole job
> is to identify the encoding of a chunk of text thrown at it, but that is
> not very minimalist! Anyway, we've mostly settled on UTF-8 and UTF-16 for
> most text, and ASCII and ISO-8859-1 for programming languages. That's why
> Textadept looks for them primarily.
>

Just for the record, there's an external tool called "enca" [1] that tries
to auto-detect a file's encoding using several methods, including
statistical analysis.

[1] https://linux.die.net/man/1/enca
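
To make the "look for a few encodings first" idea from the quoted message
concrete, here is a minimal Python sketch. It is not Textadept's code and not
how enca works: the function name guess_encoding, the FALLBACKS list, and the
choice to accept UTF-16 only when a byte order mark is present are all
assumptions made for illustration.

```python
import codecs

# Fallback encodings, tried in order. This list is an assumption chosen for
# illustration; it is not taken from Textadept's source.
FALLBACKS = ("ascii", "utf-8", "iso-8859-1")


def guess_encoding(data: bytes):
    """Return (encoding_name, decoded_text) for the first codec that works."""
    # A byte order mark is the strongest hint available, so a matching
    # codec is tried before the fallbacks.
    boms = (
        (codecs.BOM_UTF8, "utf-8-sig"),
        (codecs.BOM_UTF16_LE, "utf-16"),
        (codecs.BOM_UTF16_BE, "utf-16"),
    )
    candidates = [name for bom, name in boms if data.startswith(bom)]
    candidates.extend(FALLBACKS)

    for name in candidates:
        try:
            return name, data.decode(name)
        except UnicodeDecodeError:
            continue

    # ISO-8859-1 maps every byte value to a character, so the loop above
    # always returns before reaching this point.
    raise AssertionError("unreachable: iso-8859-1 accepts every byte")
```

Because ISO-8859-1 decodes any byte sequence without error, it has to come
last, and a result of "iso-8859-1" only means "nothing stricter matched".
That is the gap that statistical tools such as enca try to close.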


Received on Thu 25 Jul 2019 - 02:45:26 EDT
