Skip to content

uchardet is hot garbage #974

@coladict

Description

@coladict

Operating System and Version

Linux. NN version v0.12

Distribution

source

Description

Multiple times now I've opened up files saved with this editor as UTF-8 only to have them garbled on reading.
The encoding detection library is reporting them as IBM852.

Steps to Reproduce

  1. Save file with a word like Zürich
  2. Close editor
  3. Open file again

Debug Information

No response

Additional Details

Solution: Do not attempt to read different encodings until feature for changing encodings is implemented. Default to UTF-8.

Additional suggestion: When implementing encoding conversion, upon change, prompt user whether to reinterpret file as the new encoding or convert to it. This is something Notepad++ currently does wrong. It is a mistake to avoid.

I may get involved in this, but do not expect to have time this month.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions