Skip to content

[BUG] Special Spaces Cause Incorrect URL Parsing #16856

@softmgr

Description

@softmgr

Is there an existing issue for this?

  • I have searched the existing issues

Description of the Issue

Known space types include the following:

//L' ', (Space), U+0020, 32
//L' ', (Ideographic Space), U+3000, 12288
//L' ', (No-Break Space), U+00A0, 160
//L' ', (EN SPACE), U+2002, 8194
//L' ', (EM SPACE), U+2003, 8195
//L' ', (THREE-PER-EM), U+2004, 8196
//L' ', (FOUR-PER-EM), U+2005, 8197
//L' ', (SIX-PER-EM), U+2006, 8198
//L' ', (Figure Space), U+2007, 8199
//L' ', (Punctuation Space), U+2008, 8200
//L' ', (Thin Space), U+2009, 8201
//L' ', (Hair Space), U+200A, 8202
//L'​', (Zero Width Space), U+200B, 8203
//L' ', (Narrow NBSP), U+202F, 8239
//L' ', (Medium Mathematical Space), U+205F, 8287
//L'', (ZWNBSP / BOM), U+FEFF, 65279

In addition to the regular space (U+0020), if other special space characters appear in a URL, the program will mistakenly connect them together. For example:

 https://www.github.com/ https://www.google.com/ /abc/ /xyz/

Steps To Reproduce

Enter the above URL in the editor.

Current Behavior

In the example above, Notepad++ incorrectly recognizes them as a single URL.

Expected Behavior

There are actually two URLs: https://www.github.com/ and https://www.google.com/. They are separated by special space characters. The special space after the second URL is not treated as a separator.

Debug Information

Notepad++ v8.8.1   (64-bit)
Build time : May  3 2025 - 18:41:09
Scintilla/Lexilla included : 5.5.6/5.4.4
Boost Regex included : 1_85
Path : C:\Program Files\Notepad++\notepad++.exe
Command Line : "D:\backup1\Paperasse\vélib.txt"
Admin mode : OFF
Local Conf mode : OFF
Cloud Config : OFF
Periodic Backup : ON
Placeholders : OFF
Scintilla Rendering Mode : SC_TECHNOLOGY_DIRECTWRITE (1)
Multi-instance Mode : monoInst
File Status Auto-Detection : cdEnabledNew (for current file/tab only)
Dark Mode : OFF
OS Name : Windows 11 Home (64-bit)
OS Version : 24H2
OS Build : 26100.4349
Current ANSI codepage : 1252
Plugins : 
    mimeTools (3.1)

Anything else?

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions