Opened 18 years ago
Closed 18 years ago
#471 closed Bug (invalid)
Email Link HTML Encoding Problem
Reported by: | martin | Owned by: | Martin Kou |
---|---|---|---|
Priority: | Normal | Milestone: | FCKeditor 2.5 Beta |
Component: | General | Version: | |
Keywords: | Confirmed | Cc: |
Description
When creating a link of type "E-mail", upon entering content into the subject or body field if a british pound symbol (£) is entered the resultant html for the pound symbol becomes "%C2%A3" which displays "£" when decoded.
The problem can easily be fixed by going through the source and removing "%C2" everywhere it occurs, but this then breaks the link editor.
This can be replicated on the nightly build of the editor.
Change History (3)
comment:1 Changed 18 years ago by
Keywords: | Confirmed added; link removed |
---|---|
Milestone: | → FCKeditor 2.5 |
comment:2 Changed 18 years ago by
Owner: | set to Martin Kou |
---|---|
Status: | new → assigned |
comment:3 Changed 18 years ago by
Resolution: | → invalid |
---|---|
Status: | assigned → closed |
Hi,
This is actually the correct behavior.
According to RFC3986, data in URL components should be encoded in the UTF-8 format.
Now, if you look at Section 3 of RFC3629, you'll see that for unicode code points in the range [0x80, 0x7ff], which includes the British pound symbol, the code point value would be split over two bytes.
So, according to the formulae given in the RFC, the British Pound symbol, at unicode code point 0xa3, which is 10100011 in binary, would be split into 11000010 10100011, which is 0xC2 0xA3 in hex. So the maths is correct, and thus the URL encoder in FCKeditor is doing what it is supposed to do.
Confirmed with IE6 and FF2.