Ticket #471 (closed Bug: invalid)
Email Link HTML Encoding Problem
| Reported by: | martin | Owned by: | martinkou |
|---|---|---|---|
| Priority: | Normal | Milestone: | FCKeditor 2.5 Beta |
| Component: | General | Version: | |
| Keywords: | Confirmed | Cc: |
Description
When creating a link of type "E-mail", upon entering content into the subject or body field if a british pound symbol (£) is entered the resultant html for the pound symbol becomes "%C2%A3" which displays "£" when decoded.
The problem can easily be fixed by going through the source and removing "%C2" everywhere it occurs, but this then breaks the link editor.
This can be replicated on the nightly build of the editor.
Change History
comment:1 Changed 6 years ago by fredck
- Keywords Confirmed added; link removed
- Milestone set to FCKeditor 2.5
comment:2 Changed 6 years ago by martinkou
- Status changed from new to assigned
- Owner set to martinkou
comment:3 Changed 6 years ago by martinkou
- Status changed from assigned to closed
- Resolution set to invalid
Hi,
This is actually the correct behavior.
According to RFC3986, data in URL components should be encoded in the UTF-8 format.
Now, if you look at Section 3 of RFC3629, you'll see that for unicode code points in the range [0x80, 0x7ff], which includes the British pound symbol, the code point value would be split over two bytes.
So, according to the formulae given in the RFC, the British Pound symbol, at unicode code point 0xa3, which is 10100011 in binary, would be split into 11000010 10100011, which is 0xC2 0xA3 in hex. So the maths is correct, and thus the URL encoder in FCKeditor is doing what it is supposed to do.

Confirmed with IE6 and FF2.