Ticket #471 (closed Bug: invalid)

Opened 7 years ago

Last modified 7 years ago

Email Link HTML Encoding Problem

Reported by: martin Owned by: martinkou
Priority: Normal Milestone: FCKeditor 2.5 Beta
Component: General Version:
Keywords: Confirmed Cc:

Description

When creating a link of type "E-mail", upon entering content into the subject or body field if a british pound symbol (£) is entered the resultant html for the pound symbol becomes "%C2%A3" which displays "£" when decoded.

The problem can easily be fixed by going through the source and removing "%C2" everywhere it occurs, but this then breaks the link editor.

This can be replicated on the nightly build of the editor.

Change History

comment:1 Changed 7 years ago by fredck

  • Keywords Confirmed added; link removed
  • Milestone set to FCKeditor 2.5

Confirmed with IE6 and FF2.

comment:2 Changed 7 years ago by martinkou

  • Owner set to martinkou
  • Status changed from new to assigned

comment:3 Changed 7 years ago by martinkou

  • Status changed from assigned to closed
  • Resolution set to invalid

Hi,

This is actually the correct behavior.

According to RFC3986, data in URL components should be encoded in the UTF-8 format.

Now, if you look at Section 3 of RFC3629, you'll see that for unicode code points in the range [0x80, 0x7ff], which includes the British pound symbol, the code point value would be split over two bytes.

So, according to the formulae given in the RFC, the British Pound symbol, at unicode code point 0xa3, which is 10100011 in binary, would be split into 11000010 10100011, which is 0xC2 0xA3 in hex. So the maths is correct, and thus the URL encoder in FCKeditor is doing what it is supposed to do.

Note: See TracTickets for help on using tickets.
© 2003 – 2012 CKSource – Frederico Knabben. All rights reserved. | Terms of use | Privacy policy