PITS /
01229: IDN links are incorrectly encoded
Summary: IDN links are incorrectly encoded
Created: 2010-10-13 17:40
Status: Open
Category: Bug
From: Petko
Assigned:
Priority: 22
Version: 2.2.18
OS:
Description: Domain names containing international characters use a different standard encoding (Punycode) than the URL path (URL encoding).
www.domain.ext/Some/Path
For exemple,
- the domain
www.акцент.bg
should be encoded towww.xn--80akonzv.bg
or left as-is in the (UTF-8) page text, but it is currently incorrectly percent-encoded. - a path like
/Петко/
is correctly percent-encoded to/%D0%9F%D0%B5%D1%82%D0%BA%D0%BE/
Currently PmWiki URL-encodes the full address -- it should either apply the Punycode function to the domain part, or leave the domain part as text and let the browser figure it out. Most browsers as of 2010 are able to follow such links.
References :