HTML Standard Tracker

Diff (omit for latest revision)
Filter

Short URL: http://html5.org/r/943

File a bug

SVNBugCommentTime (UTC)
943[Conformance Checkers] [Internet Explorer] [Opera] [Webkit] [Tools] Be explicit about what an invalid Unicode character is.2007-06-22 01:44
Index: source
===================================================================
--- source	(revision 942)
+++ source	(revision 943)
@@ -32337,13 +32337,14 @@
       <tr><td>0x9F <td>U+0178 <td>LATIN CAPITAL LETTER Y WITH DIAERESIS ('&#x0178')
     </table>
 
-    <p>Otherwise, if the number is not a valid Unicode character
-    (e.g. if the number is higher than 1114111), or if the number is
-    zero, then return a character token for the U+FFFD REPLACEMENT
+    <p>Otherwise, if the number is zero, if the number is higher than
+    0x10FFFF, or if it's one of the surrogate characters (characters
+    in the range 0xD800 to 0xDFFF), then this is a <span>parse
+    error</span>; return a character token for the U+FFFD REPLACEMENT
     CHARACTER character instead.</p>
 
     <p>Otherwise, return a character token for the Unicode character
-    whose code point is that number.
+    whose code point is that number.</p>
 
    </dd>
 

|