HTML Standard Tracker

Short URL: http://html5.org/r/2094

SVN revision: 2094
Comment: Turns out that Zs isn't what we want; we want White_Space. (credit: w)
Time (UTC): 2008-08-21 09:46
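
The practical difference behind the log message above can be sketched in a few lines of Python. This is only an illustration, not code from the specification: the names `WHITE_SPACE`, `is_zs`, `is_white_space`, and `skip_while` are hypothetical, and the White_Space set is hard-coded on the assumption that it matches the property as listed in recent Unicode versions (the set has changed slightly across versions). The general category Zs covers only space separators, so it misses tab, line feed, and the other control and line-separator characters that carry the White_Space property.

```python
import unicodedata

# Assumption: code points with the Unicode "White_Space" property,
# hard-coded from recent Unicode versions (25 code points).
WHITE_SPACE = frozenset(
    [chr(c) for c in range(0x0009, 0x000E)]      # TAB, LF, VT, FF, CR
    + [" ", "\u0085", "\u00A0", "\u1680"]
    + [chr(c) for c in range(0x2000, 0x200B)]    # U+2000..U+200A
    + ["\u2028", "\u2029", "\u202F", "\u205F", "\u3000"]
)


def is_zs(ch):
    """True if ch is in the Unicode general category Zs (space separator)."""
    return unicodedata.category(ch) == "Zs"


def is_white_space(ch):
    """True if ch is in the hard-coded White_Space set above."""
    return ch in WHITE_SPACE


def skip_while(text, position, predicate):
    """Advance position past every leading character that satisfies
    predicate and return the new position; the skipped characters
    are discarded, as in the 'skip ...' steps of the diff."""
    while position < len(text) and predicate(text[position]):
        position += 1
    return position


sample = "\t \u00A0x"
zs_end = skip_while(sample, 0, is_zs)            # stops at the tab
ws_end = skip_while(sample, 0, is_white_space)   # skips tab, space, NBSP
```

With the sample string, the Zs-based skip does not move at all because the leading tab has general category Cc, while the White_Space-based skip stops at the `x`.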
Index: source
===================================================================
--- source	(revision 2093)
+++ source	(revision 2094)
@@ -1044,6 +1044,10 @@
   TABULATION (tab), U+000A LINE FEED (LF), U+000C FORM FEED (FF), and
   U+000D CARRIAGE RETURN (CR).</p>
 
+  <p>The <dfn title="White_Space">White_Space characters</dfn> are
+  those that have the Unicode property "White_Space". <a
+  href="#refsUNICODE">[UNICODE]</a></p>
+
   <p>Some of the micro-parsers described below follow the pattern of
   having an <var title="">input</var> variable that holds the string
   being parsed, and having a <var title="">position</var> variable
@@ -1077,10 +1081,10 @@
   <p>The step <dfn>skip whitespace</dfn> means that the user agent
   must <span>collect a sequence of characters</span> that are <span
   title="space character">space characters</span>. The step <dfn>skip
-  Zs characters</dfn> means that the user agent must <span>collect a
-  sequence of characters</span> that are in the Unicode character
-  class Zs. In both cases, the collected characters are not used. <a
-  href="#refsUNICODE">[UNICODE]</a></p>
+  White_Space characters</dfn> means that the user agent must
+  <span>collect a sequence of characters</span> that are
+  <span>White_Space</span> characters. In both cases, the collected
+  characters are not used. <a href="#refsUNICODE">[UNICODE]</a></p>
 
 
   <h4>Boolean attributes</h4>
@@ -1464,9 +1468,9 @@
    sub-algorithm in step 2.</li>
 
    <li>Starting with the character immediately after the last one
-   examined by the sub-algorithm in step 2, skip any characters in the
-   string that are in the Unicode character class Zs (this might match
-   zero characters). <a href="#refsUNICODE">[UNICODE]</a></li>
+   examined by the sub-algorithm in step 2, skip all
+   <span>White_Space</span> characters in the string (this might match
+   zero characters).</li>
 
    <li>If there are still further characters in the string, and the
    next character in the string is a <span>valid denominator
@@ -1493,9 +1497,9 @@
    sub-algorithm in step 9.</li>
 
    <li>Starting with the character immediately after the last one
-   examined by the sub-algorithm in step 9, skip any characters in the
-   string that are in the Unicode character class Zs (this might match
-   zero characters). <a href="#refsUNICODE">[UNICODE]</a></li>
+   examined by the sub-algorithm in step 9, skip all
+   <span>White_Space</span> characters in the string (this might match
+   zero characters).</li>
 
    <li>If there are still further characters in the string, and the
    next character in the string is a <span>valid denominator
@@ -2237,7 +2241,7 @@
    returned as the result of the algorithm.</p></li>
 
    <!-- LEADING WHITESPACE -->
-   <li><p>For the "in content" variant: <span>skip Zs
+   <li><p>For the "in content" variant: <span>skip White_Space
    characters</span>; for the "in attributes" variant: <span>skip
    whitespace</span>.</p></li><!-- XXX skip whitespace in attribute?
    really? -->
@@ -2331,7 +2335,7 @@
      error, with just a date. -->
 
      <!-- WHITESPACE -->
-     <li><p>For the "in content" variant: <span>skip Zs
+     <li><p>For the "in content" variant: <span>skip White_Space
      characters</span>; for the "in attributes" variant: <span>skip
      whitespace</span>.</p></li>
 
@@ -2339,7 +2343,7 @@
      LATIN CAPITAL LETTER T, then move <var title="">position</var>
      forwards one character.</p></li>
 
-     <li><p>For the "in content" variant: <span>skip Zs
+     <li><p>For the "in content" variant: <span>skip White_Space
      characters</span>; for the "in attributes" variant: <span>skip
      whitespace</span>.</p></li>
 
@@ -2440,7 +2444,7 @@
 
     <ol>
 
-     <li><p>For the "in content" variant: <span>skip Zs
+     <li><p>For the "in content" variant: <span>skip White_Space
      characters</span>; for the "in attributes" variant: <span>skip
      whitespace</span>.</p></li>
 
@@ -2541,7 +2545,7 @@
 
    </li>
 
-   <li><p>For the "in content" variant: <span>skip Zs
+   <li><p>For the "in content" variant: <span>skip White_Space
    characters</span>; for the "in attributes" variant: <span>skip
    whitespace</span>.</p></li>
 
@@ -23618,8 +23622,7 @@
 
   <p>A data cell is said to be an <dfn>empty data cell</dfn> if it
   contains no elements and its text content, if any, consists only of
-  characters in the Unicode character class Zs. <a
-  href="#refsUNICODE">[UNICODE]</a></p>
+  <span>White_Space</span> characters.</p>
 
   <p>User agents may remove <span title="empty data cell">empty data
   cells</span> when analyzing data in a <span
