Short URL: http://html5.org/r/2094
| SVN | Bug | Comment | Time (UTC) |
|---|---|---|---|
| 2094 | Turns out that Zs isn't what we want; we want White_Space. (credit: w) | 2008-08-21 09:46 |
Index: source
===================================================================
--- source (revision 2093)
+++ source (revision 2094)
@@ -1044,6 +1044,10 @@
TABULATION (tab), U+000A LINE FEED (LF), U+000C FORM FEED (FF), and
U+000D CARRIAGE RETURN (CR).</p>
+ <p>The <dfn title="White_Space">White_Space characters</dfn> are
+ those that have the Unicode property "White_Space". <a
+ href="#refsUNICODE">[UNICODE]</a></p>
+
<p>Some of the micro-parsers described below follow the pattern of
having an <var title="">input</var> variable that holds the string
being parsed, and having a <var title="">position</var> variable
@@ -1077,10 +1081,10 @@
<p>The step <dfn>skip whitespace</dfn> means that the user agent
must <span>collect a sequence of characters</span> that are <span
title="space character">space characters</span>. The step <dfn>skip
- Zs characters</dfn> means that the user agent must <span>collect a
- sequence of characters</span> that are in the Unicode character
- class Zs. In both cases, the collected characters are not used. <a
- href="#refsUNICODE">[UNICODE]</a></p>
+ White_Space characters</dfn> means that the user agent must
+ <span>collect a sequence of characters</span> that are
+ <span>White_Space</span> characters. In both cases, the collected
+ characters are not used. <a href="#refsUNICODE">[UNICODE]</a></p>
<h4>Boolean attributes</h4>
@@ -1464,9 +1468,9 @@
sub-algorithm in step 2.</li>
<li>Starting with the character immediately after the last one
- examined by the sub-algorithm in step 2, skip any characters in the
- string that are in the Unicode character class Zs (this might match
- zero characters). <a href="#refsUNICODE">[UNICODE]</a></li>
+ examined by the sub-algorithm in step 2, skip all
+ <span>White_Space</span> characters in the string (this might match
+ zero characters).</li>
<li>If there are still further characters in the string, and the
next character in the string is a <span>valid denominator
@@ -1493,9 +1497,9 @@
sub-algorithm in step 9.</li>
<li>Starting with the character immediately after the last one
- examined by the sub-algorithm in step 9, skip any characters in the
- string that are in the Unicode character class Zs (this might match
- zero characters). <a href="#refsUNICODE">[UNICODE]</a></li>
+ examined by the sub-algorithm in step 9, skip all
+ <span>White_Space</span> characters in the string (this might match
+ zero characters).</li>
<li>If there are still further characters in the string, and the
next character in the string is a <span>valid denominator
@@ -2237,7 +2241,7 @@
returned as the result of the algorithm.</p></li>
<!-- LEADING WHITESPACE -->
- <li><p>For the "in content" variant: <span>skip Zs
+ <li><p>For the "in content" variant: <span>skip White_Space
characters</span>; for the "in attributes" variant: <span>skip
whitespace</span>.</p></li><!-- XXX skip whitespace in attribute?
really? -->
@@ -2331,7 +2335,7 @@
error, with just a date. -->
<!-- WHITESPACE -->
- <li><p>For the "in content" variant: <span>skip Zs
+ <li><p>For the "in content" variant: <span>skip White_Space
characters</span>; for the "in attributes" variant: <span>skip
whitespace</span>.</p></li>
@@ -2339,7 +2343,7 @@
LATIN CAPITAL LETTER T, then move <var title="">position</var>
forwards one character.</p></li>
- <li><p>For the "in content" variant: <span>skip Zs
+ <li><p>For the "in content" variant: <span>skip White_Space
characters</span>; for the "in attributes" variant: <span>skip
whitespace</span>.</p></li>
@@ -2440,7 +2444,7 @@
<ol>
- <li><p>For the "in content" variant: <span>skip Zs
+ <li><p>For the "in content" variant: <span>skip White_Space
characters</span>; for the "in attributes" variant: <span>skip
whitespace</span>.</p></li>
@@ -2541,7 +2545,7 @@
</li>
- <li><p>For the "in content" variant: <span>skip Zs
+ <li><p>For the "in content" variant: <span>skip White_Space
characters</span>; for the "in attributes" variant: <span>skip
whitespace</span>.</p></li>
@@ -23618,8 +23622,7 @@
<p>A data cell is said to be an <dfn>empty data cell</dfn> if it
contains no elements and its text content, if any, consists only of
- characters in the Unicode character class Zs. <a
- href="#refsUNICODE">[UNICODE]</a></p>
+ <span>White_Space</span> characters.</p>
<p>User agents may remove <span title="empty data cell">empty data
cells</span> when analyzing data in a <span