Navigation Menu

Skip to content

Commit

Permalink
[ciot] (2) 
 now turns into a U+000A LINE FEED character
Browse files Browse the repository at this point in the history
git-svn-id: http://svn.whatwg.org/webapps@942 340c8d12-0b0e-0410-8428-c7bf67bfef74
  • Loading branch information
Hixie committed Jun 21, 2007
1 parent b03faf8 commit a38a682
Show file tree
Hide file tree
Showing 2 changed files with 52 additions and 55 deletions.
56 changes: 28 additions & 28 deletions index
Expand Up @@ -34779,12 +34779,12 @@ function receiver(e) {

<p>If one or more characters match the range, then take them all and
interpret the string of characters as a number (either hexadecimal or
decimal as appropriate).
decimal as appropriate).</p>

<p>If that number is in the range 128 to 159 (0x80 to 0x9F), then this is
a <a href="#parse">parse error</a>. In the following table, find the row
with that number in the first column, and return a character token for
the Unicode character given in the second column of that row.</p>
<p>If that number is one of the numbers in the first column of the
following table, then this is a <a href="#parse">parse error</a>. Find
the row with that number in the first column, and return a character
token for the Unicode character given in the second column of that row.</p>

<table>
<thead>
Expand All @@ -34794,6 +34794,13 @@ function receiver(e) {
<th colspan=2>Unicode character

<tbody>
<tr>
<td>0x0D

<td>U+000A

<td>LINE FEED (LF)

<tr>
<td>0x80

Expand Down Expand Up @@ -35051,13 +35058,6 @@ function receiver(e) {
</div>
</dl>

<p class=big-issue>It seems browsers convert CRs to LFs even as entities.
Should we also do that? If so, we should remove the CRs in the tree
construction phase.</p>
<!-- IE is not such a browser; you
can prove that by comparing id="&#xA;X" to id="&#xD;X" in terms of
getElementById('\nX') vs '\rX'. -->

<h4 id=tree-construction><span class=secno>8.2.4. </span><dfn
id=tree-construction0>Tree construction</dfn></h4>

Expand Down Expand Up @@ -35114,7 +35114,7 @@ function receiver(e) {
<dl class=switch>
<dt>A character token that <em>is</em> one of one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM
FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE

<dd>
<p><a href="#append" title="append a character">Append that character</a>
Expand Down Expand Up @@ -35411,7 +35411,7 @@ function receiver(e) {

<dt>A character token that is not one of one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM
FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE

<dt>An end-of-file token

Expand Down Expand Up @@ -35446,15 +35446,15 @@ function receiver(e) {

<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE

<dd>
<p><a href="#append" title="append a character">Append that character</a>
to the <code>Document</code> node.</p>

<dt>A character token that is <em>not</em> one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM
FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE

<dt>A start tag token

Expand Down Expand Up @@ -36090,7 +36090,7 @@ simplified explanation instead:
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE

<dd>
<p><a href="#append" title="append a character">Append the
Expand Down Expand Up @@ -36146,7 +36146,7 @@ simplified explanation instead:

<dt>A character token that is <em>not</em> one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE

<dt>Any other start tag token

Expand All @@ -36171,7 +36171,7 @@ simplified explanation instead:
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE

<dd>
<p><a href="#append" title="append a character">Append the
Expand Down Expand Up @@ -36397,7 +36397,7 @@ simplified explanation instead:

<dt>A character token that is one of one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE

<dt>A comment token

Expand Down Expand Up @@ -36426,7 +36426,7 @@ simplified explanation instead:
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE

<dd>
<p><a href="#append" title="append a character">Append the
Expand Down Expand Up @@ -37455,7 +37455,7 @@ Put the following into the MathML namespace if parsed:
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE

<dd>
<p><a href="#append" title="append a character">Append the
Expand Down Expand Up @@ -37685,7 +37685,7 @@ Put the following into the MathML namespace if parsed:
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE

<dd>
<p><a href="#append" title="append a character">Append the
Expand Down Expand Up @@ -38131,7 +38131,7 @@ Put the following into the MathML namespace if parsed:
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE

<dd>
<p>Process the token as it would be processed if the <a
Expand Down Expand Up @@ -38178,7 +38178,7 @@ Put the following into the MathML namespace if parsed:
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE

<dd>
<p><a href="#append" title="append a character">Append the
Expand Down Expand Up @@ -38247,7 +38247,7 @@ Put the following into the MathML namespace if parsed:
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE

<dd>
<p><a href="#append" title="append a character">Append the
Expand Down Expand Up @@ -38307,15 +38307,15 @@ Put the following into the MathML namespace if parsed:

<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE

<dd>
<p>Process the token as it would be processed in <a href="#the-main0">the
main phase</a>.</p>

<dt>A character token that is <em>not</em> one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM
FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE

<dt>A start tag token

Expand Down
51 changes: 24 additions & 27 deletions source
Expand Up @@ -32290,18 +32290,19 @@ function receiver(e) {

<p>If one or more characters match the range, then take them all
and interpret the string of characters as a number (either
hexadecimal or decimal as appropriate).
hexadecimal or decimal as appropriate).</p>

<p>If that number is in the range 128 to 159 (0x80 to 0x9F), then
this is a <span>parse error</span>. In the following table, find
the row with that number in the first column, and return a
character token for the Unicode character given in the second
column of that row.</p>
<p>If that number is one of the numbers in the first column of the
following table, then this is a <span>parse error</span>. Find the
row with that number in the first column, and return a character
token for the Unicode character given in the second column of that
row.</p>

<table>
<thead>
<tr><th>Number <th colspan=2>Unicode character
<tbody>
<tr><td>0x0D <td>U+000A <td>LINE FEED (LF)
<tr><td>0x80 <td>U+20AC <td>EURO SIGN ('&#x20AC')
<tr><td>0x81 <td>U+FFFD <td>REPLACEMENT CHARACTER
<tr><td>0x82 <td>U+201A <td>SINGLE LOW-9 QUOTATION MARK ('&#x201A')
Expand Down Expand Up @@ -32382,11 +32383,7 @@ function receiver(e) {

</dl>

<p class="big-issue">It seems browsers convert CRs to LFs even as
entities. Should we also do that? If so, we should remove the CRs in
the tree construction phase.</p> <!-- IE is not such a browser; you
can prove that by comparing id="&#xA;X" to id="&#xD;X" in terms of
getElementById('\nX') vs '\rX'. -->


<h4><dfn>Tree construction</dfn></h4>

Expand Down Expand Up @@ -32442,7 +32439,7 @@ function receiver(e) {

<dt>A character token that <em>is</em> one of one of U+0009
CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or
TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or
U+0020 SPACE</dt>
<dd>
<p><span title="append a character">Append that character</span>
Expand Down Expand Up @@ -32589,7 +32586,7 @@ function receiver(e) {
<dt>An end tag token</dt>
<dt>A character token that is not one of one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
<dt>An end-of-file token</dt>
<dd>

Expand Down Expand Up @@ -32627,15 +32624,15 @@ function receiver(e) {

<dt>A character token that is one of one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
<dd>
<p><span title="append a character">Append that character</span>
to the <code>Document</code> node.</p>
</dd>

<dt>A character token that is <em>not</em> one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
<dt>A start tag token</dt>
<dt>An end tag token</dt>
<dt>An end-of-file token</dt>
Expand Down Expand Up @@ -33241,7 +33238,7 @@ simplified explanation instead:

<dt>A character token that is one of one of U+0009
CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR),
TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
or U+0020 SPACE</dt>
<dd>
<p><span title="append a character">Append the character</span> to
Expand Down Expand Up @@ -33307,7 +33304,7 @@ simplified explanation instead:

<dt>A character token that is <em>not</em> one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
<dt>Any other start tag token</dt>
<dd>

Expand Down Expand Up @@ -33336,7 +33333,7 @@ simplified explanation instead:

<dt>A character token that is one of one of U+0009
CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR),
TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
or U+0020 SPACE</dt>
<dd>
<p><span title="append a character">Append the character</span> to
Expand Down Expand Up @@ -33570,7 +33567,7 @@ simplified explanation instead:

<dt>A character token that is one of one of U+0009
CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR),
TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
or U+0020 SPACE</dt>
<dt>A comment token</dt>
<dt>A start tag whose tag name is one of: "head", "link", "style"</dt>
Expand Down Expand Up @@ -33602,7 +33599,7 @@ simplified explanation instead:

<dt>A character token that is one of one of U+0009
CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR),
TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
or U+0020 SPACE</dt>
<dd>
<p><span title="append a character">Append the character</span> to
Expand Down Expand Up @@ -34694,7 +34691,7 @@ Put the following into the MathML namespace if parsed:

<dt>A character token that is one of one of U+0009
CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR),
TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
or U+0020 SPACE</dt>
<dd>
<p><span title="append a character">Append the character</span> to
Expand Down Expand Up @@ -34933,7 +34930,7 @@ Put the following into the MathML namespace if parsed:

<dt>A character token that is one of one of U+0009
CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR),
TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
or U+0020 SPACE</dt>
<dd>
<p><span title="append a character">Append the character</span> to
Expand Down Expand Up @@ -35430,7 +35427,7 @@ Put the following into the MathML namespace if parsed:

<dt>A character token that is one of one of U+0009
CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR),
TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
or U+0020 SPACE</dt>
<dd>
<p>Process the token as it would be processed if the
Expand Down Expand Up @@ -35483,7 +35480,7 @@ Put the following into the MathML namespace if parsed:

<dt>A character token that is one of one of U+0009
CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR),
TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
or U+0020 SPACE</dt>
<dd>
<p><span title="append a character">Append the character</span> to
Expand Down Expand Up @@ -35556,7 +35553,7 @@ Put the following into the MathML namespace if parsed:

<dt>A character token that is one of one of U+0009
CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR),
TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
or U+0020 SPACE</dt>
<dd>
<p><span title="append a character">Append the character</span> to
Expand Down Expand Up @@ -35624,15 +35621,15 @@ Put the following into the MathML namespace if parsed:

<dt>A character token that is one of one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
<dd>
<p>Process the token as it would be processed in <span>the main
phase</span>.</p>
</dd>

<dt>A character token that is <em>not</em> one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
<dt>A start tag token</dt>
<dt>An end tag token</dt>
<dd>
Expand Down

0 comments on commit a38a682

Please sign in to comment.