Skip to content

Commit

Permalink
[cgiowt] (2) Make 
 map to U+000D and not U+000A. This has ramifi…
Browse files Browse the repository at this point in the history
…cations throughout the parser.

Fixing http://www.w3.org/Bugs/Public/show_bug.cgi?id=9144

git-svn-id: http://svn.whatwg.org/webapps@4933 340c8d12-0b0e-0410-8428-c7bf67bfef74
  • Loading branch information
Hixie committed Apr 1, 2010
1 parent 6996947 commit de1d0bd
Show file tree
Hide file tree
Showing 3 changed files with 93 additions and 63 deletions.
52 changes: 31 additions & 21 deletions complete.html
Expand Up @@ -73323,6 +73323,11 @@ <h5 id=newlines><span class=secno>12.1.3.1 </span>Newlines</h5>
LINE FEED (LF) characters, or pairs of U+000D CARRIAGE RETURN (CR),
U+000A LINE FEED (LF) characters in that order.</p>

<p>Where <a href=#syntax-charref title=syntax-charref>character references</a>
are allowed, a character reference of a U+000A LINE FEED (LF)
character (but not a U+000D CARRIAGE RETURN (CR) character) also
represents a <a href=#syntax-newlines title=syntax-newlines>newline</a>.</p>


<h4 id=character-references><span class=secno>12.1.4 </span>Character references</h4>

Expand Down Expand Up @@ -73367,9 +73372,9 @@ <h4 id=character-references><span class=secno>12.1.4 </span>Character references
(;).</dd>

</dl><p>The numeric character reference forms described above are allowed
to reference any Unicode code point other than U+0000, permanently
undefined Unicode characters (noncharacters), and control characters
other than <a href=#space-character title="space character">space
to reference any Unicode code point other than U+0000, U+000D,
permanently undefined Unicode characters (noncharacters), and
control characters other than <a href=#space-character title="space character">space
characters</a>.</p>

<p>An <dfn id=syntax-ambiguous-ampersand title=syntax-ambiguous-ampersand>ambiguous
Expand Down Expand Up @@ -76700,7 +76705,7 @@ <h5 id=tokenizing-character-references><span class=secno>12.2.4.70 </span>Tokeni

<table><thead><tr><th>Number <th colspan=2>Unicode character
<tbody><tr><td>0x00 <td>U+FFFD <td>REPLACEMENT CHARACTER
<tr><td>0x0D <td>U+000A <td>LINE FEED (LF)
<tr><td>0x0D <td>U+000D <td>CARRIAGE RETURN (CR)
<tr><td>0x80 <td>U+20AC <td>EURO SIGN (&euro;)
<tr><td>0x81 <td>U+0081 <td>&lt;control&gt;
<tr><td>0x82 <td>U+201A <td>SINGLE LOW-9 QUOTATION MARK (&sbquo;)
Expand Down Expand Up @@ -77125,7 +77130,7 @@ <h5 id=the-initial-insertion-mode><span class=secno>12.2.5.4 </span>The "<dfn ti

<dl class=switch><dt>A character token that is one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
<dd>
<p>Ignore the token.</p>
</dd>
Expand Down Expand Up @@ -77331,7 +77336,7 @@ <h5 id=the-initial-insertion-mode><span class=secno>12.2.5.4 </span>The "<dfn ti

<dt>A character token that is one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
<dd>
<p>Ignore the token.</p>
</dd>
Expand Down Expand Up @@ -77403,7 +77408,7 @@ <h5 id=the-before-head-insertion-mode><span class=secno>12.2.5.6 </span>The "<df

<dl class=switch><dt>A character token that is one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
<dd>
<p>Ignore the token.</p> <!-- :-( -->
</dd>
Expand Down Expand Up @@ -77469,7 +77474,7 @@ <h5 id=the-before-head-insertion-mode><span class=secno>12.2.5.6 </span>The "<df

<dl class=switch><dt>A character token that is one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
<dd>
<p><a href=#insert-a-character title="insert a character">Insert the character</a> into
the <a href=#current-node>current node</a>.</p>
Expand Down Expand Up @@ -77654,7 +77659,7 @@ <h5 id=the-before-head-insertion-mode><span class=secno>12.2.5.6 </span>The "<df

<dt>A character token that is one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
<dt>A comment token</dt>
<dt>A start tag whose tag name is one of: "link", "meta", "noframes", "style"</dt>
<dd>
Expand Down Expand Up @@ -77691,7 +77696,7 @@ <h5 id=the-before-head-insertion-mode><span class=secno>12.2.5.6 </span>The "<df

<dl class=switch><dt>A character token that is one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
<dd>
<p><a href=#insert-a-character title="insert a character">Insert the character</a> into
the <a href=#current-node>current node</a>.</p>
Expand Down Expand Up @@ -77789,8 +77794,8 @@ <h5 id=the-before-head-insertion-mode><span class=secno>12.2.5.6 </span>The "<df
character</a> into the <a href=#current-node>current node</a>.</p>

<p>If the token is not one of U+0009 CHARACTER TABULATION, U+000A
LINE FEED (LF), U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN
(CR),--> or U+0020 SPACE, then set the <a href=#frameset-ok-flag>frameset-ok
LINE FEED (LF), U+000C FORM FEED (FF), U+000D CARRIAGE RETURN
(CR), or U+0020 SPACE, then set the <a href=#frameset-ok-flag>frameset-ok
flag</a> to "not ok".</p>

</dd>
Expand Down Expand Up @@ -77986,6 +77991,9 @@ <h5 id=the-before-head-insertion-mode><span class=secno>12.2.5.6 </span>The "<df
one. (Newlines at the start of <code><a href=#the-pre-element>pre</a></code> blocks are
ignored as an authoring convenience.)</p>

<!-- <pre>[CR]X will eat the [CR], <pre>&#x10;X will eat the
&#x10;, but <pre>&#x13;X will not eat the &#x13;. -->

<p>Set the <a href=#frameset-ok-flag>frameset-ok flag</a> to "not ok".</p>

</dd>
Expand Down Expand Up @@ -78722,6 +78730,8 @@ <h5 id=the-before-head-insertion-mode><span class=secno>12.2.5.6 </span>The "<df
token, then ignore that token and move on to the next
one. (Newlines at the start of <code><a href=#the-textarea-element>textarea</a></code> elements are
ignored as an authoring convenience.)</li>

<!-- see comment in <pre> start tag bit -->

<li><p>Switch the tokenizer to the <a href=#rcdata-state>RCDATA
state</a>.</li>
Expand Down Expand Up @@ -79349,7 +79359,7 @@ <h5 id=parsing-main-intabletext><span class=secno>12.2.5.13 </span>The "<dfn tit
<p>If any of the tokens in the <var><a href=#pending-table-character-tokens>pending table character
tokens</a></var> list are character tokens that are not one of U+0009
CHARACTER TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED
(FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE, then
(FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE, then
reprocess those character tokens using the rules given in the
"anything else" entry in the <a href=#parsing-main-intable title="insertion mode: in
table">in table</a>" insertion mode.</p>
Expand Down Expand Up @@ -79428,7 +79438,7 @@ <h5 id=parsing-main-intabletext><span class=secno>12.2.5.13 </span>The "<dfn tit

<dl class=switch><dt>A character token that is one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
<dd>
<p><a href=#insert-a-character title="insert a character">Insert the character</a> into
the <a href=#current-node>current node</a>.</p>
Expand Down Expand Up @@ -79974,8 +79984,8 @@ <h5 id=parsing-main-inselect><span class=secno>12.2.5.19 </span>The "<dfn title=
character</a> into the <a href=#current-node>current node</a>.</p>

<p>If the token is not one of U+0009 CHARACTER TABULATION, U+000A
LINE FEED (LF), U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN
(CR),--> or U+0020 SPACE, then set the <a href=#frameset-ok-flag>frameset-ok
LINE FEED (LF), U+000C FORM FEED (FF), U+000D CARRIAGE RETURN
(CR), or U+0020 SPACE, then set the <a href=#frameset-ok-flag>frameset-ok
flag</a> to "not ok".</p>

</dd>
Expand Down Expand Up @@ -80195,7 +80205,7 @@ <h5 id=parsing-main-inselect><span class=secno>12.2.5.19 </span>The "<dfn title=

<dl class=switch><dt>A character token that is one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
<dd>
<p>Process the token <a href=#using-the-rules-for>using the rules for</a> the "<a href=#parsing-main-inbody title="insertion mode: in body">in body</a>" <a href=#insertion-mode>insertion
mode</a>.</p>
Expand Down Expand Up @@ -80253,7 +80263,7 @@ <h5 id=parsing-main-inselect><span class=secno>12.2.5.19 </span>The "<dfn title=

<dl class=switch><dt>A character token that is one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
<dd>
<p><a href=#insert-a-character title="insert a character">Insert the character</a> into
the <a href=#current-node>current node</a>.</p>
Expand Down Expand Up @@ -80347,7 +80357,7 @@ <h5 id=parsing-main-inselect><span class=secno>12.2.5.19 </span>The "<dfn title=
<!-- due to rules in the "in frameset" mode, this can't be entered in the fragment case -->
<dl class=switch><dt>A character token that is one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
<dd>
<p><a href=#insert-a-character title="insert a character">Insert the character</a> into
the <a href=#current-node>current node</a>.</p>
Expand Down Expand Up @@ -80408,7 +80418,7 @@ <h5 id=parsing-main-inselect><span class=secno>12.2.5.19 </span>The "<dfn title=
<dt>A DOCTYPE token</dt>
<dt>A character token that is one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
<dt>A start tag whose tag name is "html"</dt>
<dd>
<p>Process the token <a href=#using-the-rules-for>using the rules for</a> the "<a href=#parsing-main-inbody title="insertion mode: in body">in body</a>" <a href=#insertion-mode>insertion
Expand Down Expand Up @@ -80442,7 +80452,7 @@ <h5 id=parsing-main-inselect><span class=secno>12.2.5.19 </span>The "<dfn title=
<dt>A DOCTYPE token</dt>
<dt>A character token that is one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
<dt>A start tag whose tag name is "html"</dt>
<dd>
<p>Process the token <a href=#using-the-rules-for>using the rules for</a> the "<a href=#parsing-main-inbody title="insertion mode: in body">in body</a>" <a href=#insertion-mode>insertion
Expand Down

0 comments on commit de1d0bd

Please sign in to comment.