Short URL: http://html5.org/r/3853
| SVN | Bug | Comment | Time (UTC) |
|---|---|---|---|---|
| 3853 | | Add note about why we strip all BOMs. | 2009-09-15 04:06 |
Index: source
===================================================================
--- source (revision 3852)
+++ source (revision 3853)
@@ -76644,6 +76644,12 @@
   <p>One leading U+FEFF BYTE ORDER MARK character must be ignored if
   any are present.</p>
 
+  <p class="note">The requirement to strip a U+FEFF BYTE ORDER MARK
+  character regardless of whether that character was used to determine
+  the byte order is a <span>willful violation</span> of Unicode,
+  motivated by a desire to increase the resilience of user agents in
+  the face of naïve transcoders.</p>
+
   <p>All U+0000 NULL characters in the input must be replaced by
   U+FFFD REPLACEMENT CHARACTERs. Any occurrences of such characters is
   a <span>parse error</span>.</p>
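The two requirements around this hunk (ignore one leading U+FEFF, replace every U+0000 with U+FFFD and treat each as a parse error) can be illustrated with a rough sketch. It assumes the input has already been decoded to a string. The function name `preprocess_input_stream` and the choice to return a parse-error count are hypothetical, made up for illustration, and are not taken from the spec.

```python
from typing import Tuple

BOM = "\ufeff"          # U+FEFF BYTE ORDER MARK
NULL = "\u0000"         # U+0000 NULL
REPLACEMENT = "\ufffd"  # U+FFFD REPLACEMENT CHARACTER


def preprocess_input_stream(text: str) -> Tuple[str, int]:
    """Hypothetical helper: return (cleaned text, number of parse errors)."""
    # Ignore at most one leading BOM, whether or not it was used to
    # determine the byte order during decoding.
    if text.startswith(BOM):
        text = text[1:]

    # Each U+0000 in the input counts as one parse error and is replaced.
    parse_errors = text.count(NULL)
    return text.replace(NULL, REPLACEMENT), parse_errors


if __name__ == "__main__":
    cleaned, errors = preprocess_input_stream("\ufeff<p>a\u0000b</p>")
    print(repr(cleaned), errors)
```

Only the first U+FEFF is dropped in this sketch, so a second BOM left behind by a transcoder would remain in the text.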