Commit

[giow] (2) Add some more locales to the default encoding logic.
Fixing https://www.w3.org/Bugs/Public/show_bug.cgi?id=23089
Affected topics: HTML Syntax and Parsing

git-svn-id: http://svn.whatwg.org/webapps@8258 340c8d12-0b0e-0410-8428-c7bf67bfef74
Hixie committed Nov 6, 2013
1 parent 9034dab commit f6e22a9
Showing 3 changed files with 119 additions and 15 deletions.
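
The rows added in this commit all map a language code to windows-1251. A minimal Python sketch of how a consumer of the table might look them up follows. The names `ADDED_IN_COMMIT` and `default_encoding` are hypothetical, and both the matching on the primary language subtag and the windows-1252 fallback for unlisted locales are assumptions of this sketch, not something the diff itself states.

```python
# Language codes added to the default-encoding table by this commit.
# Every one of them maps to windows-1251 (per bug 23089).
ADDED_IN_COMMIT = {
    "ba": "windows-1251",   # Bashkir
    "be": "windows-1251",   # Belarusian
    "kk": "windows-1251",   # Kazakh
    "ky": "windows-1251",   # Kyrgyz
    "mk": "windows-1251",   # Macedonian
    "sah": "windows-1251",  # Yakut
    "tg": "windows-1251",   # Tajik
    "tt": "windows-1251",   # Tatar
}


def default_encoding(locale: str, fallback: str = "windows-1252") -> str:
    """Return a suggested default encoding for a locale tag.

    Assumption: only the primary language subtag is consulted, so
    "be-BY" is treated like "be". The fallback value is also an
    assumption of this sketch.
    """
    primary = locale.replace("_", "-").split("-", 1)[0].lower()
    return ADDED_IN_COMMIT.get(primary, fallback)


if __name__ == "__main__":
    for tag in ("be", "sah-RU", "tg-Cyrl-TJ", "xx"):
        print(tag, default_encoding(tag))
```

Only the eight codes touched by this commit are included; the rest of the table is left out on purpose, because the diff shows it only in fragments.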
42 changes: 37 additions & 5 deletions complete.html
@@ -84382,8 +84382,16 @@ <h5 id=determining-the-character-encoding><span class=secno>12.2.2.2 </span>Dete
<!-- az-Cyrl-AZ, Azeri (Cyrillic, Azerbaijan), is not listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->

<!-- ba-RU, Bashkir (Russia), is not listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- ba wasn't listed at all because none of the sources knew about it. However, further feedback has changed this: -->
<tr><td>ba
<td>Bashkir
<td>windows-1251 <!-- per https://www.w3.org/Bugs/Public/show_bug.cgi?id=23089 -->

<!-- be, Belarusian, is not listed here because Windows Vista wanted windows-1251, Chrome wanted <none>, and Firefox wanted ISO-8859-5 -->
<!-- be, Belarusian, was not initially listed here because Windows Vista wanted windows-1251, Chrome wanted <none>, and Firefox wanted ISO-8859-5 -->
<!-- further feedback has changed this: -->
<tr><td>be
<td>Belarusian
<td>windows-1251 <!-- per https://www.w3.org/Bugs/Public/show_bug.cgi?id=23089 -->

<!-- be-BY, Belarusian (Belarus), is not listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->

@@ -84481,7 +84489,11 @@ <h5 id=determining-the-character-encoding><span class=secno>12.2.2.2 </span>Dete
<td>Japanese
<td>Shift_JIS <!-- Windows Vista, Chrome, and Firefox agreed -->

<!-- kk, Kazakh, is not listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- kk, Kazakh, was not initially listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- further feedback has changed this: -->
<tr><td>kk
<td>Kazakh
<td>windows-1251 <!-- per https://www.w3.org/Bugs/Public/show_bug.cgi?id=23089 -->

<!-- kl-GL, Greenlandic (Greenland), uses windows-1252: Windows Vista and Firefox agreed -->

@@ -84495,7 +84507,11 @@ <h5 id=determining-the-character-encoding><span class=secno>12.2.2.2 </span>Dete
<td>Kurdish
<td>windows-1254 <!-- Best guess -->

<!-- ky, Kyrgyz, is not listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- ky, Kyrgyz, was not initially listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- further feedback has changed this: -->
<tr><td>ky
<td>Kyrgyz
<td>windows-1251 <!-- per https://www.w3.org/Bugs/Public/show_bug.cgi?id=23089 -->

<!-- lb-LU, Luxembourgish (Luxembourg), uses windows-1252: Windows Vista and Firefox agreed -->

@@ -84507,7 +84523,11 @@ <h5 id=determining-the-character-encoding><span class=secno>12.2.2.2 </span>Dete
<td>Latvian
<td>windows-1257 <!-- Windows Vista and Chrome agreed (but disagreed with Firefox, which thought the encoding should be ISO-8859-13) -->

<!-- mk, Macedonian, is not listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- mk, Macedonian, was not initially listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- further feedback has changed this: -->
<tr><td>mk
<td>Macedonian
<td>windows-1251 <!-- per https://www.w3.org/Bugs/Public/show_bug.cgi?id=23089 -->

<!-- ml, Malayalam, uses windows-1252: Firefox and Chrome agreed -->

@@ -84562,6 +84582,10 @@ <h5 id=determining-the-character-encoding><span class=secno>12.2.2.2 </span>Dete
<!-- rw-RW, Kinyarwanda (Rwanda), uses windows-1252: Windows Vista and Firefox agreed -->

<!-- sah-RU, Yakut (Russia), is not listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- sah wasn't listed at all because none of the sources knew about it. However, further feedback has changed this: -->
<tr><td>sah
<td>Yakut
<td>windows-1251 <!-- per https://www.w3.org/Bugs/Public/show_bug.cgi?id=23089 -->

<!-- se-FI, Sami, Northern (Finland), uses windows-1252: Windows Vista and Firefox agreed -->

@@ -84610,6 +84634,10 @@ <h5 id=determining-the-character-encoding><span class=secno>12.2.2.2 </span>Dete
<!-- te, Telugu, uses windows-1252: Firefox and Chrome agreed -->

<!-- tg-Cyrl-TJ, Tajik (Cyrillic, Tajikistan), is not listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- tg wasn't listed at all because none of the sources knew about it. However, further feedback has changed this: -->
<tr><td>tg
<td>Tajik
<td>windows-1251 <!-- per https://www.w3.org/Bugs/Public/show_bug.cgi?id=23089 -->

<tr><td>th
<td>Thai
@@ -84623,7 +84651,11 @@ <h5 id=determining-the-character-encoding><span class=secno>12.2.2.2 </span>Dete
<td>Turkish
<td>windows-1254 <!-- Windows Vista, Chrome, and Firefox agreed -->

<!-- tt, Tatar, is not listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- tt, Tatar, was not initially listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- further feedback has changed this: -->
<tr><td>tt
<td>Tatar
<td>windows-1251 <!-- per https://www.w3.org/Bugs/Public/show_bug.cgi?id=23089 -->

<!-- tzm-Latn-DZ, Tamazight (Latin, Algeria), uses windows-1252: Windows Vista and Firefox agreed -->

42 changes: 37 additions & 5 deletions index
@@ -84382,8 +84382,16 @@ dictionary <dfn id=storageeventinit>StorageEventInit</dfn> : <a href=#eventinit>
<!-- az-Cyrl-AZ, Azeri (Cyrillic, Azerbaijan), is not listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->

<!-- ba-RU, Bashkir (Russia), is not listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- ba wasn't listed at all because none of the sources knew about it. However, further feedback has changed this: -->
<tr><td>ba
<td>Bashkir
<td>windows-1251 <!-- per https://www.w3.org/Bugs/Public/show_bug.cgi?id=23089 -->

<!-- be, Belarusian, is not listed here because Windows Vista wanted windows-1251, Chrome wanted <none>, and Firefox wanted ISO-8859-5 -->
<!-- be, Belarusian, was not initially listed here because Windows Vista wanted windows-1251, Chrome wanted <none>, and Firefox wanted ISO-8859-5 -->
<!-- further feedback has changed this: -->
<tr><td>be
<td>Belarusian
<td>windows-1251 <!-- per https://www.w3.org/Bugs/Public/show_bug.cgi?id=23089 -->

<!-- be-BY, Belarusian (Belarus), is not listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->

@@ -84481,7 +84489,11 @@ dictionary <dfn id=storageeventinit>StorageEventInit</dfn> : <a href=#eventinit>
<td>Japanese
<td>Shift_JIS <!-- Windows Vista, Chrome, and Firefox agreed -->

<!-- kk, Kazakh, is not listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- kk, Kazakh, was not initially listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- further feedback has changed this: -->
<tr><td>kk
<td>Kazakh
<td>windows-1251 <!-- per https://www.w3.org/Bugs/Public/show_bug.cgi?id=23089 -->

<!-- kl-GL, Greenlandic (Greenland), uses windows-1252: Windows Vista and Firefox agreed -->

@@ -84495,7 +84507,11 @@ dictionary <dfn id=storageeventinit>StorageEventInit</dfn> : <a href=#eventinit>
<td>Kurdish
<td>windows-1254 <!-- Best guess -->

<!-- ky, Kyrgyz, is not listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- ky, Kyrgyz, was not initially listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- further feedback has changed this: -->
<tr><td>ky
<td>Kyrgyz
<td>windows-1251 <!-- per https://www.w3.org/Bugs/Public/show_bug.cgi?id=23089 -->

<!-- lb-LU, Luxembourgish (Luxembourg), uses windows-1252: Windows Vista and Firefox agreed -->

@@ -84507,7 +84523,11 @@ dictionary <dfn id=storageeventinit>StorageEventInit</dfn> : <a href=#eventinit>
<td>Latvian
<td>windows-1257 <!-- Windows Vista and Chrome agreed (but disagreed with Firefox, which thought the encoding should be ISO-8859-13) -->

<!-- mk, Macedonian, is not listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- mk, Macedonian, was not initially listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- further feedback has changed this: -->
<tr><td>mk
<td>Macedonian
<td>windows-1251 <!-- per https://www.w3.org/Bugs/Public/show_bug.cgi?id=23089 -->

<!-- ml, Malayalam, uses windows-1252: Firefox and Chrome agreed -->

@@ -84562,6 +84582,10 @@ dictionary <dfn id=storageeventinit>StorageEventInit</dfn> : <a href=#eventinit>
<!-- rw-RW, Kinyarwanda (Rwanda), uses windows-1252: Windows Vista and Firefox agreed -->

<!-- sah-RU, Yakut (Russia), is not listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- sah wasn't listed at all because none of the sources knew about it. However, further feedback has changed this: -->
<tr><td>sah
<td>Yakut
<td>windows-1251 <!-- per https://www.w3.org/Bugs/Public/show_bug.cgi?id=23089 -->

<!-- se-FI, Sami, Northern (Finland), uses windows-1252: Windows Vista and Firefox agreed -->

@@ -84610,6 +84634,10 @@ dictionary <dfn id=storageeventinit>StorageEventInit</dfn> : <a href=#eventinit>
<!-- te, Telugu, uses windows-1252: Firefox and Chrome agreed -->

<!-- tg-Cyrl-TJ, Tajik (Cyrillic, Tajikistan), is not listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- tg wasn't listed at all because none of the sources knew about it. However, further feedback has changed this: -->
<tr><td>tg
<td>Tajik
<td>windows-1251 <!-- per https://www.w3.org/Bugs/Public/show_bug.cgi?id=23089 -->

<tr><td>th
<td>Thai
@@ -84623,7 +84651,11 @@ dictionary <dfn id=storageeventinit>StorageEventInit</dfn> : <a href=#eventinit>
<td>Turkish
<td>windows-1254 <!-- Windows Vista, Chrome, and Firefox agreed -->

<!-- tt, Tatar, is not listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- tt, Tatar, was not initially listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- further feedback has changed this: -->
<tr><td>tt
<td>Tatar
<td>windows-1251 <!-- per https://www.w3.org/Bugs/Public/show_bug.cgi?id=23089 -->

<!-- tzm-Latn-DZ, Tamazight (Latin, Algeria), uses windows-1252: Windows Vista and Firefox agreed -->

50 changes: 45 additions & 5 deletions source
@@ -93845,8 +93845,18 @@ dictionary <dfn>StorageEventInit</dfn> : <span>EventInit</span> {
<!-- az-Cyrl-AZ, Azeri (Cyrillic, Azerbaijan), is not listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->

<!-- ba-RU, Bashkir (Russia), is not listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- ba wasn't listed at all because none of the sources knew about it. However, further feedback has changed this: -->
<tr>
<td>ba
<td>Bashkir
<td>windows-1251 <!-- per https://www.w3.org/Bugs/Public/show_bug.cgi?id=23089 -->

<!-- be, Belarusian, is not listed here because Windows Vista wanted windows-1251, Chrome wanted <none>, and Firefox wanted ISO-8859-5 -->
<!-- be, Belarusian, was not initially listed here because Windows Vista wanted windows-1251, Chrome wanted <none>, and Firefox wanted ISO-8859-5 -->
<!-- further feedback has changed this: -->
<tr>
<td>be
<td>Belarusian
<td>windows-1251 <!-- per https://www.w3.org/Bugs/Public/show_bug.cgi?id=23089 -->

<!-- be-BY, Belarusian (Belarus), is not listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->

@@ -93952,7 +93962,12 @@ dictionary <dfn>StorageEventInit</dfn> : <span>EventInit</span> {
<td>Japanese
<td>Shift_JIS <!-- Windows Vista, Chrome, and Firefox agreed -->

<!-- kk, Kazakh, is not listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- kk, Kazakh, was not initially listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- further feedback has changed this: -->
<tr>
<td>kk
<td>Kazakh
<td>windows-1251 <!-- per https://www.w3.org/Bugs/Public/show_bug.cgi?id=23089 -->

<!-- kl-GL, Greenlandic (Greenland), uses windows-1252: Windows Vista and Firefox agreed -->

@@ -93968,7 +93983,12 @@ dictionary <dfn>StorageEventInit</dfn> : <span>EventInit</span> {
<td>Kurdish
<td>windows-1254 <!-- Best guess -->

<!-- ky, Kyrgyz, is not listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- ky, Kyrgyz, was not initially listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- further feedback has changed this: -->
<tr>
<td>ky
<td>Kyrgyz
<td>windows-1251 <!-- per https://www.w3.org/Bugs/Public/show_bug.cgi?id=23089 -->

<!-- lb-LU, Luxembourgish (Luxembourg), uses windows-1252: Windows Vista and Firefox agreed -->

@@ -93982,7 +94002,12 @@ dictionary <dfn>StorageEventInit</dfn> : <span>EventInit</span> {
<td>Latvian
<td>windows-1257 <!-- Windows Vista and Chrome agreed (but disagreed with Firefox, which thought the encoding should be ISO-8859-13) -->

<!-- mk, Macedonian, is not listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- mk, Macedonian, was not initially listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- further feedback has changed this: -->
<tr>
<td>mk
<td>Macedonian
<td>windows-1251 <!-- per https://www.w3.org/Bugs/Public/show_bug.cgi?id=23089 -->

<!-- ml, Malayalam, uses windows-1252: Firefox and Chrome agreed -->

@@ -94039,6 +94064,11 @@ dictionary <dfn>StorageEventInit</dfn> : <span>EventInit</span> {
<!-- rw-RW, Kinyarwanda (Rwanda), uses windows-1252: Windows Vista and Firefox agreed -->

<!-- sah-RU, Yakut (Russia), is not listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- sah wasn't listed at all because none of the sources knew about it. However, further feedback has changed this: -->
<tr>
<td>sah
<td>Yakut
<td>windows-1251 <!-- per https://www.w3.org/Bugs/Public/show_bug.cgi?id=23089 -->

<!-- se-FI, Sami, Northern (Finland), uses windows-1252: Windows Vista and Firefox agreed -->

@@ -94090,6 +94120,11 @@ dictionary <dfn>StorageEventInit</dfn> : <span>EventInit</span> {
<!-- te, Telugu, uses windows-1252: Firefox and Chrome agreed -->

<!-- tg-Cyrl-TJ, Tajik (Cyrillic, Tajikistan), is not listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- tg wasn't listed at all because none of the sources knew about it. However, further feedback has changed this: -->
<tr>
<td>tg
<td>Tajik
<td>windows-1251 <!-- per https://www.w3.org/Bugs/Public/show_bug.cgi?id=23089 -->

<tr>
<td>th
@@ -94105,7 +94140,12 @@ dictionary <dfn>StorageEventInit</dfn> : <span>EventInit</span> {
<td>Turkish
<td>windows-1254 <!-- Windows Vista, Chrome, and Firefox agreed -->

<!-- tt, Tatar, is not listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- tt, Tatar, was not initially listed here because neither Chrome nor Firefox knew about it. For what it's worth, Windows Vista wanted windows-1251 -->
<!-- further feedback has changed this: -->
<tr>
<td>tt
<td>Tatar
<td>windows-1251 <!-- per https://www.w3.org/Bugs/Public/show_bug.cgi?id=23089 -->

<!-- tzm-Latn-DZ, Tamazight (Latin, Algeria), uses windows-1252: Windows Vista and Firefox agreed -->
