util: implement WHATWG Encoding Standard API

Provide an (initially experimental) implementation of the WHATWG Encoding Standard API (`TextDecoder` and `TextEncoder`). The is the same API implemented on the browser side. By default, with small-icu, only the UTF-8, UTF-16le and UTF-16be decoders are supported. With full-icu enabled, every encoding other than iso-8859-16 is supported. This provides a basic test, but does not include the full web platform tests. Note: many of the web platform tests for this would fail by default because we ship with small-icu by default. A process warning will be emitted on first use to indicate that the API is still experimental. No runtime flag is required to use the feature. Refs: https://encoding.spec.whatwg.org/ PR-URL: https://github.com/nodejs/node/pull/13644 Reviewed-By: Timothy Gu <timothygu99@gmail.com> Reviewed-By: Matteo Collina <matteo.collina@gmail.com>
8 years ago · ed21cb1774
12 changed files with 1189 additions and 16 deletions
--- a/doc/api/buffer.md
+++ b/doc/api/buffer.md
@ -193,11 +193,12 @@ The character encodings currently supported by Node.js include:

 * `'hex'` - Encode each byte as two hexadecimal characters.

-*Note*: Today's browsers follow the [WHATWG spec] which aliases both 'latin1'
-and ISO-8859-1 to win-1252. This means that while doing something like
-`http.get()`, if the returned charset is one of those listed in the WHATWG spec
-it's possible that the server actually returned win-1252-encoded data, and
-using `'latin1'` encoding may incorrectly decode the characters.
+*Note*: Today's browsers follow the [WHATWG Encoding Standard][] which aliases
+both 'latin1' and ISO-8859-1 to win-1252. This means that while doing something
+like `http.get()`, if the returned charset is one of those listed in the WHATWG
+specification it is possible that the server actually returned
+win-1252-encoded data, and using `'latin1'` encoding may incorrectly decode the
+characters.

 ## Buffers and TypedArray
 <!-- YAML
@ -2662,7 +2663,6 @@ buf.fill(0);
 console.log(buf);
 ```

-
 ## Buffer Constants
 <!-- YAML
 added: 8.2.0
@ -2730,5 +2730,5 @@ This value may depend on the JS engine that is being used.
 [`util.inspect()`]: util.html#util_util_inspect_object_options
 [RFC1345]: https://tools.ietf.org/html/rfc1345
 [RFC4648, Section 5]: https://tools.ietf.org/html/rfc4648#section-5
-[WHATWG spec]: https://encoding.spec.whatwg.org/
+[WHATWG Encoding Standard]: https://encoding.spec.whatwg.org/
 [iterator]: https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Iteration_protocols
--- a/doc/api/util.md
+++ b/doc/api/util.md
@ -536,6 +536,156 @@ added: v8.0.0
 A Symbol that can be used to declare custom promisified variants of functions,
 see [Custom promisified functions][].

+### Class: util.TextDecoder
+<!-- YAML
+added: REPLACEME
+-->
+
+> Stability: 1 - Experimental
+
+An implementation of the [WHATWG Encoding Standard][] `TextDecoder` API.
+
+```js
+const decoder = new TextDecoder('shift_jis');
+let string = '';
+let buffer;
+while (buffer = getNextChunkSomehow()) {
+  string += decoder.decode(buffer, { stream: true });
+}
+string += decoder.decode(); // end-of-stream
+```
+
+#### WHATWG Supported Encodings
+
+Per the [WHATWG Encoding Standard][], the encodings supported by the
+`TextDecoder` API are outlined in the tables below. For each encoding,
+one or more aliases may be used. Support for some encodings is enabled
+only when Node.js is using the full ICU data.
+
+##### Encodings Supported By Default
+
+| Encoding    | Aliases                           |
+| ----------- | --------------------------------- |
+| `'utf8'`    | `'unicode-1-1-utf-8'`, `'utf-8'`  |
+| `'utf-16be'`|                                   |
+| `'utf-16le'`| `'utf-16'`                        |
+
+##### Encodings Requiring Full-ICU
+
+| Encoding          | Aliases                          |
+| ----------------- | -------------------------------- |
+| `'ibm866'`        | `'866'`, `'cp866'`, `'csibm866'` |
+| `'iso-8859-2'`    | `'csisolatin2'`, `'iso-ir-101'`, `'iso8859-2'`, `'iso88592'`, `'iso_8859-2'`, `'iso_8859-2:1987'`, `'l2'`, `'latin2'` |
+| `'iso-8859-3'`    | `'csisolatin3'`, `'iso-ir-109'`, `'iso8859-3'`, `'iso88593'`, `'iso_8859-3'`, `'iso_8859-3:1988'`, `'l3'`, `'latin3'` |
+| `'iso-8859-4'`    | `'csisolatin4'`, `'iso-ir-110'`, `'iso8859-4'`, `'iso88594'`, `'iso_8859-4'`, `'iso_8859-4:1988'`, `'l4'`, `'latin4'` |
+| `'iso-8859-5'`    | `'csisolatincyrillic'`, `'cyrillic'`, `'iso-ir-144'`, `'iso8859-5'`, `'iso88595'`, `'iso_8859-5'`, `'iso_8859-5:1988'`|
+| `'iso-8859-6'`    | `'arabic'`, `'asmo-708'`, `'csiso88596e'`, `'csiso88596i'`, `'csisolatinarabic'`, `'ecma-114'`, `'iso-8859-6-e'`, `'iso-8859-6-i'`, `'iso-ir-127'`, `'iso8859-6'`, `'iso88596'`, `'iso_8859-6'`, `'iso_8859-6:1987'` |
+| `'iso-8859-7'`    | `'csisolatingreek'`, `'ecma-118'`, `'elot_928'`, `'greek'`, `'greek8'`, `'iso-ir-126'`, `'iso8859-7'`, `'iso88597'`, `'iso_8859-7'`, `'iso_8859-7:1987'`, `'sun_eu_greek'` |
+| `'iso-8859-8'`    | `'csiso88598e'`, `'csisolatinhebrew'`, `'hebrew'`, `'iso-8859-8-e'`, `'iso-ir-138'`, `'iso8859-8'`, `'iso88598'`, `'iso_8859-8'`, `'iso_8859-8:1988'`, `'visual'` |
+| `'iso-8859-8-i'`  | `'csiso88598i'`, `'logical'` |
+| `'iso-8859-10'`   | `'csisolatin6'`, `'iso-ir-157'`, `'iso8859-10'`, `'iso885910'`, `'l6'`, `'latin6'` |
+| `'iso-8859-13'`   | `'iso8859-13'`, `'iso885913'` |
+| `'iso-8859-14'`   | `'iso8859-14'`, `'iso885914'` |
+| `'iso-8859-15'`   | `'csisolatin9'`, `'iso8859-15'`, `'iso885915'`, `'iso_8859-15'`, `'l9'` |
+| `'koi8-r'`        | `'cskoi8r'`, `'koi'`, `'koi8'`, `'koi8_r'` |
+| `'koi8-u'`        | `'koi8-ru'` |
+| `'macintosh'`     | `'csmacintosh'`, `'mac'`, `'x-mac-roman'` |
+| `'windows-874'`   | `'dos-874'`, `'iso-8859-11'`, `'iso8859-11'`, `'iso885911'`, `'tis-620'` |
+| `'windows-1250'`  | `'cp1250'`, `'x-cp1250'` |
+| `'windows-1251'`  | `'cp1251'`, `'x-cp1251'` |
+| `'windows-1252'`  | `'ansi_x3.4-1968'`, `'ascii'`, `'cp1252'`, `'cp819'`, `'csisolatin1'`, `'ibm819'`, `'iso-8859-1'`, `'iso-ir-100'`, `'iso8859-1'`, `'iso88591'`, `'iso_8859-1'`, `'iso_8859-1:1987'`, `'l1'`, `'latin1'`, `'us-ascii'`, `'x-cp1252'` |
+| `'windows-1253'`  | `'cp1253'`, `'x-cp1253'` |
+| `'windows-1254'`  | `'cp1254'`, `'csisolatin5'`, `'iso-8859-9'`, `'iso-ir-148'`, `'iso8859-9'`, `'iso88599'`, `'iso_8859-9'`, `'iso_8859-9:1989'`, `'l5'`, `'latin5'`, `'x-cp1254'` |
+| `'windows-1255'`  | `'cp1255'`, `'x-cp1255'` |
+| `'windows-1256'`  | `'cp1256'`, `'x-cp1256'` |
+| `'windows-1257'`  | `'cp1257'`, `'x-cp1257'` |
+| `'windows-1258'`  | `'cp1258'`, `'x-cp1258'` |
+| `'x-mac-cyrillic'`| `'x-mac-ukrainian'` |
+| `'gbk'`           | `'chinese'`, `'csgb2312'`, `'csiso58gb231280'`, `'gb2312'`, `'gb_2312'`, `'gb_2312-80'`, `'iso-ir-58'`, `'x-gbk'` |
+| `'gb18030'`       | |
+| `'big5'`          | `'big5-hkscs'`, `'cn-big5'`, `'csbig5'`, `'x-x-big5'` |
+| `'euc-jp'`        | `'cseucpkdfmtjapanese'`, `'x-euc-jp'` |
+| `'iso-2022-jp'`   | `'csiso2022jp'` |
+| `'shift_jis'`     | `'csshiftjis'`, `'ms932'`, `'ms_kanji'`, `'shift-jis'`, `'sjis'`, `'windows-31j'`, `'x-sjis'` |
+| `'euc-kr'`        | `'cseuckr'`, `'csksc56011987'`, `'iso-ir-149'`, `'korean'`, `'ks_c_5601-1987'`, `'ks_c_5601-1989'`, `'ksc5601'`, `'ksc_5601'`, `'windows-949'` |
+
+*Note*: The `'iso-8859-16'` encoding listed in the [WHATWG Encoding Standard][]
+is not supported.
+
+#### new TextDecoder([encoding[, options]])
+
+* `encoding` {string} Identifies the `encoding` that this `TextDecoder` instance
+  supports. Defaults to `'utf-8'`.
+* `options` {Object}
+  * `fatal` {boolean} `true` if decoding failures are fatal. Defaults to
+    `false`.
+  * `ignoreBOM` {boolean} When `true`, the `TextDecoder` will include the byte
+     order mark in the decoded result. When `false`, the byte order mark will
+     be removed from the output. This option is only used when `encoding` is
+     `'utf-8'`, `'utf-16be'` or `'utf-16le'`. Defaults to `false`.
+
+Creates an new `TextDecoder` instance. The `encoding` may specify one of the
+supported encodings or an alias.
+
+#### textDecoder.decode([input[, options]])
+
+* `input` {ArrayBuffer|DataView|TypedArray} An `ArrayBuffer`, `DataView` or
+  Typed Array instance containing the encoded data.
+* `options` {Object}
+  * `stream` {boolean} `true` if additional chunks of data are expected.
+    Defaults to `false`.
+* Returns: {string}
+
+Decodes the `input` and returns a string. If `options.stream` is `true`, any
+incomplete byte sequences occuring at the end of the `input` are buffered
+internally and emitted after the next call to `textDecoder.decode()`.
+
+If `textDecoder.fatal` is `true`, decoding errors that occur will result in a
+`TypeError` being thrown.
+
+#### textDecoder.encoding
+
+* Value: {string}
+
+The encoding supported by the `TextDecoder` instance.
+
+#### textDecoder.fatal
+
+* Value: {boolean}
+
+The value will be `true` if decoding errors result in a `TypeError` being
+thrown.
+
+#### textDecoder.ignoreBOM
+
+* Value: {boolean}
+
+The value will be `true` if the decoding result will include the byte order
+mark.
+
+### Class: util.TextEncoder
+<!-- YAML
+added: REPLACEME
+-->
+
+> Stability: 1 - Experimental
+
+An implementation of the [WHATWG Encoding Standard][] `TextEncoder` API. All
+instances of `TextEncoder` only support `UTF-8` encoding.
+
+```js
+const encoder = new TextEncoder();
+const uint8array = encoder.encode('this is some data');
+```
+
+#### textEncoder.encode([input])
+
+* `input` {string} The text to encode. Defaults to an empty string.
+* Returns: {Uint8Array}
+
+UTF-8 Encodes the `input` string and returns a `Uint8Array` containing the
+encoded bytes.
+
 ## Deprecated APIs

 The following APIs have been deprecated and should no longer be used. Existing
@ -1022,3 +1172,4 @@ Deprecated predecessor of `console.log`.
 [Custom promisified functions]: #util_custom_promisified_functions
 [constructor]: https://developer.mozilla.org/en/JavaScript/Reference/Global_Objects/Object/constructor
 [semantically incompatible]: https://github.com/nodejs/node/issues/4179
+[WHATWG Encoding Standard]: https://encoding.spec.whatwg.org/
--- a/lib/internal/encoding.js
+++ b/lib/internal/encoding.js
@ -0,0 +1,458 @@
+'use strict';
+
+// An implementation of the WHATWG Encoding Standard
+// https://encoding.spec.whatwg.org
+
+const errors = require('internal/errors');
+const kHandle = Symbol('handle');
+const kFlags = Symbol('flags');
+const kEncoding = Symbol('encoding');
+const kDecoder = Symbol('decoder');
+const kEncoder = Symbol('encoder');
+
+let warned = false;
+const experimental =
+  'The WHATWG Encoding Standard implementation is an experimental API. It ' +
+  'should not yet be used in production applications.';
+
+const {
+  getConstructorOf,
+  customInspectSymbol: inspect
+} = require('internal/util');
+
+const {
+  isArrayBuffer
+} = process.binding('util');
+
+const {
+  encodeUtf8String
+} = process.binding('buffer');
+
+const {
+  decode: _decode,
+  getConverter,
+  hasConverter
+} = process.binding('icu');
+
+const CONVERTER_FLAGS_FLUSH = 0x1;
+const CONVERTER_FLAGS_FATAL = 0x2;
+const CONVERTER_FLAGS_IGNORE_BOM = 0x4;
+
+const empty = new Uint8Array(0);
+
+const encodings = new Map([
+  ['unicode-1-1-utf-8', 'utf-8'],
+  ['utf8', 'utf-8'],
+  ['utf-8', 'utf-8'],
+  ['866', 'ibm866'],
+  ['cp866', 'ibm866'],
+  ['csibm866', 'ibm866'],
+  ['ibm866', 'ibm866'],
+  ['csisolatin2', 'iso-8859-2'],
+  ['iso-8859-2', 'iso-8859-2'],
+  ['iso-ir-101', 'iso-8859-2'],
+  ['iso8859-2', 'iso-8859-2'],
+  ['iso88592', 'iso-8859-2'],
+  ['iso_8859-2', 'iso-8859-2'],
+  ['iso_8859-2:1987', 'iso-8859-2'],
+  ['l2', 'iso-8859-2'],
+  ['latin2', 'iso-8859-2'],
+  ['csisolatin3', 'iso-8859-3'],
+  ['iso-8859-3', 'iso-8859-3'],
+  ['iso-ir-109', 'iso-8859-3'],
+  ['iso8859-3', 'iso-8859-3'],
+  ['iso88593', 'iso-8859-3'],
+  ['iso_8859-3', 'iso-8859-3'],
+  ['iso_8859-3:1988', 'iso-8859-3'],
+  ['l3', 'iso-8859-3'],
+  ['latin3', 'iso-8859-3'],
+  ['csisolatin4', 'iso-8859-4'],
+  ['iso-8859-4', 'iso-8859-4'],
+  ['iso-ir-110', 'iso-8859-4'],
+  ['iso8859-4', 'iso-8859-4'],
+  ['iso88594', 'iso-8859-4'],
+  ['iso_8859-4', 'iso-8859-4'],
+  ['iso_8859-4:1988', 'iso-8859-4'],
+  ['l4', 'iso-8859-4'],
+  ['latin4', 'iso-8859-4'],
+  ['csisolatincyrillic', 'iso-8859-5'],
+  ['cyrillic', 'iso-8859-5'],
+  ['iso-8859-5', 'iso-8859-5'],
+  ['iso-ir-144', 'iso-8859-5'],
+  ['iso8859-5', 'iso-8859-5'],
+  ['iso88595', 'iso-8859-5'],
+  ['iso_8859-5', 'iso-8859-5'],
+  ['iso_8859-5:1988', 'iso-8859-5'],
+  ['arabic', 'iso-8859-6'],
+  ['asmo-708', 'iso-8859-6'],
+  ['csiso88596e', 'iso-8859-6'],
+  ['csiso88596i', 'iso-8859-6'],
+  ['csisolatinarabic', 'iso-8859-6'],
+  ['ecma-114', 'iso-8859-6'],
+  ['iso-8859-6', 'iso-8859-6'],
+  ['iso-8859-6-e', 'iso-8859-6'],
+  ['iso-8859-6-i', 'iso-8859-6'],
+  ['iso-ir-127', 'iso-8859-6'],
+  ['iso8859-6', 'iso-8859-6'],
+  ['iso88596', 'iso-8859-6'],
+  ['iso_8859-6', 'iso-8859-6'],
+  ['iso_8859-6:1987', 'iso-8859-6'],
+  ['csisolatingreek', 'iso-8859-7'],
+  ['ecma-118', 'iso-8859-7'],
+  ['elot_928', 'iso-8859-7'],
+  ['greek', 'iso-8859-7'],
+  ['greek8', 'iso-8859-7'],
+  ['iso-8859-7', 'iso-8859-7'],
+  ['iso-ir-126', 'iso-8859-7'],
+  ['iso8859-7', 'iso-8859-7'],
+  ['iso88597', 'iso-8859-7'],
+  ['iso_8859-7', 'iso-8859-7'],
+  ['iso_8859-7:1987', 'iso-8859-7'],
+  ['sun_eu_greek', 'iso-8859-7'],
+  ['csiso88598e', 'iso-8859-8'],
+  ['csisolatinhebrew', 'iso-8859-8'],
+  ['hebrew', 'iso-8859-8'],
+  ['iso-8859-8', 'iso-8859-8'],
+  ['iso-8859-8-e', 'iso-8859-8'],
+  ['iso-ir-138', 'iso-8859-8'],
+  ['iso8859-8', 'iso-8859-8'],
+  ['iso88598', 'iso-8859-8'],
+  ['iso_8859-8', 'iso-8859-8'],
+  ['iso_8859-8:1988', 'iso-8859-8'],
+  ['visual', 'iso-8859-8'],
+  ['csiso88598i', 'iso-8859-8-i'],
+  ['iso-8859-8-i', 'iso-8859-8-i'],
+  ['logical', 'iso-8859-8-i'],
+  ['csisolatin6', 'iso-8859-10'],
+  ['iso-8859-10', 'iso-8859-10'],
+  ['iso-ir-157', 'iso-8859-10'],
+  ['iso8859-10', 'iso-8859-10'],
+  ['iso885910', 'iso-8859-10'],
+  ['l6', 'iso-8859-10'],
+  ['latin6', 'iso-8859-10'],
+  ['iso-8859-13', 'iso-8859-13'],
+  ['iso8859-13', 'iso-8859-13'],
+  ['iso885913', 'iso-8859-13'],
+  ['iso-8859-14', 'iso-8859-14'],
+  ['iso8859-14', 'iso-8859-14'],
+  ['iso885914', 'iso-8859-14'],
+  ['csisolatin9', 'iso-8859-15'],
+  ['iso-8859-15', 'iso-8859-15'],
+  ['iso8859-15', 'iso-8859-15'],
+  ['iso885915', 'iso-8859-15'],
+  ['iso_8859-15', 'iso-8859-15'],
+  ['l9', 'iso-8859-15'],
+  ['cskoi8r', 'koi8-r'],
+  ['koi', 'koi8-r'],
+  ['koi8', 'koi8-r'],
+  ['koi8-r', 'koi8-r'],
+  ['koi8_r', 'koi8-r'],
+  ['koi8-ru', 'koi8-u'],
+  ['koi8-u', 'koi8-u'],
+  ['csmacintosh', 'macintosh'],
+  ['mac', 'macintosh'],
+  ['macintosh', 'macintosh'],
+  ['x-mac-roman', 'macintosh'],
+  ['dos-874', 'windows-874'],
+  ['iso-8859-11', 'windows-874'],
+  ['iso8859-11', 'windows-874'],
+  ['iso885911', 'windows-874'],
+  ['tis-620', 'windows-874'],
+  ['windows-874', 'windows-874'],
+  ['cp1250', 'windows-1250'],
+  ['windows-1250', 'windows-1250'],
+  ['x-cp1250', 'windows-1250'],
+  ['cp1251', 'windows-1251'],
+  ['windows-1251', 'windows-1251'],
+  ['x-cp1251', 'windows-1251'],
+  ['ansi_x3.4-1968', 'windows-1252'],
+  ['ascii', 'windows-1252'],
+  ['cp1252', 'windows-1252'],
+  ['cp819', 'windows-1252'],
+  ['csisolatin1', 'windows-1252'],
+  ['ibm819', 'windows-1252'],
+  ['iso-8859-1', 'windows-1252'],
+  ['iso-ir-100', 'windows-1252'],
+  ['iso8859-1', 'windows-1252'],
+  ['iso88591', 'windows-1252'],
+  ['iso_8859-1', 'windows-1252'],
+  ['iso_8859-1:1987', 'windows-1252'],
+  ['l1', 'windows-1252'],
+  ['latin1', 'windows-1252'],
+  ['us-ascii', 'windows-1252'],
+  ['windows-1252', 'windows-1252'],
+  ['x-cp1252', 'windows-1252'],
+  ['cp1253', 'windows-1253'],
+  ['windows-1253', 'windows-1253'],
+  ['x-cp1253', 'windows-1253'],
+  ['cp1254', 'windows-1254'],
+  ['csisolatin5', 'windows-1254'],
+  ['iso-8859-9', 'windows-1254'],
+  ['iso-ir-148', 'windows-1254'],
+  ['iso8859-9', 'windows-1254'],
+  ['iso88599', 'windows-1254'],
+  ['iso_8859-9', 'windows-1254'],
+  ['iso_8859-9:1989', 'windows-1254'],
+  ['l5', 'windows-1254'],
+  ['latin5', 'windows-1254'],
+  ['windows-1254', 'windows-1254'],
+  ['x-cp1254', 'windows-1254'],
+  ['cp1255', 'windows-1255'],
+  ['windows-1255', 'windows-1255'],
+  ['x-cp1255', 'windows-1255'],
+  ['cp1256', 'windows-1256'],
+  ['windows-1256', 'windows-1256'],
+  ['x-cp1256', 'windows-1256'],
+  ['cp1257', 'windows-1257'],
+  ['windows-1257', 'windows-1257'],
+  ['x-cp1257', 'windows-1257'],
+  ['cp1258', 'windows-1258'],
+  ['windows-1258', 'windows-1258'],
+  ['x-cp1258', 'windows-1258'],
+  ['x-mac-cyrillic', 'x-mac-cyrillic'],
+  ['x-mac-ukrainian', 'x-mac-cyrillic'],
+  ['chinese', 'gbk'],
+  ['csgb2312', 'gbk'],
+  ['csiso58gb231280', 'gbk'],
+  ['gb2312', 'gbk'],
+  ['gb_2312', 'gbk'],
+  ['gb_2312-80', 'gbk'],
+  ['gbk', 'gbk'],
+  ['iso-ir-58', 'gbk'],
+  ['x-gbk', 'gbk'],
+  ['gb18030', 'gb18030'],
+  ['big5', 'big5'],
+  ['big5-hkscs', 'big5'],
+  ['cn-big5', 'big5'],
+  ['csbig5', 'big5'],
+  ['x-x-big5', 'big5'],
+  ['cseucpkdfmtjapanese', 'euc-jp'],
+  ['euc-jp', 'euc-jp'],
+  ['x-euc-jp', 'euc-jp'],
+  ['csiso2022jp', 'iso-2022-jp'],
+  ['iso-2022-jp', 'iso-2022-jp'],
+  ['csshiftjis', 'shift_jis'],
+  ['ms932', 'shift_jis'],
+  ['ms_kanji', 'shift_jis'],
+  ['shift-jis', 'shift_jis'],
+  ['shift_jis', 'shift_jis'],
+  ['sjis', 'shift_jis'],
+  ['windows-31j', 'shift_jis'],
+  ['x-sjis', 'shift_jis'],
+  ['cseuckr', 'euc-kr'],
+  ['csksc56011987', 'euc-kr'],
+  ['euc-kr', 'euc-kr'],
+  ['iso-ir-149', 'euc-kr'],
+  ['korean', 'euc-kr'],
+  ['ks_c_5601-1987', 'euc-kr'],
+  ['ks_c_5601-1989', 'euc-kr'],
+  ['ksc5601', 'euc-kr'],
+  ['ksc_5601', 'euc-kr'],
+  ['windows-949', 'euc-kr'],
+  ['utf-16be', 'utf-16be'],
+  ['utf-16le', 'utf-16le'],
+  ['utf-16', 'utf-16le']
+]);
+
+// Unfortunately, String.prototype.trim also removes non-ascii whitespace,
+// so we have to do this manually
+function trimAsciiWhitespace(label) {
+  var s = 0;
+  var e = label.length;
+  while (s < e && (
+    label[s] === '\u0009' ||
+    label[s] === '\u000a' ||
+    label[s] === '\u000c' ||
+    label[s] === '\u000d' ||
+    label[s] === '\u0020')) {
+    s++;
+  }
+  while (e > s && (
+    label[e - 1] === '\u0009' ||
+    label[e - 1] === '\u000a' ||
+    label[e - 1] === '\u000c' ||
+    label[e - 1] === '\u000d' ||
+    label[e - 1] === '\u0020')) {
+    e--;
+  }
+  return label.slice(s, e);
+}
+
+function getEncodingFromLabel(label) {
+  const enc = encodings.get(label);
+  if (enc !== undefined) return enc;
+  return encodings.get(trimAsciiWhitespace(label.toLowerCase()));
+}
+
+function hasTextDecoder(encoding = 'utf-8') {
+  if (typeof encoding !== 'string')
+    throw new errors.Error('ERR_INVALID_ARG_TYPE', 'encoding', 'string');
+  return hasConverter(getEncodingFromLabel(encoding));
+}
+
+var Buffer;
+function lazyBuffer() {
+  if (Buffer === undefined)
+    Buffer = require('buffer').Buffer;
+  return Buffer;
+}
+
+class TextDecoder {
+  constructor(encoding = 'utf-8', options = {}) {
+    if (!warned) {
+      warned = true;
+      process.emitWarning(experimental, 'ExperimentalWarning');
+    }
+
+    encoding = `${encoding}`;
+    if (typeof options !== 'object')
+      throw new errors.Error('ERR_INVALID_ARG_TYPE', 'options', 'object');
+
+    const enc = getEncodingFromLabel(encoding);
+    if (enc === undefined)
+      throw new errors.RangeError('ERR_ENCODING_NOT_SUPPORTED', encoding);
+
+    var flags = 0;
+    if (options !== null) {
+      flags |= options.fatal ? CONVERTER_FLAGS_FATAL : 0;
+      flags |= options.ignoreBOM ? CONVERTER_FLAGS_IGNORE_BOM : 0;
+    }
+
+    const handle = getConverter(enc, flags);
+    if (handle === undefined)
+      throw new errors.Error('ERR_ENCODING_NOT_SUPPORTED', encoding);
+
+    this[kHandle] = handle;
+    this[kFlags] = flags;
+    this[kEncoding] = enc;
+  }
+
+  get encoding() {
+    if (this == null || this[kDecoder] !== true)
+      throw new errors.TypeError('ERR_INVALID_THIS', 'TextDecoder');
+    return this[kEncoding];
+  }
+
+  get fatal() {
+    if (this == null || this[kDecoder] !== true)
+      throw new errors.TypeError('ERR_INVALID_THIS', 'TextDecoder');
+    return (this[kFlags] & CONVERTER_FLAGS_FATAL) === CONVERTER_FLAGS_FATAL;
+  }
+
+  get ignoreBOM() {
+    if (this == null || this[kDecoder] !== true)
+      throw new errors.TypeError('ERR_INVALID_THIS', 'TextDecoder');
+    return (this[kFlags] & CONVERTER_FLAGS_IGNORE_BOM) ===
+           CONVERTER_FLAGS_IGNORE_BOM;
+  }
+
+  decode(input = empty, options = {}) {
+    if (this == null || this[kDecoder] !== true)
+      throw new errors.TypeError('ERR_INVALID_THIS', 'TextDecoder');
+    if (isArrayBuffer(input)) {
+      input = lazyBuffer().from(input);
+    } else if (!ArrayBuffer.isView(input)) {
+      throw new errors.TypeError('ERR_INVALID_ARG_TYPE', 'input',
+                                 ['ArrayBuffer', 'ArrayBufferView']);
+    }
+    if (typeof options !== 'object') {
+      throw new errors.TypeError('ERR_INVALID_ARG_TYPE', 'options', 'object');
+    }
+
+    var flags = 0;
+    if (options !== null)
+      flags |= options.stream ? 0 : CONVERTER_FLAGS_FLUSH;
+
+    const ret = _decode(this[kHandle], input, flags);
+    if (typeof ret === 'number') {
+      const err = new errors.TypeError('ERR_ENCODING_INVALID_ENCODED_DATA',
+                                       this.encoding);
+      err.errno = ret;
+      throw err;
+    }
+    return ret.toString('ucs2');
+  }
+
+  [inspect](depth, opts) {
+    if (this == null || this[kDecoder] !== true)
+      throw new errors.TypeError('ERR_INVALID_THIS', 'TextDecoder');
+    if (typeof depth === 'number' && depth < 0)
+      return opts.stylize('[Object]', 'special');
+    var ctor = getConstructorOf(this);
+    var obj = Object.create({
+      constructor: ctor === null ? TextDecoder : ctor
+    });
+    obj.encoding = this.encoding;
+    obj.fatal = this.fatal;
+    obj.ignoreBOM = this.ignoreBOM;
+    if (opts.showHidden) {
+      obj[kFlags] = this[kFlags];
+      obj[kHandle] = this[kHandle];
+    }
+    // Lazy to avoid circular dependency
+    return require('util').inspect(obj, opts);
+  }
+}
+
+class TextEncoder {
+  constructor() {
+    if (!warned) {
+      warned = true;
+      process.emitWarning(experimental, 'ExperimentalWarning');
+    }
+  }
+
+  get encoding() {
+    if (this == null || this[kEncoder] !== true)
+      throw new errors.TypeError('ERR_INVALID_THIS', 'TextEncoder');
+    return 'utf-8';
+  }
+
+  encode(input = '') {
+    if (this == null || this[kEncoder] !== true)
+      throw new errors.TypeError('ERR_INVALID_THIS', 'TextEncoder');
+    return encodeUtf8String(`${input}`);
+  }
+
+  [inspect](depth, opts) {
+    if (this == null || this[kEncoder] !== true)
+      throw new errors.TypeError('ERR_INVALID_THIS', 'TextEncoder');
+    if (typeof depth === 'number' && depth < 0)
+      return opts.stylize('[Object]', 'special');
+    var ctor = getConstructorOf(this);
+    var obj = Object.create({
+      constructor: ctor === null ? TextEncoder : ctor
+    });
+    obj.encoding = this.encoding;
+    // Lazy to avoid circular dependency
+    return require('util').inspect(obj, opts);
+  }
+}
+
+Object.defineProperties(
+  TextDecoder.prototype, {
+    [kDecoder]: { enumerable: false, value: true, configurable: false },
+    'decode': { enumerable: true },
+    'encoding': { enumerable: true },
+    'fatal': { enumerable: true },
+    'ignoreBOM': { enumerable: true },
+    [Symbol.toStringTag]: {
+      configurable: true,
+      value: 'TextDecoder'
+    } });
+Object.defineProperties(
+  TextEncoder.prototype, {
+    [kEncoder]: { enumerable: false, value: true, configurable: false },
+    'encode': { enumerable: true },
+    'encoding': { enumerable: true },
+    [Symbol.toStringTag]: {
+      configurable: true,
+      value: 'TextEncoder'
+    } });
+
+module.exports = {
+  getEncodingFromLabel,
+  hasTextDecoder,
+  TextDecoder,
+  TextEncoder
+};
--- a/lib/internal/errors.js
+++ b/lib/internal/errors.js
@ -109,6 +109,10 @@ E('ERR_CPU_USAGE', 'Unable to obtain cpu usage %s');
 E('ERR_DNS_SET_SERVERS_FAILED', (err, servers) =>
  `c-ares failed to set servers: "${err}" [${servers}]`);
 E('ERR_FALSY_VALUE_REJECTION', 'Promise was rejected with falsy value');
+E('ERR_ENCODING_NOT_SUPPORTED',
+  (enc) => `The "${enc}" encoding is not supported`);
+E('ERR_ENCODING_INVALID_ENCODED_DATA',
+  (enc) => `The encoded data was not valid for encoding ${enc}`);
 E('ERR_HTTP_HEADERS_SENT',
  'Cannot render headers after they are sent to the client');
 E('ERR_HTTP_INVALID_STATUS_CODE', 'Invalid status code: %s');
--- a/lib/util.js
+++ b/lib/util.js
@ -22,6 +22,7 @@
 'use strict';

 const errors = require('internal/errors');
+const { TextDecoder, TextEncoder } = require('internal/encoding');

 const { errname } = process.binding('uv');

@ -1125,6 +1126,8 @@ module.exports = exports = {
  isPrimitive,
  log,
  promisify,
+  TextDecoder,
+  TextEncoder,

  // Deprecated Old Stuff
  debug: deprecate(debug,
--- a/node.gyp
+++ b/node.gyp
@ -82,6 +82,7 @@
      'lib/internal/cluster/shared_handle.js',
      'lib/internal/cluster/utils.js',
      'lib/internal/cluster/worker.js',
+      'lib/internal/encoding.js',
      'lib/internal/errors.js',
      'lib/internal/freelist.js',
      'lib/internal/fs.js',
--- a/src/node_buffer.cc
+++ b/src/node_buffer.cc
@ -1200,6 +1200,27 @@ void Swap64(const FunctionCallbackInfo<Value>& args) {
 }


+// Encode a single string to a UTF-8 Uint8Array (not Buffer).
+// Used in TextEncoder.prototype.encode.
+static void EncodeUtf8String(const FunctionCallbackInfo<Value>& args) {
+  Environment* env = Environment::GetCurrent(args);
+  CHECK_GE(args.Length(), 1);
+  CHECK(args[0]->IsString());
+
+  Local<String> str = args[0].As<String>();
+  size_t length = str->Utf8Length();
+  char* data = node::UncheckedMalloc(length);
+  str->WriteUtf8(data,
+                 -1,   // We are certain that `data` is sufficiently large
+                 NULL,
+                 String::NO_NULL_TERMINATION | String::REPLACE_INVALID_UTF8);
+  auto array_buf = ArrayBuffer::New(env->isolate(), data, length,
+                                    ArrayBufferCreationMode::kInternalized);
+  auto array = Uint8Array::New(array_buf, 0, length);
+  args.GetReturnValue().Set(array);
+}
+
+
 // pass Buffer object to load prototype methods
 void SetupBufferJS(const FunctionCallbackInfo<Value>& args) {
  Environment* env = Environment::GetCurrent(args);
@ -1266,6 +1287,8 @@ void Initialize(Local<Object> target,
  env->SetMethod(target, "swap32", Swap32);
  env->SetMethod(target, "swap64", Swap64);

+  env->SetMethod(target, "encodeUtf8String", EncodeUtf8String);
+
  target->Set(env->context(),
              FIXED_ONE_BYTE_STRING(env->isolate(), "kMaxLength"),
              Integer::NewFromUnsigned(env->isolate(), kMaxLength)).FromJust();
--- a/src/node_i18n.cc
+++ b/src/node_i18n.cc
@ -50,6 +50,8 @@
 #include "env-inl.h"
 #include "util.h"
 #include "util-inl.h"
+#include "base-object.h"
+#include "base-object-inl.h"
 #include "v8.h"

 #include <unicode/utypes.h>
@ -86,10 +88,12 @@ namespace node {

 using v8::Context;
 using v8::FunctionCallbackInfo;
+using v8::HandleScope;
 using v8::Isolate;
 using v8::Local;
 using v8::MaybeLocal;
 using v8::Object;
+using v8::ObjectTemplate;
 using v8::String;
 using v8::Value;

@ -123,6 +127,15 @@ struct Converter {
    }
  }

+  explicit Converter(UConverter* converter,
+                     const char* sub = NULL) : conv(converter) {
+    CHECK_NE(conv, nullptr);
+    UErrorCode status = U_ZERO_ERROR;
+    if (sub != NULL) {
+      ucnv_setSubstChars(conv, sub, strlen(sub), &status);
+    }
+  }
+
  ~Converter() {
    ucnv_close(conv);
  }
@ -130,6 +143,143 @@ struct Converter {
  UConverter* conv;
 };

+class ConverterObject : public BaseObject, Converter {
+ public:
+  enum ConverterFlags {
+    CONVERTER_FLAGS_FLUSH      = 0x1,
+    CONVERTER_FLAGS_FATAL      = 0x2,
+    CONVERTER_FLAGS_IGNORE_BOM = 0x4
+  };
+
+  ~ConverterObject() override {}
+
+  static void Has(const FunctionCallbackInfo<Value>& args) {
+    Environment* env = Environment::GetCurrent(args);
+    HandleScope scope(env->isolate());
+
+    CHECK_GE(args.Length(), 1);
+    Utf8Value label(env->isolate(), args[0]);
+
+    UErrorCode status = U_ZERO_ERROR;
+    UConverter* conv = ucnv_open(*label, &status);
+    args.GetReturnValue().Set(!!U_SUCCESS(status));
+    ucnv_close(conv);
+  }
+
+  static void Create(const FunctionCallbackInfo<Value>& args) {
+    Environment* env = Environment::GetCurrent(args);
+    HandleScope scope(env->isolate());
+
+    CHECK_GE(args.Length(), 2);
+    Utf8Value label(env->isolate(), args[0]);
+    int flags = args[1]->Uint32Value(env->context()).ToChecked();
+    bool fatal =
+        (flags & CONVERTER_FLAGS_FATAL) == CONVERTER_FLAGS_FATAL;
+    bool ignoreBOM =
+        (flags & CONVERTER_FLAGS_IGNORE_BOM) == CONVERTER_FLAGS_IGNORE_BOM;
+
+    UErrorCode status = U_ZERO_ERROR;
+    UConverter* conv = ucnv_open(*label, &status);
+    if (U_FAILURE(status))
+      return;
+
+    if (fatal) {
+      status = U_ZERO_ERROR;
+      ucnv_setToUCallBack(conv, UCNV_TO_U_CALLBACK_STOP,
+                          nullptr, nullptr, nullptr, &status);
+    }
+
+    Local<ObjectTemplate> t = ObjectTemplate::New(env->isolate());
+    t->SetInternalFieldCount(1);
+    Local<Object> obj = t->NewInstance(env->context()).ToLocalChecked();
+    new ConverterObject(env, obj, conv, ignoreBOM);
+    args.GetReturnValue().Set(obj);
+  }
+
+  static void Decode(const FunctionCallbackInfo<Value>& args) {
+    Environment* env = Environment::GetCurrent(args);
+
+    CHECK_GE(args.Length(), 3);  // Converter, Buffer, Flags
+
+    Converter utf8("utf8");
+    ConverterObject* converter;
+    ASSIGN_OR_RETURN_UNWRAP(&converter, args[0].As<Object>());
+    SPREAD_BUFFER_ARG(args[1], input_obj);
+    int flags = args[2]->Uint32Value(env->context()).ToChecked();
+
+    UErrorCode status = U_ZERO_ERROR;
+    MaybeStackBuffer<UChar> result;
+    MaybeLocal<Object> ret;
+    size_t limit = ucnv_getMinCharSize(converter->conv) *
+                   input_obj_length;
+    if (limit > 0)
+      result.AllocateSufficientStorage(limit);
+
+    UBool flush = (flags & CONVERTER_FLAGS_FLUSH) == CONVERTER_FLAGS_FLUSH;
+
+    const char* source = input_obj_data;
+    size_t source_length = input_obj_length;
+
+    if (converter->unicode_ && !converter->ignoreBOM_ && !converter->bomSeen_) {
+      int32_t bomOffset = 0;
+      ucnv_detectUnicodeSignature(source, source_length, &bomOffset, &status);
+      source += bomOffset;
+      source_length -= bomOffset;
+      converter->bomSeen_ = true;
+    }
+
+    UChar* target = *result;
+    ucnv_toUnicode(converter->conv,
+                   &target, target + (limit * sizeof(UChar)),
+                   &source, source + source_length,
+                   NULL, flush, &status);
+
+    if (U_SUCCESS(status)) {
+      if (limit > 0)
+        result.SetLength(target - &result[0]);
+      ret = ToBufferEndian(env, &result);
+      args.GetReturnValue().Set(ret.ToLocalChecked());
+      goto reset;
+    }
+
+    args.GetReturnValue().Set(status);
+
+   reset:
+    if (flush) {
+      // Reset the converter state
+      converter->bomSeen_ = false;
+      ucnv_reset(converter->conv);
+    }
+  }
+
+ protected:
+  ConverterObject(Environment* env,
+                  v8::Local<v8::Object> wrap,
+                  UConverter* converter,
+                  bool ignoreBOM,
+                  const char* sub = NULL) :
+                  BaseObject(env, wrap),
+                  Converter(converter, sub),
+                  ignoreBOM_(ignoreBOM) {
+    MakeWeak<ConverterObject>(this);
+
+    switch (ucnv_getType(converter)) {
+      case UCNV_UTF8:
+      case UCNV_UTF16_BigEndian:
+      case UCNV_UTF16_LittleEndian:
+        unicode_ = true;
+        break;
+      default:
+        unicode_ = false;
+    }
+  }
+
+ private:
+  bool unicode_ = false;     // True if this is a Unicode converter
+  bool ignoreBOM_ = false;   // True if the BOM should be ignored on Unicode
+  bool bomSeen_ = false;     // True if the BOM has been seen
+};
+
 // One-Shot Converters

 void CopySourceBuffer(MaybeStackBuffer<UChar>* dest,
@ -717,6 +867,11 @@ void Init(Local<Object> target,
  // One-shot converters
  env->SetMethod(target, "icuErrName", ICUErrorName);
  env->SetMethod(target, "transcode", Transcode);
+
+  // ConverterObject
+  env->SetMethod(target, "getConverter", ConverterObject::Create);
+  env->SetMethod(target, "decode", ConverterObject::Decode);
+  env->SetMethod(target, "hasConverter", ConverterObject::Has);
 }

 }  // namespace i18n
--- a/src/node_i18n.h
+++ b/src/node_i18n.h
@ -25,6 +25,7 @@
 #if defined(NODE_WANT_INTERNALS) && NODE_WANT_INTERNALS

 #include "node.h"
+#include <unicode/ucnv.h>
 #include <string>

 #if defined(NODE_HAVE_I18N_SUPPORT)
--- a/src/node_util.cc
+++ b/src/node_util.cc
@ -21,6 +21,7 @@ using v8::Value;


 #define VALUE_METHOD_MAP(V)                                                   \
+  V(isArrayBuffer, IsArrayBuffer)                                             \
  V(isAsyncFunction, IsAsyncFunction)                                         \
  V(isDataView, IsDataView)                                                   \
  V(isDate, IsDate)                                                           \
--- a/test/parallel/test-whatwg-encoding.js
+++ b/test/parallel/test-whatwg-encoding.js
@ -0,0 +1,385 @@
+// Flags: --expose-internals
+'use strict';
+
+const common = require('../common');
+const assert = require('assert');
+const { TextEncoder, TextDecoder } = require('util');
+const { customInspectSymbol: inspect } = require('internal/util');
+const { getEncodingFromLabel } = require('internal/encoding');
+
+const encoded = Buffer.from([0xef, 0xbb, 0xbf, 0x74, 0x65,
+                             0x73, 0x74, 0xe2, 0x82, 0xac]);
+
+if (!common.hasIntl) {
+  common.skip('WHATWG Encoding tests because ICU is not present.');
+}
+
+// Make Sure TextDecoder and TextEncoder exist
+assert(TextDecoder);
+assert(TextEncoder);
+
+// Test TextEncoder
+const enc = new TextEncoder();
+assert(enc);
+const buf = enc.encode('\ufefftest€');
+
+assert.strictEqual(Buffer.compare(buf, encoded), 0);
+
+
+// Test TextDecoder, UTF-8, fatal: false, ignoreBOM: false
+{
+  ['unicode-1-1-utf-8', 'utf8', 'utf-8'].forEach((i) => {
+    const dec = new TextDecoder(i);
+    const res = dec.decode(buf);
+    assert.strictEqual(res, 'test€');
+  });
+
+  ['unicode-1-1-utf-8', 'utf8', 'utf-8'].forEach((i) => {
+    const dec = new TextDecoder(i);
+    let res = '';
+    res += dec.decode(buf.slice(0, 8), { stream: true });
+    res += dec.decode(buf.slice(8));
+    assert.strictEqual(res, 'test€');
+  });
+}
+
+// Test TextDecoder, UTF-8, fatal: false, ignoreBOM: true
+{
+  ['unicode-1-1-utf-8', 'utf8', 'utf-8'].forEach((i) => {
+    const dec = new TextDecoder(i, { ignoreBOM: true });
+    const res = dec.decode(buf);
+    assert.strictEqual(res, '\ufefftest€');
+  });
+
+  ['unicode-1-1-utf-8', 'utf8', 'utf-8'].forEach((i) => {
+    const dec = new TextDecoder(i, { ignoreBOM: true });
+    let res = '';
+    res += dec.decode(buf.slice(0, 8), { stream: true });
+    res += dec.decode(buf.slice(8));
+    assert.strictEqual(res, '\ufefftest€');
+  });
+}
+
+// Test TextDecoder, UTF-8, fatal: true, ignoreBOM: false
+{
+  ['unicode-1-1-utf-8', 'utf8', 'utf-8'].forEach((i) => {
+    const dec = new TextDecoder(i, { fatal: true });
+    assert.throws(() => dec.decode(buf.slice(0, 8)),
+                  common.expectsError({
+                    code: 'ERR_ENCODING_INVALID_ENCODED_DATA',
+                    type: TypeError,
+                    message:
+                      /^The encoded data was not valid for encoding utf-8$/
+                  }));
+  });
+
+  ['unicode-1-1-utf-8', 'utf8', 'utf-8'].forEach((i) => {
+    const dec = new TextDecoder(i, { fatal: true });
+    assert.doesNotThrow(() => dec.decode(buf.slice(0, 8), { stream: true }));
+    assert.doesNotThrow(() => dec.decode(buf.slice(8)));
+  });
+}
+
+// Test TextDecoder, UTF-16le
+{
+  const dec = new TextDecoder('utf-16le');
+  const res = dec.decode(Buffer.from('test€', 'utf-16le'));
+  assert.strictEqual(res, 'test€');
+}
+
+// Test TextDecoder, UTF-16be
+{
+  const dec = new TextDecoder('utf-16be');
+  const res = dec.decode(Buffer.from([0x00, 0x74, 0x00, 0x65, 0x00,
+                                      0x73, 0x00, 0x74, 0x20, 0xac]));
+  assert.strictEqual(res, 'test€');
+}
+
+{
+  const fn = TextDecoder.prototype[inspect];
+  fn.call(new TextDecoder(), Infinity, {});
+
+  [{}, [], true, 1, '', new TextEncoder()].forEach((i) => {
+    assert.throws(() => fn.call(i, Infinity, {}),
+                  common.expectsError({
+                    code: 'ERR_INVALID_THIS',
+                    message: 'Value of "this" must be of type TextDecoder'
+                  }));
+  });
+}
+
+{
+  const fn = TextEncoder.prototype[inspect];
+  fn.call(new TextEncoder(), Infinity, {});
+
+  [{}, [], true, 1, '', new TextDecoder()].forEach((i) => {
+    assert.throws(() => fn.call(i, Infinity, {}),
+                  common.expectsError({
+                    code: 'ERR_INVALID_THIS',
+                    message: 'Value of "this" must be of type TextEncoder'
+                  }));
+  });
+}
+
+// Test Encoding Mappings
+{
+
+  const mappings = {
+    'utf-8': [
+      'unicode-1-1-utf-8',
+      'utf8'
+    ],
+    'utf-16be': [],
+    'utf-16le': [
+      'utf-16'
+    ],
+    'ibm866': [
+      '866',
+      'cp866',
+      'csibm866'
+    ],
+    'iso-8859-2': [
+      'csisolatin2',
+      'iso-ir-101',
+      'iso8859-2',
+      'iso88592',
+      'iso_8859-2',
+      'iso_8859-2:1987',
+      'l2',
+      'latin2'
+    ],
+    'iso-8859-3': [
+      'csisolatin3',
+      'iso-ir-109',
+      'iso8859-3',
+      'iso88593',
+      'iso_8859-3',
+      'iso_8859-3:1988',
+      'l3',
+      'latin3'
+    ],
+    'iso-8859-4': [
+      'csisolatin4',
+      'iso-ir-110',
+      'iso8859-4',
+      'iso88594',
+      'iso_8859-4',
+      'iso_8859-4:1988',
+      'l4',
+      'latin4'
+    ],
+    'iso-8859-5': [
+      'csisolatincyrillic',
+      'cyrillic',
+      'iso-ir-144',
+      'iso8859-5',
+      'iso88595',
+      'iso_8859-5',
+      'iso_8859-5:1988'
+    ],
+    'iso-8859-6': [
+      'arabic',
+      'asmo-708',
+      'csiso88596e',
+      'csiso88596i',
+      'csisolatinarabic',
+      'ecma-114',
+      'iso-8859-6-e',
+      'iso-8859-6-i',
+      'iso-ir-127',
+      'iso8859-6',
+      'iso88596',
+      'iso_8859-6',
+      'iso_8859-6:1987'
+    ],
+    'iso-8859-7': [
+      'csisolatingreek',
+      'ecma-118',
+      'elot_928',
+      'greek',
+      'greek8',
+      'iso-ir-126',
+      'iso8859-7',
+      'iso88597',
+      'iso_8859-7',
+      'iso_8859-7:1987',
+      'sun_eu_greek'
+    ],
+    'iso-8859-8': [
+      'csiso88598e',
+      'csisolatinhebrew',
+      'hebrew',
+      'iso-8859-8-e',
+      'iso-ir-138',
+      'iso8859-8',
+      'iso88598',
+      'iso_8859-8',
+      'iso_8859-8:1988',
+      'visual'
+    ],
+    'iso-8859-8-i': [
+      'csiso88598i',
+      'logical'
+    ],
+    'iso-8859-10': [
+      'csisolatin6',
+      'iso-ir-157',
+      'iso8859-10',
+      'iso885910',
+      'l6',
+      'latin6'
+    ],
+    'iso-8859-13': [
+      'iso8859-13',
+      'iso885913'
+    ],
+    'iso-8859-14': [
+      'iso8859-14',
+      'iso885914'
+    ],
+    'iso-8859-15': [
+      'csisolatin9',
+      'iso8859-15',
+      'iso885915',
+      'iso_8859-15',
+      'l9'
+    ],
+    'koi8-r': [
+      'cskoi8r',
+      'koi',
+      'koi8',
+      'koi8_r'
+    ],
+    'koi8-u': [
+      'koi8-ru'
+    ],
+    'macintosh': [
+      'csmacintosh',
+      'mac',
+      'x-mac-roman'
+    ],
+    'windows-874': [
+      'dos-874',
+      'iso-8859-11',
+      'iso8859-11',
+      'iso885911',
+      'tis-620'
+    ],
+    'windows-1250': [
+      'cp1250',
+      'x-cp1250'
+    ],
+    'windows-1251': [
+      'cp1251',
+      'x-cp1251'
+    ],
+    'windows-1252': [
+      'ansi_x3.4-1968',
+      'ascii',
+      'cp1252',
+      'cp819',
+      'csisolatin1',
+      'ibm819',
+      'iso-8859-1',
+      'iso-ir-100',
+      'iso8859-1',
+      'iso88591',
+      'iso_8859-1',
+      'iso_8859-1:1987',
+      'l1',
+      'latin1',
+      'us-ascii',
+      'x-cp1252'
+    ],
+    'windows-1253': [
+      'cp1253',
+      'x-cp1253'
+    ],
+    'windows-1254': [
+      'cp1254',
+      'csisolatin5',
+      'iso-8859-9',
+      'iso-ir-148',
+      'iso8859-9',
+      'iso88599',
+      'iso_8859-9',
+      'iso_8859-9:1989',
+      'l5',
+      'latin5',
+      'x-cp1254'
+    ],
+    'windows-1255': [
+      'cp1255',
+      'x-cp1255'
+    ],
+    'windows-1256': [
+      'cp1256',
+      'x-cp1256'
+    ],
+    'windows-1257': [
+      'cp1257',
+      'x-cp1257'
+    ],
+    'windows-1258': [
+      'cp1258',
+      'x-cp1258'
+    ],
+    'x-mac-cyrillic': [
+      'x-mac-ukrainian'
+    ],
+    'gbk': [
+      'chinese',
+      'csgb2312',
+      'csiso58gb231280',
+      'gb2312',
+      'gb_2312',
+      'gb_2312-80',
+      'iso-ir-58',
+      'x-gbk'
+    ],
+    'gb18030': [ ],
+    'big5': [
+      'big5-hkscs',
+      'cn-big5',
+      'csbig5',
+      'x-x-big5'
+    ],
+    'euc-jp': [
+      'cseucpkdfmtjapanese',
+      'x-euc-jp'
+    ],
+    'iso-2022-jp': [
+      'csiso2022jp'
+    ],
+    'shift_jis': [
+      'csshiftjis',
+      'ms932',
+      'ms_kanji',
+      'shift-jis',
+      'sjis',
+      'windows-31j',
+      'x-sjis'
+    ],
+    'euc-kr': [
+      '  euc-kr  \t',
+      'EUC-kr  \n',
+      'cseuckr',
+      'csksc56011987',
+      'iso-ir-149',
+      'korean',
+      'ks_c_5601-1987',
+      'ks_c_5601-1989',
+      'ksc5601',
+      'ksc_5601',
+      'windows-949'
+    ]
+  };
+  Object.entries(mappings).forEach((i) => {
+    const enc = i[0];
+    const labels = i[1];
+    assert.strictEqual(getEncodingFromLabel(enc), enc);
+    labels.forEach((l) => assert.strictEqual(getEncodingFromLabel(l), enc));
+  });
+
+  assert.strictEqual(getEncodingFromLabel('made-up'), undefined);
+}
--- a/tools/icu/icu-generic.gyp
+++ b/tools/icu/icu-generic.gyp
@ -30,15 +30,6 @@
      'type': 'none',
      'toolsets': [ 'host', 'target' ],
      'direct_dependent_settings': {
-        'conditions': [
-          [ 'icu_endianness == "l"', {
-             'defines': [
-                # ICU cannot swap the initial data without this.
-                # http://bugs.icu-project.org/trac/ticket/11046
-                'UCONFIG_NO_LEGACY_CONVERSION=1'
-             ],
-          }],
-        ],
        'defines': [
          'UCONFIG_NO_SERVICE=1',
          'UCONFIG_NO_REGULAR_EXPRESSIONS=1',