TC39 Proposal: Dynamic Code Brand Checks

Status
TL;DR
Motivation
Problem 1: %eval% does not accept objects in lieu of strings for code
Problem 2: Host callout does not receive type information
Problem 3: Host callout does not receive the code to check
Problem 4: Host callout cannot adjust values
Tests

Status

Champion(s): koto
Author(s): mikesamuel, koto
Stage: 1
Spec: ecmarkup output, source

TL;DR

Allow hosts to create code-like objects and change HostEnsureCanCompileStrings(callerRealm, calleeRealm) to HostValidateDynamicCode(callerRealm, calleeRealm, codeString, wasCodeLike). Hosts can change the code string to be compiled.

Motivation

eval is Evil

The eval function is the most misused feature of JavaScript. Avoid it.

-- Douglas Crockford, "JavaScript: The Good Parts"

eval and its friend new Function are problematic because, too often, an attacker can turn them against the application.

Most code avoids eval, but JS programs are no longer small and self-contained, as they were when Crockford wrote that.

If one module uses eval, even for something as simple as Function('return this')() to get a handle to the global object, then eval has to work.

This prevents the use of security measures like:

Content-Security-Policy
node --disallow_code_generation_from_strings

which turn off eval globally.

As JavaScript programs get larger, the chance that no part of them needs eval or Function() to operate gets smaller.

It is difficult in JavaScript for a code reviewer to determine that code never uses these operators. For example, the expression below can invoke Function when x is 'constructor'.

({}[x][x](y)());

// ({})[x]       === Object
// ({})[x][x]    === Function
// ({})[x][x](y)  ~  Function(y)

So code review and developer training are unlikely to prevent abuse of these operators.
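
The lookup chain in the comments above can be checked directly. The snippet below is a small illustration, with x and y set to sample values chosen for this example; it shows that neither eval nor Function has to be named in the source for dynamic code to run.

```javascript
// Sample values chosen for illustration only.
const x = "constructor";
const y = "return 6 * 7";

// Walk the same property chain as the one-liner above.
const objectCtor = ({})[x];          // Object
const functionCtor = objectCtor[x];  // Function

// Compiles and runs `y` without naming eval or Function anywhere.
const result = functionCtor(y)();
console.log(objectCtor === Object, functionCtor === Function, result);
// prints: true true 42
```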

This proposal aims to solve the problem by providing more context to host environments so that they can make finer-grained trust decisions. Since strings in JavaScript programs often come from untrusted sources (for example, DOM-based XSS in the web platform), the trust decisions might be based on brand-checking objects that represent the code the application author trusts to be evaluated.

Trusted Types

The Trusted Types proposal (explainer for TC39) seeks to guard risky operations like dynamic code evaluation by requiring that code portions have a runtime type that indicates that they have been explicitly trusted.

Specifically, when Trusted Types enforcement is turned on, it would like to ensure that arguments to eval() and new Function() are (or are convertible to) a TrustedScript - an object wrapping around a string stored in an internal, immutable slot. This enhances the existing dynamic code evaluation guards that the web platform already has with CSP (Trusted Types also integrates with other CSP mechanisms).

This proposal seeks to solve the following problems:

Problem 1: %eval% does not accept objects in lieu of strings for code

Currently, typeof x === 'string' || eval(x) === x. Eval exits early when its argument is not a string.

For example:

console.log(eval(123) === 123);

const array = ["alert(1)"];
console.log(eval(array) === array); // Does not alert

const touchy = {
  toString() {
    throw new Error();
  },
};
console.log(eval(touchy) === touchy); // Does not throw.

This follows from step 2 of the definition of PerformEval:

Runtime Semantics: PerformEval ( x, evalRealm, strictCaller, direct )

The abstract operation PerformEval with arguments x, evalRealm, strictCaller, and direct performs the following steps:

1. Assert: If direct is false, then strictCaller is also false.
2. If Type(x) is not String, return x.

Backwards compatibility constraints

To avoid breaking the web, eval(x) should do nothing different when x is a non-string value that existing applications might create.

Solution

Define a spec abstraction, IsCodeLike, that allows some object values through but without changing the semantics of pre-existing programs.

Define IsCodeLike(x), a spec abstraction that returns true for objects containing a host-defined internal slot ([[HostDefinedCodeLike]]).
Tweak PerformEval, which is called by both direct and indirect eval, to use IsCodeLike(x) to extract a string from code-like objects before the (Type(x) is String) check. Code-like objects will not cause the early exit.
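
One way to picture these two steps is the sketch below. It is a userland approximation, not spec text: a WeakMap named codeLikeSlot stands in for the [[HostDefinedCodeLike]] internal slot, and hostCreateCodeLike, isCodeLike, and performEvalSketch are hypothetical names invented for this example.

```javascript
// Stand-in for the [[HostDefinedCodeLike]] internal slot (assumption:
// a real host would use an internal slot, not a WeakMap).
const codeLikeSlot = new WeakMap();

// Hypothetical host-defined API: the only way to mint a code-like value.
function hostCreateCodeLike(source) {
  const value = Object.freeze({});
  codeLikeSlot.set(value, String(source));
  return value;
}

// Mirrors IsCodeLike(x): true only for objects carrying the slot.
function isCodeLike(x) {
  return codeLikeSlot.has(x); // has() returns false for primitives
}

// Mirrors the tweaked PerformEval: unwrap code-like values first, then
// apply the existing "not a String, return x" early exit.
function performEvalSketch(x) {
  if (isCodeLike(x)) {
    x = codeLikeSlot.get(x);
  }
  if (typeof x !== "string") {
    return x;
  }
  return (0, eval)(x); // indirect eval of the extracted string
}
```

Under these assumptions, performEvalSketch(["1 + 2"]) still returns the array untouched, while performEvalSketch(hostCreateCodeLike("1 + 2")) evaluates the wrapped string. A Proxy wrapped around a code-like value does not pass isCodeLike, because the check looks at the object itself rather than at anything a proxy can forward.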

Pros

IsCodeLike has no observable side effect for values. eval is backwards compatible when a program produces no code-like values.
User code can only create code-like values via host-defined APIs.

Cons

Can't be tested in test262 (code-like object creation is host-defined).
Code-like values are not proxyable.

Problem 2: Host callout does not receive type information

Currently the information available to decide whether to allow compilation of a string is a pair of realms.

HostEnsureCanCompileStrings( callerRealm, calleeRealm )

HostEnsureCanCompileStrings is an implementation-defined abstract operation that allows host environments to block certain ECMAScript functions which allow developers to compile strings into ECMAScript code.

For the hosts to be able to guard dynamic code evaluation effectively, additional type information is needed.

Solution

This proposal aims to provide additional context to HostEnsureCanCompileStrings, and to reorder the steps in CreateDynamicFunction so that HostEnsureCanCompileStrings receives runtime type information - namely, whether the eval argument, or every Function constructor argument, passed the IsCodeLike check.
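
As a rough illustration of the type information involved, the sketch below computes a single wasCodeLike flag and stringifies the arguments before any host callout would run. The WeakSet named codeLikeObjects stands in for the internal slot, and hostMakeCodeLike, describeArguments, and examplePolicy are hypothetical names used only here. How the real spec text treats an empty argument list is not modeled.

```javascript
// Stand-in for the host-defined internal slot (assumption).
const codeLikeObjects = new WeakSet();

// Hypothetical host API that mints a code-like wrapper around a string.
function hostMakeCodeLike(source) {
  const value = { toString: () => source };
  codeLikeObjects.add(value);
  return Object.freeze(value);
}

// Computes the flag and the strings in one pass, so a host policy only
// ever sees a boolean and strings, never the objects themselves.
function describeArguments(args) {
  // WeakSet.prototype.has returns false for primitives such as strings.
  const wasCodeLike = args.every((arg) => codeLikeObjects.has(arg));
  // Template literals apply ToString, as the spec steps would.
  const strings = args.map((arg) => `${arg}`);
  return { wasCodeLike, strings };
}

// Hypothetical host policy: only allow code built from code-like values.
function examplePolicy({ wasCodeLike }) {
  if (!wasCodeLike) {
    throw new EvalError("dynamic code must come from code-like values");
  }
}
```

With this shape, describeArguments(["a", "return a"]) reports wasCodeLike as false, and a single plain string among otherwise code-like arguments is enough to turn the flag off.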

Pros

Simple host interface; does not require passing objects to the host.
No TOCTOU issues - a code-like object that passes the check is stringified immediately, before the host callout happens.

Cons

Requires changes to the host callout (see also below).

Problem 3: Host callout does not receive the code to check

HostEnsureCanCompileStrings only passes the realms. In the web platform, that callout is hooked to the Content Security Policy algorithms, which take action based on the code that is to be executed (e.g. an eval() argument or a dynamically created function body) so that the code can be included in violation reports (CSP3's issue 8).

As such, some implementations (V8 and SpiderMonkey) also pass the code string to the host and, in the case of new Function(), perform the callout later in CreateDynamicFunction, after the function body is assembled.

Solution

This proposal updates the host callout to contain the code string to be executed, and moves the callout in CreateDynamicFunction after the function body is assembled.
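
A minimal sketch of that flow for new Function follows. The wrapper text mirrors how CreateDynamicFunction assembles source for ordinary functions; whether the callout receives exactly this string is an assumption of the sketch, and createDynamicFunctionSketch and the single-argument hostCallout are hypothetical names.

```javascript
// Hypothetical model of the reordered CreateDynamicFunction steps.
function createDynamicFunctionSketch(hostCallout, ...args) {
  // Stringify every argument first, so the full source is known.
  const strings = args.map((arg) => `${arg}`);
  const body = strings.length > 0 ? strings[strings.length - 1] : "";
  const params = strings.slice(0, -1);

  // Assemble the source text the way ordinary dynamic functions are built.
  const codeString = `function anonymous(${params.join(",")}\n) {\n${body}\n}`;

  // The host sees the assembled code and may throw to block compilation.
  hostCallout(codeString);

  return new Function(...params, body);
}

// Usage: a permissive host that records what it was asked to validate.
const seen = [];
const add = createDynamicFunctionSketch(
  (code) => { seen.push(code); },
  "a", "b", "return a + b"
);
console.log(add(2, 3)); // 5
console.log(seen[0].includes("return a + b")); // true
```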

Spec / implementation mismatch

Moving the callout in CreateDynamicFunction changes the behavior in implementations that specify a non-default host callout.

Currently the stringifier should not execute if the host disables the string compilation:

new Function({
  toString: () => {
    throw "Should not happen, as the callout would reject this earlier";
  },
});

Current implementations differ in behavior. For example, V8 and SpiderMonkey are not ES spec-compliant, while JSC follows the spec (in-browser proof of concept).

This proposal would cause the stringifier to execute before the callout, making it possible for the browser hosts to actually follow the CSP spec.
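
The difference between the two orderings can be made visible by logging events. In this sketch, currentSpecOrder, proposedOrder, rejectingHost, and run are hypothetical helpers that model only the relative order of the callout and the stringifier, with a host that rejects all string compilation.

```javascript
// Current spec text: callout first, without access to the code.
function currentSpecOrder(hostCallout, arg, log) {
  hostCallout(log);
  return `${arg}`; // only reached if the host allowed compilation
}

// Proposed ordering: stringify first, then hand the code to the host.
function proposedOrder(hostCallout, arg, log) {
  const code = `${arg}`;
  hostCallout(log, code);
  return code;
}

// Hypothetical host that blocks everything and records what it saw.
function rejectingHost(log, code) {
  log.push(code === undefined ? "callout()" : `callout(${code})`);
  throw new EvalError("string compilation is disabled");
}

// Runs one ordering against an argument with an observable stringifier.
function run(order) {
  const log = [];
  const arg = {
    toString() {
      log.push("toString");
      return "return 1";
    },
  };
  try {
    order(rejectingHost, arg, log);
  } catch (e) {
    log.push(e.name);
  }
  return log.join(" -> ");
}

console.log(run(currentSpecOrder)); // callout() -> EvalError
console.log(run(proposedOrder));    // toString -> callout(return 1) -> EvalError
```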

Problem 4: Host callout cannot adjust values

Trusted Types defines a default policy to make it easier to migrate applications that pass around strings.

This policy is invoked when a string value reaches a sink, so any default policy's createScript callback is invoked on eval(myString).

If the callback throws an error, then eval should be blocked. But if the callback returns a value, result, then ToString(result) should be used as the source text to parse.

For example, if the default policy's createScript callback returns "output" given "input" then eval("input") would load and run a |ScriptBody| parsed from the source text output.

<script>
  // Define a default policy that maps the source text `input` to `output`.
  window.trustedTypes.createPolicy("default", {
    createScript(code) {
      if (code === "input") {
        return "output";
      }
      throw new Error("blocked script execution");
    },
  });

  globalThis.input = 1;
  globalThis.output = 2;
  // The source text loaded is `output`
  eval("input") === globalThis.output; // true
</script>

Being able to adjust the code that runs provides the maximum flexibility when dealing with a thorny legacy module that might otherwise prevent the entire application from running with XSS protections enabled.

Solution

This proposal adjusts eval and new Function to expect the host callout to return string values, and to use those in place of the inputs.
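
As a sketch of that change for eval, the wrapper below passes the string to a callout and evaluates whatever the callout returns. evalWithCallout and defaultPolicyCallout are hypothetical names; the callout imitates the default policy from the example above rather than any real host API.

```javascript
// Hypothetical callout imitating the default policy shown earlier:
// maps the source text `input` to `output` and blocks everything else.
function defaultPolicyCallout(codeString) {
  if (codeString === "input") {
    return "output";
  }
  throw new Error("blocked script execution");
}

// Hypothetical model of eval using the callout's return value.
function evalWithCallout(hostCallout, x) {
  if (typeof x !== "string") {
    return x; // existing early exit for non-strings
  }
  const adjusted = hostCallout(x); // throws to block
  return (0, eval)(`${adjusted}`); // ToString(result) becomes the source
}

globalThis.input = 1;
globalThis.output = 2;
console.log(evalWithCallout(defaultPolicyCallout, "input")); // 2
```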

Tests

Related tests at

but testing these requires affecting the behavior of host callouts, so they will probably need to be specified as web-platform-tests.
