JavaScript Structs: Fixed Layout Objects
Stage: 1
Author: Shu-yu Guo (@syg)
Champion: Shu-yu Guo (@syg), Ron Buckton (@rbuckton), Asumu Takikawa (@takikawa), Keith Miller (@kmiller68)
Introduction
Structs are declarative sealed objects. There are two kinds of structs: plain structs and shared structs. Plain structs behave as if they were sealed objects. Shared structs have additional restrictions and can be concurrently accessed from different agents.
All structs have the following properties:
- Opaque storage like plain objects. Not aliasable via
ArrayBuffer
orSharedArrayBuffer
. - Instances have all fields initialized in one shot, then sealed. The engine must be able to fix a layout that is unchanging. This implies that all superclasses must also be structs.
- Transitively immutable [[Prototype]] slot. A struct instance's [[Prototype]] slot is immutable, as are the [[Prototype]] slot of every object on its prototype chain.
Shared structs have the following additional properties:
- Only data fields are allowed. Getters, setters, and methods are disallowed.
- All superclasses must be shared structs.
- Shared structs only reference other primitives or shared structs.
- Shared structs throw on accessing the [[Prototype]] slot.
- Shared structs do not have a
.constructor
property.
This proposal is intended to be minimal.
Structs can be designed with or without novel syntax. For brevity of presentation, examples are given using class
syntax with a struct
or shared struct
qualifier.
A minimal plain struct example.
struct class Box {
constructor(x) { this.x = x; }
x;
}
let box = new Box();
box.x = 42; // x is declared
assertThrows(() => { box.y = 8.8; }); // structs are sealed
assertThrows(() => { box.__proto__ = {} }); // structs are sealed
A minimal shared struct example.
// main.js
shared struct class SharedBox {
constructor(x) { this.x = x; }
x;
}
let sharedBox = new SharedBox();
let sharedBox2 = new SharedBox();
sharedBox.x = 42; // x is declared and rhs is primitive
sharedBox.x = sharedBox2; // x is declared and rhs is shared
assertThrows(() => { sharedBox.x = {}; }) // rhs is not a shared struct
let worker = new Worker('worker.js');
worker.postMessage({ sharedBox });
sharedBox.x = "main"; // x is declared and rhs is primitive
console.log(sharedBox.x);
// worker.js
onmessage = function(e) {
let sharedBox = e.data.sharedBox;
sharedBox.x = "worker"; // x is declared and rhs is primitive
console.log(sharedBox.x);
};
The above program is permitted to print out any interleaving:
- main main
- main worker
- worker worker
- worker main
Motivation and requirements
Shared memory for concurrency
This proposal seeks to enable more shared memory concurrency for a more concurrent future. Like other shared memory features in JavaScript, it is high in expressive power and high in difficulty to use correctly. This proposal is both intended as an incremental step towards higher-level, easier-to-use (e.g. data-race free by construction) concurrency abstractions as well as an escape hatch for expert programmers who need the expressivity.
The two design principles that this proposal follows are:
- Syntax that looks atomic ought to be atomic. (For example, the dot operator on shared structs should only access an existing field and does not tear.)
- There are no references from shared objects to non-shared objects. The shared and non-shared heaps are conceptually separate, with references only going one way.
WasmGC interoperability
The WasmGC proposal adds fixed layout, garbage-collected objects to Wasm. While the details of the type system of these objects are yet to be nailed down, interoperability with JavaScript will be important.
WasmGC objects have opaque storage and are not aliased by linear memory, so they cannot be exposed as all Wasm memory is exposed today via ArrayBuffer
s. We propose structs to be the reflection of WasmGC objects in JS.
WasmGC objects exposed to JS should behave the same as structs, modulo extra type checking that WasmGC require that JS structs do not. JS structs is also a good foundation for reflecting into Wasm as WasmGC objects, but that is currently left as future work as it may need a typed field extensions to be worthwhile.
Further, WasmGC itself will eventually have multithreading. It behooves us to maintain a single memory model between JavaScript and Wasm as we have today, even with higher-level object abstractions.
Predictable instance performance
Objects that are declared with a fixed layout help engines to have more predictable performance. A fixed layout object also lays groundwork for future refinement, such as typed fields.
Possibly in-scope
The following are not currently in scope to maintain the proposal's minimalism. However, they may be brought into the scope of the proposal if they are deemed requirements for ergonomics.
Arrays
For structured data, ergonomic ways to declare inline fixed-length fields seems desirable. Additionally, shared arrays also seem similarly desirable.
Out-of-scope
Value semantics, immutability, and operator overloading
This proposal does not intend to explore the space of objects with value semantics, including immutability and operator overloading. Structs have identity like other objects and are designed to be used like other objects. Value semantics is a sufficient departure that it may be better solved with other proposals that focus on that space.
Sophisticated type systems
This proposal does not intend to explore sophisticated type and runtime guard systems. It is even more minimal than the closest spiritual ancestor, the Typed Objects proposal, in that we do not propose integral types for sized fields. (Typed and sized fields are reserved for future work.)
Binary data overlay views
This proposal does not intend to explore the space of overlaying structured views on binary data in an ArrayBuffer
. This is a requirement arising from the desire for WasmGC integration, and WasmGC objects are similarly opaque.
Structured overlays are fundamentally about aliasing memory, which we feel is both a different problem domain and sufficiently solvable today in userland. For example, see buffer-backed objects.
Proposal
This proposal can be developed with or without novel syntax. It is presented with novel syntax below.
Plain structs
Plain structs are declared with the contextual struct
keyword in front of a class declaration.
struct class
expressions parse the same as plain class
expressions.
During evaluation of a struct class
expression, the following checks are performed.
- If there is an
extends
clause and the superclass is not astruct class
, throw aTypeError
When a struct class
constructor is invoked, it creates instances with the following properties:
- All instance fields, including those from any superclasses, are defined and initialized to
undefined
before the newly constructed instance escapes to the constructor function (aka "one-shot") - Instances are sealed
- Instances' [[Prototype]] slot is immutable after initialization
Shared structs
Shared structs are declared with the contextual shared struct
keywords in front of a class declaration. Intuitively, they behave as very restricted structs.
shared struct class
expressions throw an early error if the following forms are encountered.
- method
- getter
- setter
- field initializers
During evaluation of a struct class
expression, the following checks are performed.
- If there is an
extends
clause and the superclass is not ashared struct class
, throw aTypeError
When a struct class
constructor is invoked, it creates instances with the following properties:
- All instance fields are defined and initialized to
undefined
before the newly constructed instance escapes to the constructor function (aka "one-shot") - Instances are sealed
- Instances' [[Prototype]] slot throw a TypeError when accessed
- Instances do not have a
.constructor
property - Instances are shared instead of copied with other agents
- Instances' identities are preserved when communicated between agents
- When a value is assigned to an instance field, if it is neither a primitive nor a shared struct instance or is a
Symbol
, throw aTypeError
- Instances' field accesses are unordered shared memory accesses and must not tear (i.e. another agent must not observe a partially written value)
The following Atomics
methods will be extended to accept shared struct instances as the first argument and a field name as the second argument, to support sequentially consistent accesses.
Atomics.load
Atomics.store
Atomics.exchange
Atomics.compareExchange
Note that the arithmetic Atomics
methods like Atomics.add
are not included because there isn't widespread ISA support for atomic read-modify-write of floating point values. The workaround is to compute locally and store with Atomics.compareExchange
.
Implementation guidance
Immutable shapes
Structs are declared with fixed layout up front. Engines should make an immutable shape for such objects. Optimizers can optimize field accesses without worrying about deopts.
Shared structs: make sure fields are pointer-width and aligned
Shared structs should store fields such that underlying architectures can perform atomic stores and loads. This usually means the fields should be at least pointer-width and aligned.
Shared structs: strings will be difficult
Except for strings, sharing primitives in the engine is usually trivial, especially for NaN-boxing implementations.
Strings in production engines have in-place mutation to transition representation in order to optimize for different use cases (e.g. ropes, slices, canonicalized, etc). Sharing strings will likely be the most challenging part of the implementation.
It is possible to support sharing strings by copying-on-sharing, but may be too slow. If possible, lockfree implementations of in-place mutations above is ideal.
Future work
Code sharing
Code sharing is not part of this proposal and thus shared structs cannot have methods. This may prove to be unergonomic enough that we bring code sharing in scope. Doing so would likely require a new kind of function cannot close over non-shared objects.
This proposal future-proofs by having shared struct instances throw when touching the [[Prototype]] slot. This will be relaxed with a future proposal when code sharing becomes possible.
See CODE-SHARING-IDEAS.md for a collection of ideas.
Typed fields
In the future, it may be sensible for more efficient memory representation ("packing") to also declare the type and size of fields. It is omitted from this proposal in that it is not a requirement for none of the primary motivations. At the same time, starting without types lets us add them incrementally in the future.
Shared structs: fast cloning
Since structs have fixed layout with an immutable [[Prototype]] slot and no accessors, they are amenable to fast cloning.