Rewriting

What needs to be rewritten

In proxies, there are a couple of things you need to rewrite:

JS
HTML
Resource URLs - Any time you import a resource, the URL needs to be rewritten.
Sourcemaps - These aren’t necessary, and if it is your first time making a SW proxy I don’t recommend looking into this. The best solution is to force Scramtags, which will “undo” the rewriting.
Manifests - When I say this, I particularly mean Web App Manifests and soon Sub App Manifests. Cache Manifests are deprecated now and removed from all major browsers, so they aren’t a concern. These are important because it allows the site to spawn a PWA, but you can only have one PWA per-origin. It’s best to remove all references to PWAs and let proxy browser implementations use your sandboxing system.

Rewriting Specifics

HTML Rewriting

Recommended Libraries

The DOMUtils libraries will take you a long way, and is recommended.

The `MutationObserver` approach and why it is not recommended

It’s slow

A short overview on DOM painting

CSS Rewriting

Don’t even bother, RegExp is good enough. You only need to rewrite import() at-rule statements.

JS Rewriting

JS Rewriting is requried because you can’t monkeypatch location objects, since they are non-configurable properties and trying to reassign it to another object results in a navigation, or eval, because of web limitations. JS rewriting is the biggest hurdle you will have when making your proxy, until you get to advanced emulation features. It is considered the test, and once you pass it you become a proper proxy dev.

Recommended Libraries

Watch out and don’t use unmaintained libraries like esprima, where you will be stuck with ES5-only

Feel free to experiment with your own stack along the way. SWC is also a strong choice if you favor an expansive ecosystem with plugins and features like transformers. You may also use the direct Rust libraries and use WASM Bindgen like how Scramjet does if you are familiar with and prefer Rust.

Methods

⭐ DPSC

DPSC is short for Deep Property Scope Checking

DPSC involves rewriting the JS

It is recommended that you use this stack

Parse the script into AST, and then walk through it with a compatible AST walker. Try to find any possible object property access (through MemberExpression traversal in the AST) and check if it is to window.location. You can do this by wrapping the access in a function and checking if it is window.location. Remember, bracket access is a thing, too. You don’t want to overwhelm the call stack, so don’t blindly try to wrap every possible object property access. What I mean by this is check for any global object references such as window, self, globalthis, parent, top, and start looking for possible accesses only if it starts with the base object identifier being these. There you go, full DPSC (oversimplified).

This is commonly accoplished by wrapping all member expressions’ properties in a wrapper function, which is fine initially. To allow more features (and potentially improve Google Botguard score), Precise DPSC passes the object being accessed into the property rewriter as well, allowing more targeted propty rewriting.

Location Escapes

I wish it was as simple as above, but there are edge-cases, where the site can escape your JS rewriting. I show examples where a popup of the real URL shows whenever a location escape is found.

Location Set

Escape: location = "javascript:alert(location.href)"

Solved with Try Set

####### Location for…of Set

Escape: for (location of ["javascript:alert(location.href)"]);

Solved with Try Set+

Window Destructure

Escape:

const { location: x } = window;
alert(location.href)

Window Object Spread

Escape:

const x = { ...window };
alert(x.location.href);

Update Expressions

Escape:

(window.x = window).y += 0
alert(x.location.href)

Base URL Normalization

Escape:

const a = document.createElement("a");
a.href = ".";
alert(a.href);

Escape (alt 1):

const img = new Image()
img.src = "."
alert(img.src)

Escape (alt 2):

const form = document.createElement("form")
alert(form.action)

Escape (alt 3):

alert(new Request(".").url)

This may differ with other Base URLs being set

Escape 2:

const base = document.createElement("base");
base.href = location.origin;
document.head.prepend(base);

const x = document.createElement("a")
x.href = "."
// Will only alert the origin
alert(x.href);

base.remove()

Indirect Eval Global

Escape: alert((0, eval)("location.href"))

Function Constructor Global

Escape: alert(Function("return this")().location.href)

Reflect

Escape: alert(Reflect.get(location, "href", location))

Getter Rebind

Throw Window

Escape:

try {
  throw window;
} catch ({...x}) {
  alert(x.location.href)
}

ES6 Proxy object

 const win = new Proxy(window, {
  get(target, prop, receiver) {
    alert(target.location.href);
  },
}).x

You can take a proxify approach if you want to avoid heavy AST tree traversal for patching this escape, however it is very hacky in nature.

{
  const REGEXP_REVEAL_ORIGINAL_TARGET = /Error\\n\s\s\s\sat\s([a-zA-Z.]*)/;
  const proxyProxy = new Proxy(Proxy, {
    construct(target, args) {
      let [pTarget, handler] = args;

      if ("apply" in handler) {
        const originalApplyHandler = handler.apply;
        args[1] = (_target, _that, pArgs) => {
          // _that is `null`
          // _target doesn't work here
          const pTargetBak = pTarget;
          pTarget = () => new Error().stack;
          const revealingStackError = pTarget();
          // Restore functionality of the target method
          pTarget = pTargetBak;
          // Get the parents that contain the method
          const targetName = pTarget.name;
          const parentObjTree = [
            ...revealingStackError.matchAll(REGEXP_REVEAL_ORIGINAL_TARGET),
          ][0].split(`.${targetName}`)[0];
          let pThat = window[parentObjTree];
          if (pThat === window)
            // @ts-ignore
            pThat = proxyProxy;
          return originalApplyHandler(pTarget, pThat, pArgs);
        };
      }

      return Reflect.construct(target, args);
    },
  });
  Proxy = proxyProxy;
}

contentEditable Script

You don’t have to worry about this, since it requires specific user interaction to trigger anyway.

⭐ Eval rewriting

No matter what method you chose, will also need to rewrite calls to eval, since that can’t be overridden, even inside of a jail system. Don’t worry about jail systems; they are shortcuts that only work in bundle scripts and are not worth even having an implementation of one in your proxy.

All you have to do here is rewrite the script passed inside of eval by putting a function inside of the call to get the script content being passed into it and check if it is a string before writing, of course.

You can’t proxify eval, because it would change the scope in which it is being executed.

⭐ Source Mapping

Another hurdle for rewriting JS is dealing with Function.prototype.toString. This function obtains the verbatim source code of a function, meaning rewritten functions would show their rewrites in it. To combat that problem, you can start by intercepting Function.prototype.toString, and then use

Byte Span Rewrites

This method limits what your rewriter can do in terms of output. Namely, it must create raw text strings and either insert or replace them at certain locations in the original source. This means plugins and transformers which operate on the AST won’t work, as well as requiring creative ways of handling JS output. A span of bytes which delimit a sourcemap-style structure is then associated with each function. In your interception, you can then find the byte span and use it to “unrewrite” your source code.

Scramjet uses this method to achieve speed, at the cost of forbidding AST (in Scramjet’s case, the OXC AST) manipulation and thus any plugins or transformers.

Source Replacement Rewrites

This method is significatly less efficient than Byte Span Rewrites, but allows any transformation to be done to the AST of the source. The whole original source is associated with each function, and your interception can read it out and display it when called.

This method is currently not used in Titanium Network proxies, but can be found in generic rewriters such as rewd.

Jails

Jails is highly discouraged. It can be used as a shortcut in the beginning, but eventually you are going to need to use DPSC. You can have a hybrid appoach, but at that point, why bother?

Jails is most useful for rewriting module scripts, since the variables defined in it are not globally scoped by default. This means that you can have very minimal rewriting for most bundles, and completely get away with it on sites like Discord.

You will need fake vars if you want to use the jails method outside of module scripts. These are needed, because when you scope the entire script, any variables you create in what would be in the global scope, will not be shared across other scripts.

You need to create an object, where you have a fake let/const setters, so that you can trap the variable calls, put them on the window so they are globally scoped, and then inside of the Window Proxy, which are passed into the IIFE, trap them in calls so that those variables aren’t seen on the window as a property, but emulated as a scope.

There is still the catch for both module and classic scripts, where you need to rewrite any eval calls. You can’t proxify eval, because you can’t overwrite eval in strict mode, and even if you in a normal script, it will mess up the context.

How fake vars work

Transform as such:

These examples are for let, but the same applies for const

Array Destructuring

let [var1, var2] = ...;

<fakeVar>.fakeArrayDestructure = ([var1, var2], ...);

Object Destructuring

let { var1, var2 } = ...;

<fakeVar>.fakeObjectDestructure =
 ([var1, var2], ...);

Normal variables

let x = ...;

let.x = ...;

Rewriting

What needs to be rewritten

Rewriting Specifics

HTML Rewriting

Recommended Libraries

The MutationObserver approach and why it is not recommended

A short overview on DOM painting

CSS Rewriting

JS Rewriting

Recommended Libraries

Methods

⭐ DPSC

Location Escapes

Location Set

Window Destructure

Window Object Spread

Update Expressions

Base URL Normalization

Indirect Eval Global

Function Constructor Global

Reflect

Getter Rebind

Throw Window

ES6 Proxy object

contentEditable Script

⭐ Eval rewriting

⭐ Source Mapping

Byte Span Rewrites

Source Replacement Rewrites

Jails

How fake vars work

Array Destructuring

Object Destructuring

Normal variables

The `MutationObserver` approach and why it is not recommended