Proof & Testing

Support claims on Runtime.MiRust.com should map to evidence. The source code already treats tests as part of the architecture.

Test categories

Test laneWhat it protects
Rust unit testsMath ops, parser rules, tokenizer, quantization, adapters, packer operators, receipts, manifests.
WASM ABI smokeExport boundary, allocation, model load, generation, next token, adapter validation/apply, reset, free.
Browser harnessStatic app IDs, local-only markers, route declarations, manifest presence, content types.
Browser smokeHeadless browser behavior, model load, generation, Step Token, Adapter Sidecar, selector registries, module-plan drift rejection, assembly evidence checks.
Route soaksRepeated q4_0/q8_0/f32 generated assembly routes, reload behavior, no-store fetches, model release before reassembly.
Visual Studio scriptsBuild, test, copy WASM, check model artifacts, run browser harness, reuse local server.

Drift tests

Generated routes should reject checksum-consistent but semantically wrong metadata. The source references drift gates for:

Quality gates

A runtime smoke model proves runtime behavior. It does not prove assistant usefulness.

Assistant-quality claims need converted-trained provenance and a matching passed evaluation sidecar with declared cases, prompts, expected text, actual text, totals, and checksums.

Public evidence wording

Use phrases such as:

Avoid implying that deterministic smoke models are production assistants.