Tensor Commits - Theseus Docs

<1%

Proof generation overhead

<0.1%

Verification time

~2ms

Check time per proof

Overview

Tensor-commit protocols enable verifiable ML by proving a model was executed correctly. Traditional verification via recomputation is prohibitively expensive for large models.

Theseus' Tensor Commits provide batch verification and reduce opening costs through a novel application of KZG commitment schemes extended to multi-dimensional tensor structures.

How Tensor Commits Work

Figure Explanation

A sovereign agent's token stream is fed auto-regressively into an LM Transformer whose every weight tensor and intermediate activation is bound by a single tensor commitment. All per-block commitments live in a Terkle "tensor-commit" tree, so a small set of Merkle-path openings plus KZG pairings suffices to verify every attention score, residual update, non-linearity, and final logit computation.

Key Achievements

<1% Proof Generation

Minimal impact on inference performance. Practical for production workloads.

<0.1% Verification Time

Verifiers check proofs in milliseconds. Thousands can audit simultaneously.

Efficient & Scalable

O(log n) verification complexity

Proof size <1MB for frontier models

1000+ simultaneous verifiers

Sublinear scaling with model size

Terkle Trees

A Terkle tree (tensor Merkle tree) has leaves that are sub-tensors and internal nodes that carry tensor commitments instead of hash values.

Structure

• Each dimension j has mⱼ blocks
• Each leaf cℓ is a commitment of sub-tensor Tℓ
• Parents commit to children tensor concatenation
• Root cᵣₒₒₜ is the global model fingerprint

Benefits

• Batch verification: Multiple ops in one proof
• Selective opening: Without revealing full model
• Efficient proofs: Logarithmic proof size
• Hierarchical: Natural fit for NN layers

Verification Process

Model Registration

Prover uploads weights with Tensor Commit. Commitment stored on-chain as canonical fingerprint.

Inference Execution

Prover runs forward pass, emits proof with opening, input embeddings, layer outputs, and Merkle path.

Verification

Every verifier checks every inference. ~2ms check time, gossip once, 2/3 BFT agreement needed.

Performance Comparison

Per-op cost on Theseus, with proof generation and verification overhead included.

Operation	Latency	Proof Size	Gas Cost
TMATMUL 512x512	4.1 ms	230 KB	18K
TSTREAM 4x512	8.6 ms	400 KB	27K
TCOMMIT 70B	22 ms	470 KB	120K

* Gas costs based on base-load multiplier m = 1.0

Versus alternatives

How Tensor Commits compare to the two main approaches for verifying neural network inference: re-executing the model on every node, and zkML proofs.

Approach	Full re-execution	zkML (zk-SNARK)	Tensor Commits
Verifier work per inference	Same as the prover	Milliseconds (constant)	~2 ms
Prover overhead vs raw inference	0% (no separate proof)	1000-100,000x	<1%
Practical model size	Limited by smallest validator	Small models (mostly)	Frontier (70B+)
Proof size	Not applicable	~KB	~KB to MB
Hides model weights from verifier	No (verifier needs weights)	Yes	Yes

Re-execution is the design Ethereum uses for smart contracts and the reason on-chain inference at frontier sizes is impractical there. zkML produces succinct proofs but the prover-side overhead is what has kept it limited to small networks. Tensor Commits target the same proof-size benefit as zkML with overhead that does not break the economics for production-sized models.

LLM-Specific Optimizations

Token Embeddings

Committed polynomially with positional encoding using homomorphic properties

Layer Normalization

Mean/variance via polynomial commitments, inverse sqrt via polynomial approximation

Multi-Head Attention

Q, K, V matrices committed individually, attention scores polynomially approximated

Residual Connections

Handled via commitment homomorphism, layers reuse prior commitments

Mixture-of-Experts

Sparse expert activations committed efficiently, only activated experts contribute

Why This Matters

No recomputation:Verifiers don't re-run the entire model

Hardware independence:Proofs valid regardless of hardware

Privacy preserving:Weights remain private, computation verifiable

Scalable verification:Thousands of validators simultaneously

Tensor Commits Protocol