Vitalik Buterin printed a analysis proposal that avoids the query everybody retains asking: Are you able to run AI fashions on blockchain?
As an alternative, the examine argues that Ethereum is a privacy-preserving funds layer for pay-as-you-go AI and API utilization. This publish, co-authored with Davide Crapis on Ethereum analysis, argues that the actual alternative is to not put LLM on-chain.
The true alternative lies in constructing an infrastructure that permits brokers and customers to pay for 1000’s of API calls with out compromising their identities or making a surveillance path with billing information.
Timing is crucial as agent AI is shifting from demonstration to enterprise roadmap. Gartner predicts that 40% of enterprise purposes will embody task-specific AI brokers by the tip of 2026, up from lower than 5% in 2025.
This shift hints at a world the place software program autonomously generates massive numbers of API calls and billing rails develop into strategic infrastructure moderately than back-office plumbing.
Present measurement methods drive you to decide on between Web2 identification billing, which depends on API keys and bank cards and exposes profiling information, and on-chain pay-per-call fashions that hyperlink exercise via clear transaction graphs, that are too gradual and costly.
This proposal introduces ZK API Utilization Credit, a cost and abuse prevention primitive constructed on the speed restrict override performance.
RLN is a zero-knowledge gadget designed to forestall spam on nameless methods, and analysis repurposes it for metered entry to companies.
The circulation proceeds as follows. Customers deposit funds as soon as into a wise contract and their dedication is added to the on-chain Merkle tree.
Every API request features a zero-knowledge proof that proves that the person is a sound depositor with ample belief for the requested index.
If a person makes an attempt to reuse the ticket index and spends twice the allowance, the RLN permits the system to recuperate the key and cut back the stake as a monetary penalty.
The publish incorporates concrete examples. Consumer deposits 100 USDC and runs 500 hosted LLM queries. One other person deposits 10 USDC for 10,000 Ethereum RPC calls.
The structure is explicitly designed for “many calls per deposit,” and on-chain exercise scales with the variety of accounts and settlement frequency, moderately than uncooked inference quantity.
Improve flexibility with variable price help. Customers pay upfront the utmost charge per name, the server returns a signed refund ticket for the unused quantity, and customers accumulate refunds privately to unlock extra calls with out further deposits.
the infrastructure already exists
This proposal comes at a time when a big utilization credit score cost infrastructure already exists.
In line with DefiLlama, the stablecoin has a circulating market capitalization of roughly $307.6 billion, indicating that the on-chain greenback layer has ample liquidity to help deposit-based billing for high-frequency companies.
Ethereum’s scaling stack has matured to the purpose the place Rollup handles far more exercise than Layer 1, with L2Beat exhibiting a scaling issue of round 100x, with Rollup processing 1000’s of operations per second in comparison with Ethereum Mainnet’s 1000’s of operations per second.
Just lately, Ethereum’s common transaction charge was measured at roughly $0.21 on February seventh, suggesting that occasional on-chain measurement and cost flows are attainable with out prohibitive prices.
This design explicitly avoids placing LLM on-chain. Ethereum competes not on TPU cycles or inference velocity, however on impartial funds, programmable escrow, and verifiable execution.
This structure treats inference as an off-chain service and blockchain as a dependable layer for funds, measurement, and dispute decision. There is no such thing as a want for customers to belief or reveal their identification to particular person suppliers.
Ethereum turns into the enforcement layer for AI commerce when AI service suppliers depend on Ethereum or layer 2 sensible contracts to simply accept deposits, slash, refund, and adjudicate disputes.
This mannequin is just like how Ethereum grew to become a stablecoin and DeFi funds layer by offering a impartial basis on which financial agreements are enforced programmatically, moderately than internet hosting the whole utility stack on-chain.
A state of affairs with out the hype
On-chain footprint is restricted by settlement cadence, not uncooked name quantity.
Assume that 250,000 energy customers or brokers undertake utilization credit in a crypto-native wedge state of affairs focusing on RPC and infrastructure APIs.
If every performs two on-chain actions per thirty days (deposit or replenishment and withdrawal), roughly 500,000 transactions are generated month-to-month attributable to the railway.
In an AI supplier deployment state of affairs, think about 1 million customers leveraging privacy-preserving credit throughout hosted LLM companies, however solely performing 1 to three on-chain actions per thirty days.
Which means that the 1 million to three million transactions per thirty days related to AI commerce rails could also be concentrated in Layer 2, the place it’s cheaper to execute.
In enterprise agent eventualities, the scale of deposits will increase, the danger of dependable execution will increase, and the significance of the slash mechanism will increase.
Metadata points
The proposal seeks to make funds unlinkable, however the analysis thread itself highlights potential weaknesses.
Commenters argue that even when nullifiers can’t be cryptographically linked, servers can affiliate customers via inference-based metadata comparable to timing patterns, token counts, and cache hits.
This critique proposes mounted, bucketed pricing for enter and output lessons to cut back leakage. The strain between cryptographic privateness and behavioral metadata is central as to whether a design really achieves the purpose of anonymity.
One other hurdle exists in precise implementation. Though this proposal makes use of RLN as a primitive, the Privateness and Scaling Explorations mission web page notes that RLN is inactive or deprecated.
Productizing ZK API Utilization Credit could require you to keep up a fork or implement a brand new resolution moderately than counting on present instruments.
The RLNJS benchmark reviews round 800ms for proof technology and round 130ms for verification on M2 Macs, offering an early efficiency sanity verify, however leaving open questions on cellular constraints and huge production-grade circuits.
The proposal additionally envisions suppliers integrating deposit and proof flows, accepting stablecoin funds, and adopting Ethereum or Layer 2 contracts for dispute decision.
It is not only a technical downside, it is a coordination downside. Web2 API suppliers have present billing infrastructure and clear rules concerning identity-linked transactions.
To persuade them to undertake a ZK-based various, they might want to reveal both a compelling price benefit or a differentiated market section the place privateness safety fees can seize income they’d not in any other case have.
| mannequin | Billing methodology | Issues that leak or break | Somebody who fits you |
|---|---|---|---|
| Web2 ID billing (API key + card) | Account-based billing tied to your ID (API key + cost methodology). Supplier centrally measures requests and invoices | leak: Id linkage and profiling path throughout requests. break: Pseudonym/self-custody norms. danger: Centralized administration (suspension/censorship, single supplier belief) | Mainstream SaaS/API suppliers. Firms that prioritize compliance, simplicity, and present billing rails |
| On-chain pay-per-call | Every request (or batch) pays on-chain per name by way of a transaction/sensible contract | break: Excessive name price/delay. leak: On-chain linkability (transaction graph ties utilization collectively). friction: UX overhead for repeated TX | Crypto-native companies which are known as sometimes. When transparency/auditability is extra necessary than privateness/throughput |
| ZK API utilization credit (one time deposit, limitless calls) | Customers deposit as soon as. Every request contains ZK proof of membership and remaining credit. Slash for double use. Optionally available refund ticket for variable bills | danger: Metadata correlation (timing/token patterns may be relinked). Burden: Supplier integration + collaboration. Maturity: ZK instruments/operational complexity, circuit upkeep | Excessive-frequency APIs (LLM, RPC, information) the place privateness is a promoting level. Agent toolchain. Customers who require metering with out identity-based monitoring |
What this implies for Ethereum
If this design positive factors traction, Ethereum’s worth proposition will additional shift towards serving as a impartial enforcement layer for digital commerce moderately than a general-purpose computing platform.
The proposal treats blockchain not as a spot the place purposes run, however as a cost infrastructure that ensures financial guidelines are enforced.
The rate of stablecoins will increase as deposits circulation into utilized credit score contracts, doubtlessly creating a brand new class of on-chain financial exercise distinct from DeFi hypothesis and NFT buying and selling.
Layer 2 utilization is prone to improve as suppliers and customers resolve disputes, course of refunds, and deal with thrashing occasions in a throughput-optimized chain.
The query is whether or not a parallel ecosystem will emerge the place privateness safety fees develop into a prerequisite for sure person segments.
Companies involved about information leakage via billing logs, builders constructing agent toolchains that require unsupervised and auditable instrumentation, and energy customers who worth nameless entry to high-frequency companies are all potential early adopters.
Ethereum’s alternative is to function a layer on which the AI companies market can settle with out members having to belief particular person platforms or sacrifice the privateness of their billing infrastructure.
The proposal claims that Ethereum can implement cost agreements, adjudicate disputes, and allow pay-as-you-go entry with out identification federation in methods which are structurally not attainable with conventional methods.
The success of this argument is determined by fixing the metadata correlation downside, sustaining sturdy ZK implementations, and convincing suppliers that the market-unlocked integration prices are justified.

