SWE Agents Too Cheap To Meter, The Token Data War, and the rise of Tiny Teams
This Latent Space newsletter discusses the shift in Software Engineering (SWE) agent pricing, highlighting the trend of offering free or heavily discounted access to coding agents like Codex and Jules in exchange for user data. This data is valuable for training increasingly sophisticated models. The newsletter also touches on the rise of "Tiny Teams," small, highly productive teams augmented by AI, achieving significant ARR with few employees.
-
The Token Data War: The primary focus is on the evolving pricing models for SWE agents. Companies are now willing to offer them at no cost to acquire user data, recognizing its value in training better models.
-
Value of User Data: Elite user data is becoming increasingly valuable as generic datasets like GitHub and StackOverflow are exhausted. Some companies are even paying experts exorbitant salaries to curate high-quality, niche datasets.
-
Rise of Tiny Teams: SWE Agents enable "Tiny Teams" to achieve outsized results (millions in ARR per employee), marking a shift towards highly efficient, AI-augmented teams.
-
LMArena's $100M Seed Round: The funding of LMArena, a platform for evaluating LLMs using human raters, signals the importance of high-quality human feedback data.
-
The free access to powerful coding agents like Codex and Jules suggests a strategic move by major players to gather extensive user data.
-
The success of companies like Perplexity and Cursor, despite negative profit margins, underscores the market's valuation of user data.
-
"Vibe coding" is contrasted with the more productive "Tiny Teams" concept, emphasizing the importance of tangible results in AI-augmented development.
-
The prediction that LMArena will eventually pay users for their contributions highlights the increasing value of human feedback in training AI models.