Open-weight AI model GLM 5.2 delivers Opus-level smarts at startup prices

Jun 24, 2026 · Lenny's Podcast

🎧 PodShort 15 min squeezed to 2 AI / ML New

Full episode from Lenny's Podcast

Key Insights

GLM 5.2 is an open-weight model, meaning its trained model weights are publicly available for download, allowing users to run it on their own hardware, fine-tune it on their own data, and inspect its workings.
GLM 5.2 is achieving intelligence levels comparable to Opus or GPT-5.5 at a fraction of the cost, making it a significant development for those looking to self-host AI models.
The model's context window is large, supporting a million tokens, but it is limited to text-in and text-out, meaning it cannot process or generate images.
GLM 5.2 performs well in benchmarking against models like Opus and GPT-5.5, often inching above GPT-5.5 and nearly matching Claude Opus 4.8, while certainly outperforming Gemini 3.1 Pro.
The model demonstrated strong capabilities in exploring an existing codebase, providing a good overview of its architecture and recent developments, which is a common task for software engineers.
GLM 5.2's ability to redesign a website header section, including generating HTML and CSS, showed a surprisingly good design sense and adherence to existing design systems, suggesting its potential for design-related tasks.
Despite struggling with writing React code, GLM 5.2 successfully executed a long-running autonomous task of pulling error logs, analyzing them, and generating a prioritized fix plan in a well-designed HTML format.
The cost-effectiveness of GLM 5.2 is a major advantage, with a long-running task consuming only a few dollars for millions of tokens, making it a highly affordable alternative to more expensive proprietary models.

Metrics Mentioned

1 million tokens (Context window size of GLM 5.2.)
$3.36 (Cost for about 6 million tokens used for various tasks, including a 45-minute long-running task.)
72% (Cache rate for the tokens used.)
20 (Number of Sentry errors identified by GLM 5.2.)
5 (Number of Vercel log signals identified by GLM 5.2.)
14 (Number of planned fixes generated by GLM 5.2.)

RevBots.ai View:

AI Sprinkler teams bolt this onto existing stacks for cheap dev productivity gains.
Open-weight models reduce dependency on SaaS vendors: an ARM precursor move.
Cost benchmarks show AI economics shifting; impacts SaaS Hoarder tool sprawl math.

🎧Full Episode:Lenny's Podcast →

RevBots.ai View:

Join The RevBots ARMy