Open-weight AI model GLM 5.2 delivers Opus-level smarts at startup prices

Jun 24, 2026 · Lenny's Podcast
🎧 PodShort 15 min squeezed to 2 AI SprinklerAS AI / ML New
Full episode from Lenny's Podcast
Key Insights
  • GLM 5.2 is an open-weight model, meaning its trained model weights are publicly available for download, allowing users to run it on their own hardware, fine-tune it on their own data, and inspect its workings.
  • GLM 5.2 is achieving intelligence levels comparable to Opus or GPT-5.5 at a fraction of the cost, making it a significant development for those looking to self-host AI models.
  • The model's context window is large, supporting a million tokens, but it is limited to text-in and text-out, meaning it cannot process or generate images.
  • GLM 5.2 performs well in benchmarking against models like Opus and GPT-5.5, often inching above GPT-5.5 and nearly matching Claude Opus 4.8, while certainly outperforming Gemini 3.1 Pro.
  • The model demonstrated strong capabilities in exploring an existing codebase, providing a good overview of its architecture and recent developments, which is a common task for software engineers.
  • GLM 5.2's ability to redesign a website header section, including generating HTML and CSS, showed a surprisingly good design sense and adherence to existing design systems, suggesting its potential for design-related tasks.
  • Despite struggling with writing React code, GLM 5.2 successfully executed a long-running autonomous task of pulling error logs, analyzing them, and generating a prioritized fix plan in a well-designed HTML format.
  • The cost-effectiveness of GLM 5.2 is a major advantage, with a long-running task consuming only a few dollars for millions of tokens, making it a highly affordable alternative to more expensive proprietary models.
Metrics Mentioned
  • 1 million tokens (Context window size of GLM 5.2.)
  • $3.36 (Cost for about 6 million tokens used for various tasks, including a 45-minute long-running task.)
  • 72% (Cache rate for the tokens used.)
  • 20 (Number of Sentry errors identified by GLM 5.2.)
  • 5 (Number of Vercel log signals identified by GLM 5.2.)
  • 14 (Number of planned fixes generated by GLM 5.2.)

RevBots.ai View:

  • AI Sprinkler teams bolt this onto existing stacks for cheap dev productivity gains.
  • Open-weight models reduce dependency on SaaS vendors: an ARM precursor move.
  • Cost benchmarks show AI economics shifting; impacts SaaS Hoarder tool sprawl math.
🎧Full Episode:Lenny's Podcast →