a16z’s @stuffyokodraws has developed TetrisBench, a benchmark that evaluates large language models (LLMs) through the game of Tetris and reveals distinct playing styles among models. The project reframes Tetris as a coding task: instead of reasoning turn by turn, each LLM writes a deterministic scoring function that ranks candidate moves. The results show that while LLMs generally outperform humans in structured scenarios, they struggle in off-distribution situations where human players rely on intuition and “controlled chaos.” Notably, the models exhibit distinct personalities, with some opting for aggressive early plays and others maintaining conservative board tactics, further highlighting the differences in long-term strategic decision-making between humans and AI.
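To make the “deterministic scoring function” idea concrete, here is a minimal sketch of the kind of function an LLM might emit to rank candidate placements. This is an illustration only, not TetrisBench’s actual code: the features (aggregate height, holes, bumpiness, lines cleared) and their weights are assumptions drawn from classic Tetris heuristics.

```python
# Illustrative sketch, NOT TetrisBench's actual implementation.
# A board is a list of rows (top row first); 1 = filled cell, 0 = empty.

def column_heights(board):
    """Height of each column, measured from the bottom of the board."""
    rows, cols = len(board), len(board[0])
    heights = []
    for c in range(cols):
        h = 0
        for r in range(rows):
            if board[r][c]:
                h = rows - r  # first filled cell from the top sets the height
                break
        heights.append(h)
    return heights

def count_holes(board):
    """Empty cells with at least one filled cell above them in the same column."""
    rows, cols = len(board), len(board[0])
    holes = 0
    for c in range(cols):
        seen_block = False
        for r in range(rows):
            if board[r][c]:
                seen_block = True
            elif seen_block:
                holes += 1
    return holes

def score_board(board, lines_cleared=0):
    """Deterministic score for a resulting board state; higher is better.

    Weights are illustrative, not tuned: reward cleared lines, penalize
    tall stacks, buried holes, and uneven surfaces.
    """
    heights = column_heights(board)
    bumpiness = sum(abs(a - b) for a, b in zip(heights, heights[1:]))
    return (10.0 * lines_cleared
            - 0.5 * sum(heights)
            - 2.0 * count_holes(board)
            - 0.3 * bumpiness)
```

Because the function is deterministic, every candidate placement can be simulated and scored, and the highest-scoring move is played; the model’s “personality” then lives entirely in the code it wrote, not in per-turn sampling.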
TetrisBench reveals distinct personalities in LLM gameplay
