TIMES  |  Tech

Why the World's Best AI Systems Are Still So Bad at Pokémon

As advanced models stumble through a 1990s Game Boy classic, Pokémon is a surprisingly revealing test of what AI still can’t do.

2026-01-13

The quest to make a large language model (LLM) a Pokémon master began last February, when an Anthropic researcher launched a livestream of Claude playing the 1996 Game Boy game Pokémon Red to accompany the release of Claude Sonnet 3.7, at the time one of the world’s best models. As the company noted, this was the first Claude model that could meaningfully play the game at all (previous models “wandered aimlessly or got stuck in loops,” and could not get past the game’s opening beats). Within the first weeks, the stream attracted approximately 2,000 viewers, cheering Claude along in the public chat.

