Google, OpenAI, and Anthropic are competing to see whose AI can play Pokémon the best — Twitch streams of beloved RPG game test the models’ true might

Google, OpenAI, and Anthropic are competing to see whose AI can play Pokémon the best — Twitch streams of beloved RPG game test the models' true might

As Big Tech moves toward its goal of achieving AGI, inference will transition from simple answers to long-running, successive progress, which a game like Pokémon is perfect for. To finish the game, you have to win the Pokémon League, and that requires several steps in a row, testing the AI's strategic planning and resource management. It also makes the performance easily quantifiable instead of being subjective.

Previously, we covered another exercise in AI capabilities where a bunch of models were asked to build a clone of Minesweeper . OpenAI's Codex emerged as the winner there, with Google's Gemini failing to even produce a playable game. That was a much easier ask, so something as complex as even a retro RPG is certainly a step-up in assessment criteria.

Follow Tom's Hardware on Google News , or add us as a preferred source , to get our latest news, analysis, & reviews in your feeds.

Hassam Nasir is a die-hard hardware enthusiast with years of experience as a tech editor and writer, focusing on detailed CPU comparisons and general hardware news. When he\u2019s not working, you\u2019ll find him bending tubes for his ever-evolving custom water-loop gaming rig or benchmarking the latest CPUs and GPUs just for fun. ","collapsible":{"enabled":true,"maxHeight":250,"readMoreText":"Read more","readLessText":"Read less"}}), "https://slice.vanilla.futurecdn.net/13-4-11/js/authorBio.js"); } else { console.error('%c FTE ','background: #9306F9; color: #ffffff','no lazy slice hydration function available'); } Hassam Nasir Social Links Navigation Contributing Writer Hassam Nasir is a die-hard hardware enthusiast with years of experience as a tech editor and writer, focusing on detailed CPU comparisons and general hardware news. When he’s not working, you’ll find him bending tubes for his ever-evolving custom water-loop gaming rig or benchmarking the latest CPUs and GPUs just for fun.

Sam Hobbs Can any of them do Jeopardy? Those three should compete on Jeopardy. Reply

Key considerations

  • Investor positioning can change fast
  • Volatility remains possible near catalysts
  • Macro rates and liquidity can dominate flows

Reference reading

More on this site

Informational only. No financial advice. Do your own research.

Leave a Comment