Seth Karten (@sethkarten) / X

Seth Karten

3,251 posts

Seth Karten

@sethkarten

Agents….Continual Harness, PokeAgent, LLM Economist | Research Intern @PrimeIntellect | CS PhD @Princeton | Former CMU Waymo

🐯

Joined October 2012

Pinned
Seth Karten
@sethkarten
May 13
Article
Gemini Plays Pokémon discovered something about agent harnesses. Continual Harness automates it.
Long-horizon embodied agency is a harness problem rather than a model-scale problem. Coding agents already work this way. Claude Code and OpenHands are scaffolding around the model (prompt, skills,...
31K
Seth Karten
@sethkarten
Oct 24, 2025
They say the first 100 citations are the hardest. Happy to achieve this small milestone 🎉
57K
Seth Karten
@sethkarten
Mar 7, 2025
Can a Large Language Model (LLM) with zero Pokémon-specific training achieve expert-level performance in competitive Pokémon battles? Introducing PokéChamp, our minimax LLM agent that reaches top 30%-10% human-level Elo on Pokémon Showdown! New paper on arXiv and code on github!
49K
Seth Karten
@sethkarten
Jun 12, 2022
Replying to @spandan_madan
Heaven exists and it's a browser extension
GitHub - j3soon/arxiv-utils: Meaningful titles for tabs and PDF downloads! Also supports tab search.
From github.com
Seth Karten
@sethkarten
May 30, 2025
Excited to announce that I will be spending the summer at @Waymo on the simulation realism team! I’ll be working on learning to generate simulated worlds. 🚙🚙🚙 Send me a message if youre in the bay and want to chat!
7.9K
Seth Karten
@sethkarten
Jul 14, 2025
🚀 Launch day! The NeurIPS 2025 PokéAgent Challenge is live. Two tracks: ① Showdown Battling – imperfect-info, turn-based strategy ② Pokemon Emerald Speedrunning – long horizon RPG planning 5 M labeled replays • starter kit • baselines. Bring your LLM, RL, or hybrid
57K
Seth Karten
@sethkarten
Apr 17, 2023
I am happy to share that I will be joining the PhD in Computer Science program at @Princeton with @chijinML as a Francis Robbins Upton and NSF GRFP Fellow. I am very grateful to my advisors, mentors, and peers at @SCSatCMU and @RutgersU over the past years for their support.
9.2K
Seth Karten
@sethkarten
Jun 10, 2022
Little know fact about Carnegie Mellon University’s robotics institute… we keep it stocked with popcorn so the place always smells like a movie theater🍿🎥
Seth Karten
@sethkarten
Jun 17, 2025
Wow, the exponential rollout at @Waymo is super exciting. Most deployed real-world autonomous agent and multi-agent system!
3.6K
Seth Karten
@sethkarten
May 26, 2025
Excited to share that the PokeAgent challenge was accepted as a @NeurIPSConf competition! This should serve as an excellent standardized benchmark for competitive games AND ‘speedrunning’ the RPG. I hope to see both the RL and LLM agent communities working together here to eval
8.2K
Seth Karten
@sethkarten
Mar 28, 2025
I'm excited to release the PokéChamp dataset! 🎮 2 million cleaned battle logs from Pokémon Showdown to help train expert-level AI agents for competitive Pokémon. Check out the data on @huggingface and code on GitHub! 🔗 huggingface.co/datasets/milkk… 🔗 github.com/sethkarten/pok…
Seth Karten
@sethkarten
Mar 7, 2025
Can a Large Language Model (LLM) with zero Pokémon-specific training achieve expert-level performance in competitive Pokémon battles? Introducing PokéChamp, our minimax LLM agent that reaches top 30%-10% human-level Elo on Pokémon Showdown! New paper on arXiv and code on github!
milkkarten/pokechamp · Datasets at Hugging Face
From huggingface.co
15K
Seth Karten
@sethkarten
Mar 5, 2025
Fantastic RL result from the Puffer group showing the sheer complexity of Pokemon Red as a benchmark. LLM agents need to take notes
Joseph Suarez 🐡
@jsuarez
Mar 5, 2025
We beat Pokemon Red with online RL! Details here over the next several days. Led by @dsrubinstein. Follow him, me, @DanAdvantage, @kywch500, @computerender for more!
7.5K
Seth Karten
@sethkarten
Jul 9, 2025
Heading to #ICML2025 next week! If you’re into all things API (Artificial Pokémon Intelligence) from our PokéChamp spotlight to the upcoming NeurIPS PokeAgent Challenge, LLM-agent scaffolding & reasoning, or mechanism-design nudging, let’s connect. DMs open!
4.2K
Seth Karten
@sethkarten
Aug 15, 2025
🎓 University students & AI researchers — push your Pokémon AI agents further! The NeurIPS 2025 PokéAgent Challenge is offering compute credits, courtesy of our sponsor Google DeepMind, to help you train bigger models & run more experiments. 📌 To apply: 1️⃣ Make a submission to
5.5K