Log inSign up
Seth Karten
Prime Intellect
3,251 posts
Image
user avatar
Seth Karten
Prime Intellect
@sethkarten
Agents….Continual Harness, PokeAgent, LLM Economist | Research Intern @PrimeIntellect | CS PhD @Princeton | Former CMU Waymo
🐯
sethkarten.ai
Joined October 2012
667
Following
2,240
Followers
  • Pinned
    user avatar
    Seth Karten
    Prime Intellect
    @sethkarten
    May 13
    Article cover image
    Article
    Gemini Plays Pokémon discovered something about agent harnesses. Continual Harness automates it.
    Long-horizon embodied agency is a harness problem rather than a model-scale problem. Coding agents already work this way. Claude Code and OpenHands are scaffolding around the model (prompt, skills,...
    31K
  • user avatar
    Seth Karten
    Prime Intellect
    @sethkarten
    Oct 24, 2025
    They say the first 100 citations are the hardest. Happy to achieve this small milestone 🎉
    Image
    57K
  • user avatar
    Seth Karten
    Prime Intellect
    @sethkarten
    Mar 7, 2025
    Can a Large Language Model (LLM) with zero Pokémon-specific training achieve expert-level performance in competitive Pokémon battles? Introducing PokéChamp, our minimax LLM agent that reaches top 30%-10% human-level Elo on Pokémon Showdown! New paper on arXiv and code on github!
    Image
    49K
  • user avatar
    Seth Karten
    Prime Intellect
    @sethkarten
    Jun 12, 2022
    Replying to @spandan_madan
    Heaven exists and it's a browser extension
    Image
    GitHub - j3soon/arxiv-utils: Meaningful titles for tabs and PDF downloads! Also supports tab search.
    From github.com
  • user avatar
    Seth Karten
    Prime Intellect
    @sethkarten
    May 30, 2025
    Excited to announce that I will be spending the summer at @Waymo on the simulation realism team! I’ll be working on learning to generate simulated worlds. 🚙🚙🚙 Send me a message if youre in the bay and want to chat!
    Image
    7.9K
  • user avatar
    Seth Karten
    Prime Intellect
    @sethkarten
    Jul 14, 2025
    🚀 Launch day! The NeurIPS 2025 PokéAgent Challenge is live. Two tracks: ① Showdown Battling – imperfect-info, turn-based strategy ② Pokemon Emerald Speedrunning – long horizon RPG planning 5 M labeled replays • starter kit • baselines. Bring your LLM, RL, or hybrid
    Banner reading “PokéAgent Challenge @ NeurIPS 2025” with two panels: Track 1 – Competitive Pokémon Battle Bots, Track 2 – Long-Horizon RPG Gameplay. Call-to-action: “Create video-game AI! Win prizes! Live now at pokeagent.github.io.”
    57K
  • user avatar
    Seth Karten
    Prime Intellect
    @sethkarten
    Apr 17, 2023
    I am happy to share that I will be joining the PhD in Computer Science program at @Princeton with @chijinML as a Francis Robbins Upton and NSF GRFP Fellow. I am very grateful to my advisors, mentors, and peers at @SCSatCMU and @RutgersU over the past years for their support.
    9.2K
  • user avatar
    Seth Karten
    Prime Intellect
    @sethkarten
    Jun 10, 2022
    Little know fact about Carnegie Mellon University’s robotics institute… we keep it stocked with popcorn so the place always smells like a movie theater🍿🎥
  • user avatar
    Seth Karten
    Prime Intellect
    @sethkarten
    Jun 17, 2025
    Wow, the exponential rollout at @Waymo is super exciting. Most deployed real-world autonomous agent and multi-agent system!
    Image
    Image
    Image
    3.6K
  • user avatar
    Seth Karten
    Prime Intellect
    @sethkarten
    May 26, 2025
    Excited to share that the PokeAgent challenge was accepted as a @NeurIPSConf competition! This should serve as an excellent standardized benchmark for competitive games AND ‘speedrunning’ the RPG. I hope to see both the RL and LLM agent communities working together here to eval
    Image
    8.2K
  • user avatar
    Seth Karten
    Prime Intellect
    @sethkarten
    Mar 28, 2025
    I'm excited to release the PokéChamp dataset! 🎮 2 million cleaned battle logs from Pokémon Showdown to help train expert-level AI agents for competitive Pokémon. Check out the data on @huggingface and code on GitHub! 🔗 huggingface.co/datasets/milkk… 🔗 github.com/sethkarten/pok…
    user avatar
    Seth Karten
    Prime Intellect
    @sethkarten
    Mar 7, 2025
    Can a Large Language Model (LLM) with zero Pokémon-specific training achieve expert-level performance in competitive Pokémon battles? Introducing PokéChamp, our minimax LLM agent that reaches top 30%-10% human-level Elo on Pokémon Showdown! New paper on arXiv and code on github!
    Image
    Image
    milkkarten/pokechamp · Datasets at Hugging Face
    From huggingface.co
    15K
  • user avatar
    Seth Karten
    Prime Intellect
    @sethkarten
    Mar 5, 2025
    Fantastic RL result from the Puffer group showing the sheer complexity of Pokemon Red as a benchmark. LLM agents need to take notes
    user avatar
    Joseph Suarez 🐡
    @jsuarez
    Mar 5, 2025
    We beat Pokemon Red with online RL! Details here over the next several days. Led by @dsrubinstein. Follow him, me, @DanAdvantage, @kywch500, @computerender for more!
    7.5K
  • user avatar
    Seth Karten
    Prime Intellect
    @sethkarten
    Jul 9, 2025
    Heading to #ICML2025 next week! If you’re into all things API (Artificial Pokémon Intelligence) from our PokéChamp spotlight to the upcoming NeurIPS PokeAgent Challenge, LLM-agent scaffolding & reasoning, or mechanism-design nudging, let’s connect. DMs open!
    4.2K
  • user avatar
    Seth Karten
    Prime Intellect
    @sethkarten
    Aug 15, 2025
    🎓 University students & AI researchers — push your Pokémon AI agents further! The NeurIPS 2025 PokéAgent Challenge is offering compute credits, courtesy of our sponsor Google DeepMind, to help you train bigger models & run more experiments. 📌 To apply: 1️⃣ Make a submission to
    5.5K

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Terms·Privacy·Cookies·Accessibility·Ads Info·© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up
Advertisement
Advertisement