Log inSign up
Greg Burnham
8,125 posts
Image
user avatar
Greg Burnham
@GregHBurnham
Researcher at @EpochAIResearch
Brooklyn, NY
lemmata.substack.com
Joined March 2016
721
Following
4,329
Followers

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Terms·Privacy·Cookies·Accessibility·Ads Info·© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up
  • user avatar
    Greg Burnham
    @GregHBurnham
    Aug 8, 2025
    Careful not to cut yourself on the jagged frontier
    Image
    882K
  • user avatar
    Greg Burnham
    @GregHBurnham
    Jan 20, 2025
    Replying to @shreyabasu003 and @collnsmith
    With white beans
    102K
  • user avatar
    Greg Burnham
    @GregHBurnham
    Aug 8, 2025
    Replying to @GregHBurnham
    Wrong 4/4 times with only the Thinking setting. Wrong only 2/4 times if I also append “Think really hard!”
    100K
  • user avatar
    Greg Burnham
    @GregHBurnham
    Jan 1, 2024
    Replying to @zzdoublezz
    This tweet was one scroll down from yours
    user avatar
    ELLɅ MɅCHINɅ
    @orchidcamp
    Jan 1, 2024
    loved that silly little quip bro. my favorite part was the deep sadness masked just faintly behind it
    44K
  • user avatar
    Greg Burnham
    @GregHBurnham
    Jul 24, 2022
    Since seeing this, any time I pass a samoyed in the street I mutter to myself, “hypoallergenic, if you can believe it”
    Age-restricted adult content. This content might not be appropriate for people under 18 years old. To view this media, you’ll need to log in to X. Learn more
  • user avatar
    Greg Burnham
    @GregHBurnham
    Aug 8, 2025
    Replying to @GregHBurnham
    This is the one GSM8K problem frontier models consistently struggle with, so it's a bit adversarially selected in that sense. I also think it's hard for humans! I got it wrong on my first try and... emailed the authors to point out the error. 🤦‍♂️ Source: platinum-bench.csail.mit.edu
    99K
  • user avatar
    Greg Burnham
    @GregHBurnham
    Sep 21, 2024
    Replying to @peligrietzer
    Comparative advantage in the home economy
    18K
  • user avatar
    Greg Burnham
    @GregHBurnham
    Jul 19, 2025
    Pretty happy with how my predictions are holding up. 5/6 was the gold medal threshold this year. OAI's "experimental reasoning LLM" got that exactly, failing only to solve the one hard combinatorics problem, P6. My advice remains: look beyond the medal. Brief thread. 1/
    Image
    47K
  • user avatar
    Greg Burnham
    @GregHBurnham
    Jun 3, 2025
    New from me: the USAMO problem that Gemini 2.5 Pro Deep Think newly solved happens to be one I wrote about a lot earlier this year. It's solution differs from the most common human solution in one interesting way. Excerpts here, link in reply
    Image
    Image
    Image
    Image
    22K
  • user avatar
    Greg Burnham
    @GregHBurnham
    May 25, 2021
    Replying to @smithsmm and @lastpositivist
    Another great story of Sendak understanding kids’ perspectives
    Image
    Image
    Image
    Image
  • user avatar
    Greg Burnham
    @GregHBurnham
    Dec 29, 2024
    Cowardice cutting off the recursion like this
    Image
    4.7K
  • user avatar
    Greg Burnham
    @GregHBurnham
    Sep 17, 2020
    Replying to @benkesslen and @oneunderscore__
    My fave
    user avatar
    Greg Burnham
    @GregHBurnham
    Aug 16, 2020
    No urban chaos quite like the block in Queens bounded by 58th Ave, 58th Pl, 58th Rd, and 58th St.
    Image
  • user avatar
    Greg Burnham
    @GregHBurnham
    Aug 11, 2025
    The 2025 AIME has fallen! On MathArena, GPT-5 (high) has solved the one problem that no prior model had solved, 2/4 times.
    Image
    9.5K
  • user avatar
    Greg Burnham
    @GregHBurnham
    Nov 19, 2020
    Replying to @limitlessjest
    Comrade, this is dangerous counterrevolutionary thinking
    Image
    Image
Advertisement
Advertisement