Jan Leike
790 posts
Jan Leike
@janleike
AI research @AnthropicAI. Previously OpenAI & DeepMind. Optimizing for a post-AGI future where humanity flourishes. Opinions aren't my employer's.
San Francisco, USA
jan.leike.name
Joined March 2016
335 Following
132.8K Followers
  • Pinned
    Jan Leike
    @janleike
    May 8
    Some personal news: I am starting a new research project at Anthropic. Very excited about this! Many things are needed to make AGI go well, and alignment is only one of them. More on this soon…
    213K
  • Jan Leike
    @janleike
    May 17, 2024
    Yesterday was my last day as head of alignment, superalignment lead, and executive @OpenAI.
    6.1M
  • Jan Leike
    @janleike
    May 15, 2024
    I resigned
    6.8M
  • Jan Leike
    @janleike
    May 28, 2024
    I'm excited to join @AnthropicAI to continue the superalignment mission! My new team will work on scalable oversight, weak-to-strong generalization, and automated alignment research. If you're interested in joining, my dms are open.
    1.4M
  • Jan Leike
    @janleike
    May 17, 2024
    Replying to @janleike
    To all OpenAI employees, I want to say: Learn to feel the AGI. Act with the gravitas appropriate for what you're building. I believe you can "ship" the cultural change that's needed. I am counting on you. The world is counting on you. :openai-heart:
    1.1M
  • Jan Leike
    @janleike
    Feb 3, 2025
    We challenge you to break our new jailbreaking defense! There are 8 levels. Can you find a single jailbreak to beat them all? claude.ai/constitutional…
    1.3M
  • Jan Leike
    @janleike
    May 17, 2024
    Replying to @janleike
    I joined because I thought OpenAI would be the best place in the world to do this research. However, I have been disagreeing with OpenAI leadership about the company's core priorities for quite some time, until we finally reached a breaking point.
    986K
  • Jan Leike
    @janleike
    May 17, 2024
    Replying to @janleike
    Building smarter-than-human machines is an inherently dangerous endeavor. OpenAI is shouldering an enormous responsibility on behalf of all of humanity.
    1.1M
  • Jan Leike
    @janleike
    Nov 20, 2023
    I think the OpenAI board should resign
    751K
  • Jan Leike
    @janleike
    May 17, 2024
    Replying to @janleike
    OpenAI must become a safety-first AGI company.
    719K
  • Jan Leike
    @janleike
    May 17, 2024
    Replying to @janleike
    I believe much more of our bandwidth should be spent getting ready for the next generations of models, on security, monitoring, preparedness, safety, adversarial robustness, (super)alignment, confidentiality, societal impact, and related topics.
    536K
  • Jan Leike
    @janleike
    May 17, 2024
    Replying to @janleike
    But over the past years, safety culture and processes have taken a backseat to shiny products.
    922K
  • Jan Leike
    @janleike
    May 17, 2024
    Replying to @janleike
    We are long overdue in getting incredibly serious about the implications of AGI. We must prioritize preparing for them as best we can. Only then can we ensure AGI benefits all of humanity.
    3.1M
  • Jan Leike
    @janleike
    Feb 13, 2023
    With the InstructGPT paper we found that our models generalized to follow instructions in non-English even though we almost exclusively trained on English. We still don't know why. I wish someone would figure this out.
    939K
