Jan Leike (@janleike) / X

790 posts

Jan Leike

@janleike

AI research @AnthropicAI. Previously OpenAI & DeepMind. Optimizing for a post-AGI future where humanity flourishes. Opinions aren't my employer's.

San Francisco, USA

Joined March 2016

Pinned
Jan Leike
@janleike
May 8
Some personal news: I am starting a new research project at Anthropic. Very excited about this! Many things are needed to make AGI go well, and alignment is only one of them. More on this soon…
213K
Jan Leike
@janleike
May 17, 2024
Yesterday was my last day as head of alignment, superalignment lead, and executive @OpenAI.
6.1M
Jan Leike
@janleike
May 15, 2024
I resigned
6.8M
Jan Leike
@janleike
May 28, 2024
I'm excited to join @AnthropicAI to continue the superalignment mission! My new team will work on scalable oversight, weak-to-strong generalization, and automated alignment research. If you're interested in joining, my dms are open.
1.4M
Jan Leike
@janleike
May 17, 2024
Replying to @janleike
To all OpenAI employees, I want to say: Learn to feel the AGI. Act with the gravitas appropriate for what you're building. I believe you can "ship" the cultural change that's needed. I am counting on you. The world is counting on you. :openai-heart:
1.1M
Jan Leike
@janleike
Feb 3, 2025
We challenge you to break our new jailbreaking defense! There are 8 levels. Can you find a single jailbreak to beat them all? claude.ai/constitutional…
1.3M
Jan Leike
@janleike
May 17, 2024
Replying to @janleike
I joined because I thought OpenAI would be the best place in the world to do this research. However, I have been disagreeing with OpenAI leadership about the company's core priorities for quite some time, until we finally reached a breaking point.
986K
Jan Leike
@janleike
May 17, 2024
Replying to @janleike
Building smarter-than-human machines is an inherently dangerous endeavor. OpenAI is shouldering an enormous responsibility on behalf of all of humanity.
1.1M
Jan Leike
@janleike
Nov 20, 2023
I think the OpenAI board should resign
751K
Jan Leike
@janleike
May 17, 2024
Replying to @janleike
OpenAI must become a safety-first AGI company.
719K
Jan Leike
@janleike
May 17, 2024
Replying to @janleike
I believe much more of our bandwidth should be spent getting ready for the next generations of models, on security, monitoring, preparedness, safety, adversarial robustness, (super)alignment, confidentiality, societal impact, and related topics.
536K
Jan Leike
@janleike
May 17, 2024
Replying to @janleike
But over the past years, safety culture and processes have taken a backseat to shiny products.
922K
Jan Leike
@janleike
May 17, 2024
Replying to @janleike
We are long overdue in getting incredibly serious about the implications of AGI. We must prioritize preparing for them as best we can. Only then can we ensure AGI benefits all of humanity.
3.1M
Jan Leike
@janleike
Feb 13, 2023
With the InstructGPT paper we found that our models generalized to follow instructions in non-English even though we almost exclusively trained on English. We still don't know why. I wish someone would figure this out.
939K