Leonard Tang (@leonardtang

Leonard Tang

2,236 posts

Leonard Tang

@leonardtang_

ceo @haizelabs

nyc

Joined May 2013

Pinned
Leonard Tang
@leonardtang_
Apr 3
Article
Towards Semantic Observability
Towards Semantic Observability Traditional observability works because the set of behaviors of failures was known ex ante. Engineers know what to look for in advance: A request failed. Latency...
44K
Leonard Tang
@leonardtang_
May 27, 2025
You don’t need frontier lab resources for frontier lab automated LLM evaluation. To prove this, we’re open-sourcing j1-nano and j1-micro: two absurdly tiny (600M & 1.7B parameters) but mighty reward models competitive with orders-of-magnitude larger peers. j1-nano and j1-micro
90K
Leonard Tang
@leonardtang_
Feb 19, 2025
First came pre-training scaling; then came inference-time scaling. Now comes judge-time scaling. Despite progress in AI through scaled inference-time compute, AI remains unreliable in open-ended, non-verifiable domains. The key limitation is not generation—it is evaluation.
00:00
111K
Leonard Tang
@leonardtang_
Aug 15, 2025
born to do research forced to build b2b saas
17K
Leonard Tang
@leonardtang_
Feb 12, 2025
i've been entirely consumed these past few weeks by the LLM-as-a-judge research agenda. there's lots of great work, but there's also lots of noise, confusion, and redundancy in the literature. i’ve started curating the highest-quality reads here:
GitHub - haizelabs/Awesome-LLM-Judges: ⚖️ Awesome LLM Judges ⚖️
From github.com
15K
Leonard Tang
@leonardtang_
Sep 30, 2024
honored to be amongst such amazing peers and community🫶
AI Grant
@aigrant
Sep 30, 2024
We are excited to announce the startups accepted into the fourth batch of AI Grant! See the thread below as well as aigrant.com to learn more
33K
Leonard Tang
@leonardtang_
Jun 12, 2024
super excited to share what we've been cooking up at @haizelabs🕊️🕊️ we are now in the era of grossly excessive AI hype and demoware. but it is high time to recalibrate and revisit the difficult, unsexy, underlying problem that everybody is avoiding -- the AI reliability and
Haize Labs
@haizelabs
Jun 12, 2024
Today is a bad, bad day to be a language model. Today, we announce the Haize Labs manifesto. @haizelabs haizes (automatically red-teams) AI systems to preemptively discover and eliminate any failure mode We showcase below one particular application of haizing: jailbreaking the
00:00
56K
Leonard Tang
@leonardtang_
May 8, 2025
w a t
23K
Leonard Tang
@leonardtang_
Apr 21, 2025
come join @qw3rtman @willccbb and myself for the inaugural communion of the NYC AI Reading Group! > where: @haizelabs hq > when: sunday 4/27 @ 11 am > what: inference-time scaling for generalist reward modeling from @deepseek_ai > who: awesome people like yourself :^)
19K
Leonard Tang
@leonardtang_
Apr 27, 2025
nyc ai 🚀🚀🚀 scintillating discussion on this fine sunday morning. much more to come. @qw3rtman @willccbb @haizelabs
11K
Leonard Tang
@leonardtang_
Oct 12, 2024
sorry that @haizelabs research is turning into students' homework :( github.com/haizelabs/llam…
12K
Leonard Tang
@leonardtang_
May 23, 2025
session #2 of the NYC AI Reading Group w/ @qw3rtman @willccbb is in order! > where: @haizelabs hq > when: thursday 5/29 @ 7:30 pm > what: sft memorizes, rl generalizes: a comparative study of foundation model post-training > who: awesome people like yourself :^) > also:
24K
Leonard Tang
@leonardtang_
Mar 10, 2025
we're looking for outlier talent to join the @haizelabs research team if you're interested in: - robustness of real-world AI - active learning - ultra-efficient model tuning - synthetic data generation - reward modeling - weak supervision dm us or apply below!
14K
Leonard Tang
@leonardtang_
Oct 24, 2025
We are thrilled to welcome Professor He He @hhexiy as an advisor to the Haize Labs team! Professor He leads a group at NYU focused on evaluation, scalable oversight, human–AI collaboration, and reasoning.
22K