Log inSign up
Leonard Tang
2,236 posts
Image
user avatar
Leonard Tang
@leonardtang_
ceo @haizelabs
nyc
leonardtang.me
Joined May 2013
1,848
Following
4,367
Followers
  • Pinned
    user avatar
    Leonard Tang
    @leonardtang_
    Apr 3
    Article cover image
    Article
    Towards Semantic Observability
    Towards Semantic Observability Traditional observability works because the set of behaviors of failures was known ex ante. Engineers know what to look for in advance: A request failed. Latency...
    44K
  • user avatar
    Leonard Tang
    @leonardtang_
    May 27, 2025
    You don’t need frontier lab resources for frontier lab automated LLM evaluation. To prove this, we’re open-sourcing j1-nano and j1-micro: two absurdly tiny (600M & 1.7B parameters) but mighty reward models competitive with orders-of-magnitude larger peers. j1-nano and j1-micro
    Image
    90K
  • user avatar
    Leonard Tang
    @leonardtang_
    Feb 19, 2025
    First came pre-training scaling; then came inference-time scaling. Now comes judge-time scaling. Despite progress in AI through scaled inference-time compute, AI remains unreliable in open-ended, non-verifiable domains. The key limitation is not generationβ€”it is evaluation.
    Image
    00:00
    111K
  • user avatar
    Leonard Tang
    @leonardtang_
    Aug 15, 2025
    born to do research forced to build b2b saas
    17K
  • user avatar
    Leonard Tang
    @leonardtang_
    Feb 12, 2025
    i've been entirely consumed these past few weeks by the LLM-as-a-judge research agenda. there's lots of great work, but there's also lots of noise, confusion, and redundancy in the literature. i’ve started curating the highest-quality reads here:
    Image
    GitHub - haizelabs/Awesome-LLM-Judges: βš–οΈ Awesome LLM Judges βš–οΈ
    From github.com
    15K
  • user avatar
    Leonard Tang
    @leonardtang_
    Sep 30, 2024
    honored to be amongst such amazing peers and community🫢
    user avatar
    AI Grant
    @aigrant
    Sep 30, 2024
    We are excited to announce the startups accepted into the fourth batch of AI Grant! See the thread below as well as aigrant.com to learn more
    Image
    33K
  • user avatar
    Leonard Tang
    @leonardtang_
    Jun 12, 2024
    super excited to share what we've been cooking up at @haizelabsπŸ•ŠοΈπŸ•ŠοΈ we are now in the era of grossly excessive AI hype and demoware. but it is high time to recalibrate and revisit the difficult, unsexy, underlying problem that everybody is avoiding -- the AI reliability and
    user avatar
    Haize Labs
    @haizelabs
    Jun 12, 2024
    Today is a bad, bad day to be a language model. Today, we announce the Haize Labs manifesto. @haizelabs haizes (automatically red-teams) AI systems to preemptively discover and eliminate any failure mode We showcase below one particular application of haizing: jailbreaking the
    Image
    00:00
    56K
  • user avatar
    Leonard Tang
    @leonardtang_
    May 8, 2025
    w a t
    Image
    23K
  • user avatar
    Leonard Tang
    @leonardtang_
    Apr 21, 2025
    come join @qw3rtman @willccbb and myself for the inaugural communion of the NYC AI Reading Group! > where: @haizelabs hq > when: sunday 4/27 @ 11 am > what: inference-time scaling for generalist reward modeling from @deepseek_ai > who: awesome people like yourself :^)
    19K
  • user avatar
    Leonard Tang
    @leonardtang_
    Apr 27, 2025
    nyc ai πŸš€πŸš€πŸš€ scintillating discussion on this fine sunday morning. much more to come. @qw3rtman @willccbb @haizelabs
    Image
    11K
  • user avatar
    Leonard Tang
    @leonardtang_
    Oct 12, 2024
    sorry that @haizelabs research is turning into students' homework :( github.com/haizelabs/llam…
    Image
    12K
  • user avatar
    Leonard Tang
    @leonardtang_
    May 23, 2025
    session #2 of the NYC AI Reading Group w/ @qw3rtman @willccbb is in order! > where: @haizelabs hq > when: thursday 5/29 @ 7:30 pm > what: sft memorizes, rl generalizes: a comparative study of foundation model post-training > who: awesome people like yourself :^) > also:
    24K
  • user avatar
    Leonard Tang
    @leonardtang_
    Mar 10, 2025
    we're looking for outlier talent to join the @haizelabs research team if you're interested in: - robustness of real-world AI - active learning - ultra-efficient model tuning - synthetic data generation - reward modeling - weak supervision dm us or apply below!
    14K
  • user avatar
    Leonard Tang
    @leonardtang_
    Oct 24, 2025
    We are thrilled to welcome Professor He He @hhexiy as an advisor to the Haize Labs team! Professor He leads a group at NYU focused on evaluation, scalable oversight, human–AI collaboration, and reasoning.
    Image
    22K

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

TermsΒ·PrivacyΒ·CookiesΒ·AccessibilityΒ·Ads InfoΒ·Β© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up
Advertisement
Advertisement