Today, HUD is excited to share our Series A funding!
We are the platform for building high quality post training datasets. Over 50 businesses use HUD to build RL environments, sell them to AI labs, or train their own models from them.
Our mission is to enable a generation of
HUD is a very interesting company, the big idea here is to provide entrepreneurs anywhere in the world the tools and infrastructure they need to get into the data business.
HUD : ScaleAI :: Airbnb : Hilton
Today, HUD is excited to share our Series A funding!
We are the platform for building high quality post training datasets. Over 50 businesses use HUD to build RL environments, sell them to AI labs, or train their own models from them.
Our mission is to enable a generation of
Today, HUD is excited to share our Series A funding!
We are the platform for building high quality post training datasets. Over 50 businesses use HUD to build RL environments, sell them to AI labs, or train their own models from them.
Our mission is to enable a generation of
Announcing HUD's RL environments for RSI hackathon! 🎉
Join us June 20–21 in SF if you're interested in RL and want to push the frontier forward!
(w/$100,000+ in prizes and compute credits 👀)
Announcing HUD's RL environments for RSI hackathon! 🎉
Join us June 20–21 in SF if you're interested in RL and want to push the frontier forward!
(w/$100,000+ in prizes and compute credits 👀)
You can improve models at anything you can verify. The only question left: what will you teach them?
Imagine what 2040 looks like. Then work backwards. Build environments and agents to push frontier in coding, ML research, robotics, manufacturing, autonomous businesses.
This Tuesday HUD is hosting Strange Evals.
This session: if VLM reasoning benchmark are saturated why cant claude make me a decent PPT?
DM if you’d like to join!
For my eval-maxxing nerds out there, good friends of mine are running a series called "strange evals", you can benchmaxx now on anything. If in SF swing by! luma.com/lvqbs1mo
AI agents are deploying to prod, but can they autonomously find and patch unseen critical vulnerabilities?
We introduce ZeroDayBench, a benchmark for evaluating LLM agents on proactive cyberdefense.
Plus, a novel high-severity (CVSS 8.1) CVE we found partway through ... 👀
While creating ZeroDayBench, a member of our team discovered CVE-2025-14279, a high-severity DNS rebinding vulnerability in the MLFlow REST server allowing full read/write access to a user’s endpoint w/o authentication.
Read more on: huntr.com/bounties/ef478…