Modal (@modal) / X

Modal

1,523 posts

AI infrastructure that developers love 💚 Run inference, sandboxes, batch processing, training, and many other things on Modal

New York City

Joined July 2022

Pinned
Modal
@modal
May 21
Article
Modal's Series C: Raising $355M at a $4.65B valuation
We’ve raised $355 million after growing fivefold since September, surpassing $300 million in annualized revenue. Our valuation is $4.65B post-money in a round led by @generalcatalyst and @Redpoint,...
579K
Modal reposted
Akshat Bubna
@akshat_b
9h
You no longer have to pick between the performance of a black box API and the flexibility and control of @modal. Auto Endpoints give you both. We're unlocking frontier performance for everyone without having to talk to sales or an FDE. More cooking here, stay tuned.
Modal
@modal
10h
It is not too late to _actually_ own your inference. Introducing: Modal Auto Endpoints.
6.4K
Modal reposted
Erik Bernhardsson
@bernhardsson
9h
Managed private LLM endpoints, now available for everyone in @modal. Deploy in a few clicks with the UI or a few keystrokes with our CLI. The coolest thing is that these are not black boxes – customers have full access to the code underneath.
18K
Modal
@modal
10h
It is not too late to _actually_ own your inference. Introducing: Modal Auto Endpoints.
87K
Modal
@modal
10h
Introducing Modal Auto Endpoints: Optimized inference you actually own | Modal Blog
From modal.com
2.8K
Modal
@modal
Jun 22
.wait_until_ready(), set, go
Building performant sandbox systems goes way beyond the initial container boot. We're unpacking what that means, and breaking down some tools to help you manage the entire lifecycle.
18K
Modal
@modal
Jun 22
Read here:
Unpacking sandbox startup latency: why started ≠ ready | Modal Blog
From modal.com
1.4K
Modal reposted
Connor
@cnnradams
Jun 22
light work
17K
Modal
@modal
Jun 18
We're hosting an art show with @GrayAreaorg in San Francisco! 💚 Submissions are open till July 15: modal.art
Gray Area
@GrayAreaorg
Jun 18
📢 We're partnering with @modal to offer a new development and exhibition opportunity for artists with sustained engagements in artificial intelligence and the arts. This global open call seeks proposals for creative projects that demonstrate the intentional use of AI to further
7K
Modal
@modal
Jun 16
Sandbox startup latency and scaling can make or break your RL training run. Great post breaking this down, shown using Modal Sandboxes.
SemiAnalysis
@SemiAnalysis_
Jun 16
RL Systems Mind the Gap: Matching Trainer and Generator Throughput
RL Training Infrastructure, GRPO, PipelineRL, Async RL, Policy Staleness, RL Sandbox Infra, CPU Requirements, TCO Analysis, Thinking Machines Tinker
newsletter.semianalysis.com/p/rl-systems-m…
9.7K
Modal reposted
Erik Bernhardsson
@bernhardsson
Jun 16
Our sandbox team has been on a crusade against every millisecond of latency and it's paying off. More cool results coming very soon!
12K
Modal
@modal
Jun 15
Article
Product Updates: VM Sandboxes, Lower-latency routing, Domain allowlisting for Sandboxes, and More
🌍 Lower latency with regional routing
To improve latency, Functions can now run closer to where they're called from. Instead of routing through the default US East region, you can send a Function's...
6.1K
Modal
@modal
Jun 15
We worked with @lmsysorg and z-lab.ai to
- integrate DFlash spec into @sgl_project
- make it faster with overlap
- train a DFlash drafter for @Alibaba_Qwen 397B-A17B
The result: up to 4.3x greater throughput over baseline and 1.5x over native MTP.
40K
Modal
@modal
Jun 15
You can find the drafter on @huggingface, where we've each released an identical copy of the weights. Kinda like getting matching tats with your bestie
Our copy is here: huggingface.co/modal-labs/Qwe…
The repos include scripts that reproduce our benchmark showing superiority over MTP:
1.8K
Modal
@modal
Jun 15
You can read about DFlash, the SGLang Spec V2 overlap scheduler, and how it all came together on the @lmsysorg blog:
The next generation of speculative decoding: DFlash and Spec V2
From lmsys.org
1.6K
Modal reposted
mma1226
@mma12261
Jun 12
some bts ><🌾 📡 driggs ave and S 1st
5.3K
Modal reposted
sona dolasia
@teenychairs
Jun 12
williamsburg too 🗽
sona dolasia
@teenychairs
Jun 8
around sf 🌉
22K