Log inSign up
ngrok
2,026 posts
Image
user avatar
ngrok
@ngrokHQ
One gateway for all your traffic. Service status available at status.ngrok.com.
localhost
ngrok.com/signup
Joined October 2020
115
Following
4,622
Followers
  • Pinned
    user avatar
    ngrok
    @ngrokHQ
    20h
    You can now call models running on your own hardware through a hosted AI gateway. Privately, with one command: `ngrok http 8000 --url https://vllm.internal` Plus all the public models you know and love. ↓
    Image
    Route, Secure & Manage Any LLM with ngrok AI Gateway
    From ngrok.ai
    100K
  • ngrok reposted
    user avatar
    niji
    @nijikokun
    19h
    We spent a lot of time listening to feedback, and observing how users were working with our early access release. Super proud of the team and their efforts that went into this. If you want to route traffic to AI in your applications this is the gateway for you.
    user avatar
    ngrok
    @ngrokHQ
    20h
    You can now call models running on your own hardware through a hosted AI gateway. Privately, with one command: `ngrok http 8000 --url https://vllm.internal` Plus all the public models you know and love. ↓ ngrok.ai
    306
  • ngrok reposted
    user avatar
    Joel Hans
    @endentire
    22h
    I wired tiny OLED screens onto two Raspberry Pis so I can watch them find and forget each other. It's a live view of the kernel's ARP table: how every machine tracks its neighbors on a local network and maps IPs → MAC addresses.
    Image
    00:00
    1.4K
  • user avatar
    ngrok
    @ngrokHQ
    20h
    Replying to @ngrokHQ
    One flat fee: $0.05 per million tokens with your inference billed on top. Buy credits as you go. Full writeup → ngrok.com/blog/new-ngrok… Docs → ngrok.com/docs/ai-gatewa…
    67
  • user avatar
    ngrok
    @ngrokHQ
    20h
    Replying to @ngrokHQ
    When you route to a local model, it's full private connectivity: no publicly addressable URLs, no public IPs to allowlist, or ports to open, wherever you've got those GPUs humming. Works with Ollama, vLLM, LM Studio, and the neoclouds, too.
    90
  • user avatar
    ngrok
    @ngrokHQ
    20h
    Replying to @ngrokHQ
    One baseURL change and an access key gets you: → routing to every model, even dedicated ones on AWS, Azure, etc → automatic failover + retries when a model slows or fails → scoped keys per app → token, latency, and cost visibility by app, dev, and model
    Image
    211
  • user avatar
    ngrok
    @ngrokHQ
    Jun 30
    Todd gets it.
    user avatar
    Todd Chapman
    @TtheBC01
    Jun 30
    Replying to @ngrokHQ
    Ngrok lets me webhook into my agents which I appreciate
    947
  • user avatar
    ngrok
    @ngrokHQ
    Jun 30
    Yes, ngrok gives localhost a public URL. It's also developer infrastructure that routes and secures traffic to the wild thing you'll plug into prod next. Reach into customer networks. Put every bit of ingress behind one front door. Run one gateway for devices, APIs, and LLMs.
    Image
    00:00
    101K
  • user avatar
    ngrok
    @ngrokHQ
    Jun 30
    Our own @samwhoo ported Kubernetes to the browser. Like a real flippin' cluster with lifecycles, DNS, and a simulated network. It's ~100k lines of TypeScript, almost all written by LLMs, but with every line reviewed by hand to keep it slop-free ↓
    Image
    I ported Kubernetes to the browser | ngrok blog
    From ngrok.com
    6.8K
    user avatar
    ngrok
    @ngrokHQ
    Jun 30
    Those dots moving around a webpage? Real (enough) pods sending each other requests over a simulated network, all in your browser. Hit pause and the whole cluster freezes, because it runs on a fake clock. Go poke at the demo for yourself: webernetes-demo.ngrok.app
    Image
    00:00
    345
  • user avatar
    ngrok
    @ngrokHQ
    Jun 26
    This prompt caching post is still a banger and perhaps the perfect read for a Friday afternoon if you're light on token spend.
    Image
    Prompt caching: 10x cheaper LLM tokens, but how? | ngrok blog
    From ngrok.com
    1.7K
  • user avatar
    ngrok
    @ngrokHQ
    Jun 24
    In 1996, AOL went down for 19 hours. It pushed "NASA finds evidence of life on Mars" off the front page of the New York Times. We sponsored an SRE to write a human postmortem of that outage: the stuff five whys leave out.
    Image
    AOL was down (1996) | ngrok blog
    From ngrok.com
    304K
    user avatar
    ngrok
    @ngrokHQ
    Jun 24
    Replying to @ngrokHQ
    But in the end, the best material wasn't technical. It was all the very human stories happening around the outage. So Mac wonders: SREs treat the technology as the protagonist and the people affected as statistics… but maybe we've got it all backwards?
    134
    user avatar
    ngrok
    @ngrokHQ
    Jun 24
    Find more of Mac's writing at macchaffee.com!
    109
  • user avatar
    ngrok
    @ngrokHQ
    Jun 24
    Gather round and come on a journey with us back to 1996 when AOL went down ngrok.com/blog/aol-was-d…
    Image
    GIF
    217
  • user avatar
    ngrok
    @ngrokHQ
    Jun 23
    It's my favorite place
    user avatar
    Sakshi Sugandhi
    @SakshiSugandhi
    Jun 23
    Day 2 of vibe coding
    Image
    311

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Terms·Privacy·Cookies·Accessibility·Ads Info·© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up
Advertisement
Advertisement