Log inSign up
Greg Diamos
815 posts
user avatar
Greg Diamos
@GregoryDiamos
I build AI supercomputers
linkedin.com/in/gregory-dia…
Joined April 2013
141
Following
3,009
Followers
  • user avatar
    Greg Diamos
    @GregoryDiamos
    Dec 18, 2023
    If you are an AI startup blocked on GPUs, send me a note. At Lamini, we figured out how to use AMD GPUs, which gives us a relatively large supply compared to the rest of the market.
    288K
  • user avatar
    Greg Diamos
    @GregoryDiamos
    Oct 31, 2023
    Lamini now supports JSON output at full speed. We guarantee that your LLM produces a valid JSON object according to your spec. Works for any LLM we support, e.g. Mistral 7b show below. API Docs: lamini-ai.github.io/API_v2/complet…
    Image
    52K
  • user avatar
    Greg Diamos
    @GregoryDiamos
    Dec 18, 2023
    Replying to @remusrisnov
    IMO, ROCm is 90% of the way there, but the market treats it like it is 0% of the way there. We filled in the 90%->100% gaps for LLM finetuning and inference with a team of a few good hackers.
    18K
  • user avatar
    Greg Diamos
    @GregoryDiamos
    Jan 8, 2024
    New LLMs for new years! On my vacation I streamed a 5 hour walkthrough of building an optimized medical LLM from scratch using an AMD supercomputer from a beach in Hawaii. All of the code is open source! github.com/lamini-ai/lami… youtu.be/Xkzd_YNbWmc?fe… via @YouTube
    github.com
    GitHub - lamini-ai/lamini-examples
    Contribute to lamini-ai/lamini-examples development by creating an account on GitHub.
    21K
  • user avatar
    Greg Diamos
    @GregoryDiamos
    Dec 14, 2021
    I'm so excited to release two giant speech datasets today: mlcommons.org/en/peoples-spe… mlcommons.org/en/multilingua… with clean CC licenses with academic and commercial use! Special shout out to Daniel Galvez, Mark Mazumder, and the whole team who put in a huge effort to create these.
    mlcommons.org
    People's Speech Dataset | MLCommons Datasets
    The People’s Speech Dataset contains 30,000 hours of conversational English speech recognition licensed for academic and commercial machine learning usage.
  • user avatar
    Greg Diamos
    @GregoryDiamos
    Mar 16, 2024
    Here's a new earnings call Q&A dataset I made with Lamini. It has millions of questions and answers generating using a Lamini LLM reading earnings call transcripts. huggingface.co/datasets/lamin… Still uploading - 10k finished so far - should have about 1M by tonight
    Image
    lamini/earnings-calls-qa · Datasets at Hugging Face
    From huggingface.co
    28K
  • user avatar
    Greg Diamos
    @GregoryDiamos
    Aug 16, 2023
    Excited to release 1.109 billion times faster LLM  switching using a PEFT cache in HBM.  See the blog for details of how it works. Finding an opportunity for a 1 billion times speedup suggests that we are just scratching the surface of fine tuning custom LLMs.
    user avatar
    Lamini
    @LaminiAI
    Aug 16, 2023
    Training multiple LLMs taking forever? 😤 Costing you a fortune?💸 Enter PEFT! Get ready to multiply!! 🚀 1000 models, just 1 machine! 🤖 3 months of training -> 3 milliseconds ⚡️ Just one API call, load and train with Lamini! 👉lamini.ai/blog/one-billi… 👀youtube.com/shorts/7X8fSSe…
    50K
  • user avatar
    Greg Diamos
    @GregoryDiamos
    Mar 30, 2024
    Hiring a world class ML Engineer at Lamini. Drop me a note if you want to push model accuracy to the limit on unlimited AMD compute.
    64K
  • user avatar
    Greg Diamos
    @GregoryDiamos
    May 4, 2024
    This is the paper that convinced me - proceedings.mlr.press/v28/coates13.p… Showing that a Frankenstein CUDA cluster could beat a 10,000 cpu map reduce cluster
    user avatar
    Andrej Karpathy
    @karpathy
    May 3, 2024
    # CUDA/C++ origins of Deep Learning Fun fact many people might have heard about the ImageNet / AlexNet moment of 2012, and the deep learning revolution it started. en.wikipedia.org/wiki/AlexNet What's maybe a bit less known is that the code backing this winning submission to the
    Image
    20K
  • user avatar
    Greg Diamos
    @GregoryDiamos
    Nov 5, 2023
    Nice work replicating this. The recipe is out. AMD GPUs work. lamini.ai/blog/lamini-ll…
    user avatar
    Eric Hartford
    @QuixiAI
    Nov 5, 2023
    I've been working on my AI servers in my garage. (8x AMD Instinct mi100) Yesterday I got the first server running and inferencing with oobabooga. Now duplicating the os drive to get the 2nd server running. Then once infiniband is setup I will start working on getting
    Image
    Image
    Image
    Image
    23K
  • user avatar
    Greg Diamos
    @GregoryDiamos
    Mar 2, 2024
    We are hiring an HPC (MPI / OpenAI Triton) Engineer at Lamini. Apply here: jobs.lever.co/laminiai/af688… We are inventing and building the largest AMD LLM training system in the world. Join us in strongly scaling to 1000s of GPUs and beyond.
    15K
  • user avatar
    Greg Diamos
    @GregoryDiamos
    Oct 2, 2023
    Hit me up if you need GPUs for finetuning LLMs. We are bringing online more capacity together with AMD. Start training on AMD in 3 lines of code. pip install lamini from lamini import LlamaV2Runner model = LlamaV2Runner() model.load_data(path=...) model.train(args=...)
    user avatar
    Lisa Su
    AMD
    @LisaSu
    Sep 26, 2023
    Love working with @LaminiAI and @realSharonZhou making LLMs easy and accessible for all on @AMD @AMDInstinct GPUs! So cool what can be done with @LaminiAI LLM Superstations!!
    41K
  • user avatar
    Greg Diamos
    @GregoryDiamos
    Dec 21, 2023
    I remember the same time, it wasn’t luck. My first exposure to the CUDA vision was from John Nickolls . He very clearly saw it as becoming the dominant form of computing.
    user avatar
    Bryan Catanzaro
    @ctnzr
    Dec 20, 2023
    I worked at Intel on Larrabee applications in 2007. Then I went to NVIDIA to work on ML in 2008. So I was there at both places at that time and I can say: NVIDIA's dominance didn't come from luck. It came from vision and execution. Which Intel lacked.
    14K
  • user avatar
    Greg Diamos
    @GregoryDiamos
    Dec 7, 2017
    We used a supercomputer to perform the largest study to date of how deep learning scales up with more data and faster computers. It turns out to be simple and predictable across diverse applications. research.baidu.com/deep-learning-…

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Terms·Privacy·Cookies·Accessibility·Ads Info·© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up
Advertisement
Advertisement