Log inSign up
Fred Sala
487 posts
Image
user avatar
Fred Sala
@fredsala
Assistant Professor @WisconsinCS. Chief scientist @SnorkelAI. Working on machine learning & information theory.
Madison, WI
pages.cs.wisc.edu/~fredsala/
Joined July 2010
842
Following
1,551
Followers
  • user avatar
    Fred Sala
    @fredsala
    Apr 20, 2020
    Thrilled to share that I will be joining the University of Wisconsin @WisconsinCS as an assistant professor in January 2021. Incredibly grateful to all of the wonderful people who have supported me on this journey!
  • user avatar
    Fred Sala
    @fredsala
    Feb 26, 2025
    Tired of evaluating frontier models on contrived math olympiad problems? We have a cure! Try your models on our new benchmark for theoretical physics TPBench.
    Image
    22K
  • user avatar
    Fred Sala
    @fredsala
    Oct 7, 2024
    Distillation is an important process, but why limit ourselves to distilling models into models, instead of into other objects? In new work from my group, we distill model capabilities into programs—a spotlight at #neurips2024.
    user avatar
    Tzu-Heng (Brian) Huang
    @zihengh1
    Oct 7, 2024
    Annotating your data with state-of-the-art large language models can be costly and opaque. What can we do about this? Simple idea: instead of prompting LLMs for labels, we distill them into programs you can run locally for free. Introducing Alchemist, a Spotlight at #NeurIPS2024!
    19K
  • user avatar
    Fred Sala
    @fredsala
    Sep 25, 2023
    Excited to share six @NeurIPS 2023 papers with two spotlights from my group and with our wonderful collaborators! See you in New Orleans!
    19K
  • user avatar
    Fred Sala
    @fredsala
    May 3, 2023
    Generative models are awesome at producing data, and weak supervision is great at efficient labeling. Can we combine them to get cheap datasets for training or fine-tuning? Excited to present our #ICLR2023 paper "Generative Modeling Helps Weak Supervision (and Vice Versa)"
    Image
    8.6K
  • user avatar
    Fred Sala
    @fredsala
    Jan 19, 2023
    Very excited to share that our NeurIPS tutorial with Ramya Vinayak on efficient dataset construction is now freely available: nips.cc/virtual/2022/t… Bonus: amazing panel with @MayeeChen, @rdnowak, @codyaustun, and @SnorkelAI's @ajratner!
    9.3K
  • user avatar
    Fred Sala
    @fredsala
    Dec 11, 2024
    First up at #NeurIPS2024 from our group, our work on labeling via programmatic distillation (a spotlight!). Label your data orders of magnitude faster and cheaper — come join us today at Poster Session 2 East for a demo!
    Image
    3.6K
  • user avatar
    Fred Sala
    @fredsala
    Dec 11, 2023
    Can't wait to see everyone at #NeurIPS2023! Excited to present lots of fun work from my group and with our awesome collaborators, including
    Image
    17K
  • user avatar
    Fred Sala
    @fredsala
    May 6, 2024
    Come by #ICLR2024 Session 2 on Tuesday to see our work using representation editing to make foundation models robust! No fine-tuning, no additional data, no problem. arxiv.org/pdf/2309.04344
    Image
    7.6K
  • user avatar
    Fred Sala
    @fredsala
    Apr 27, 2022
    Join us tomorrow at @iclr_conf for our work on automating dataset construction for diverse data types arxiv.org/pdf/2112.03865… Poster Session 12, Thursday evening
  • user avatar
    Fred Sala
    @fredsala
    Jan 16, 2024
    Excited to share that our work on improving the robustness of foundation models, without training or data, will be at @iclr_conf ! Longer version of our paper that won best paper honorable mention at the NeurIPS R0-FoMo workshop last month.
    This Post is from an account that no longer exists. Learn more
    8.7K
  • user avatar
    Fred Sala
    @fredsala
    Jun 5, 2024
    Fun new work from our group spearheaded by @nick11roberts: we build new hybrid mixed-architecture models from pretrained model building blocks! arxiv.org/pdf/2406.00894 Feedback and comments appreciated!
    user avatar
    Nicholas Roberts
    @nick11roberts
    Jun 5, 2024
    So many new LLM architectures (Mambas🐍, Transformers🤖,🦙,🦔, Hyenas🐺,🦓…), so little GPU time to combine them into hybrid LLMs… Good news! Today we release Manticore, a system for creating **pretrained hybrids** from pretrained models! 👨‍🌾🦁🦂 arxiv.org/pdf/2406.00894 1/n
    Image
    5K
  • user avatar
    Fred Sala
    @fredsala
    Feb 5, 2025
    Some new work from our group that I'm very excited about! What makes weak-to-strong generalization possible? We think it's all about data!
    user avatar
    Changho Shin
    @Changho_Shin_
    Feb 5, 2025
    What enables a strong model to surpass its weaker teacher? 🚀 Excited to share our ICLR 2025 paper: "Weak-to-Strong Generalization Through the Data-Centric Lens"! 🧵
    Image
    3.1K
  • user avatar
    Fred Sala
    @fredsala
    Jun 19, 2023
    Excited to receive an American Family Funding Initiative Award for my group's work on data-efficient customization for large pretrained models! Thanks to the Data Science Institute (DSI) and @amfam!
    Image
    5.6K

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Terms·Privacy·Cookies·Accessibility·Ads Info·© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up
Advertisement
Advertisement