Fred Sala (@fredsala) / X

Fred Sala

487 posts

Fred Sala

@fredsala

Assistant Professor @WisconsinCS. Chief scientist @SnorkelAI. Working on machine learning & information theory.

Madison, WI

pages.cs.wisc.edu/~fredsala/

Joined July 2010

Fred Sala
@fredsala
Apr 20, 2020
Thrilled to share that I will be joining the University of Wisconsin @WisconsinCS as an assistant professor in January 2021. Incredibly grateful to all of the wonderful people who have supported me on this journey!
Fred Sala
@fredsala
Feb 26, 2025
Tired of evaluating frontier models on contrived math olympiad problems? We have a cure! Try your models on our new benchmark for theoretical physics TPBench.
22K
Fred Sala
@fredsala
Oct 7, 2024
Distillation is an important process, but why limit ourselves to distilling models into models, instead of into other objects? In new work from my group, we distill model capabilities into programs—a spotlight at #neurips2024.
Tzu-Heng (Brian) Huang
@zihengh1
Oct 7, 2024
Annotating your data with state-of-the-art large language models can be costly and opaque. What can we do about this? Simple idea: instead of prompting LLMs for labels, we distill them into programs you can run locally for free. Introducing Alchemist, a Spotlight at #NeurIPS2024!
19K
Fred Sala
@fredsala
Sep 25, 2023
Excited to share six @NeurIPS 2023 papers with two spotlights from my group and with our wonderful collaborators! See you in New Orleans!
19K
Fred Sala
@fredsala
May 3, 2023
Generative models are awesome at producing data, and weak supervision is great at efficient labeling. Can we combine them to get cheap datasets for training or fine-tuning? Excited to present our #ICLR2023 paper "Generative Modeling Helps Weak Supervision (and Vice Versa)"
8.6K
Fred Sala
@fredsala
Jan 19, 2023
Very excited to share that our NeurIPS tutorial with Ramya Vinayak on efficient dataset construction is now freely available: nips.cc/virtual/2022/t… Bonus: amazing panel with @MayeeChen, @rdnowak, @codyaustun, and @SnorkelAI's @ajratner!
9.3K
Fred Sala
@fredsala
Dec 11, 2024
First up at #NeurIPS2024 from our group, our work on labeling via programmatic distillation (a spotlight!). Label your data orders of magnitude faster and cheaper — come join us today at Poster Session 2 East for a demo!
3.6K
Fred Sala
@fredsala
Dec 11, 2023
Can't wait to see everyone at #NeurIPS2023! Excited to present lots of fun work from my group and with our awesome collaborators, including
17K
Fred Sala
@fredsala
May 6, 2024
Come by #ICLR2024 Session 2 on Tuesday to see our work using representation editing to make foundation models robust! No fine-tuning, no additional data, no problem. arxiv.org/pdf/2309.04344
7.6K
Fred Sala
@fredsala
Apr 27, 2022
Join us tomorrow at @iclr_conf for our work on automating dataset construction for diverse data types arxiv.org/pdf/2112.03865… Poster Session 12, Thursday evening
Fred Sala
@fredsala
Jan 16, 2024
Excited to share that our work on improving the robustness of foundation models, without training or data, will be at @iclr_conf ! Longer version of our paper that won best paper honorable mention at the NeurIPS R0-FoMo workshop last month.
This Post is from an account that no longer exists. Learn more
8.7K
Fred Sala
@fredsala
Jun 5, 2024
Fun new work from our group spearheaded by @nick11roberts: we build new hybrid mixed-architecture models from pretrained model building blocks! arxiv.org/pdf/2406.00894 Feedback and comments appreciated!
Nicholas Roberts
@nick11roberts
Jun 5, 2024
So many new LLM architectures (Mambas🐍, Transformers🤖,🦙,🦔, Hyenas🐺,🦓…), so little GPU time to combine them into hybrid LLMs… Good news! Today we release Manticore, a system for creating **pretrained hybrids** from pretrained models! 👨‍🌾🦁🦂 arxiv.org/pdf/2406.00894 1/n
5K
Fred Sala
@fredsala
Feb 5, 2025
Some new work from our group that I'm very excited about! What makes weak-to-strong generalization possible? We think it's all about data!
Changho Shin
@Changho_Shin_
Feb 5, 2025
What enables a strong model to surpass its weaker teacher? 🚀 Excited to share our ICLR 2025 paper: "Weak-to-Strong Generalization Through the Data-Centric Lens"! 🧵
3.1K
Fred Sala
@fredsala
Jun 19, 2023
Excited to receive an American Family Funding Initiative Award for my group's work on data-efficient customization for large pretrained models! Thanks to the Data Science Institute (DSI) and @amfam!
5.6K