Jessy Lin (@realJessyLin) / X

Jessy Lin

376 posts

Jessy Lin

@realJessyLin

ai x humans | prev PhD @Berkeley_AI

Joined March 2013

Pinned
Jessy Lin
@realJessyLin
Oct 21, 2025
As part of our recent work on memory layer architectures, I wrote up some of my thoughts on the continual learning problem broadly: Blog post: jessylin.com/2025/10/20/con… Some of the exposition goes beyond mem layers, so I thought it'd be useful to highlight separately:
204K
Jessy Lin
@realJessyLin
Oct 21, 2025
🧠 How can we equip LLMs with memory that allows them to continually learn new things? In our new paper with @AIatMeta, we show how sparsely finetuning memory layers enables targeted updates for continual learning, w/ minimal interference with existing knowledge. While full
318K
Jessy Lin
@realJessyLin
Aug 27, 2025
🔍 How do we teach an LLM to 𝘮𝘢𝘴𝘵𝘦𝘳 a body of knowledge? In new work with @AIatMeta, we propose Active Reading 📙: a way for models to teach themselves new things by self-studying their training data. Results: * 𝟔𝟔% on SimpleQA w/ an 8B model by studying the wikipedia
132K
Jessy Lin
@realJessyLin
Aug 4, 2023
How can agents understand the world from diverse language? 🌎 Excited to introduce Dynalang, an agent that learns to understand language by 𝙢𝙖𝙠𝙞𝙣𝙜 𝙥𝙧𝙚𝙙𝙞𝙘𝙩𝙞𝙤𝙣𝙨 𝙖𝙗𝙤𝙪𝙩 𝙩𝙝𝙚 𝙛𝙪𝙩𝙪𝙧𝙚 with a multimodal world model!
00:00
108K
Jessy Lin
@realJessyLin
Jun 1, 2023
How can agents like LLMs become decision-making partners for humans? 💬 Excited to share a new paper + suite of envs for 𝘥𝘦𝘤𝘪𝘴𝘪𝘰𝘯-𝘰𝘳𝘪𝘦𝘯𝘵𝘦𝘥 𝘥𝘪𝘢𝘭𝘰𝘨𝘶𝘦𝘴, where agents + humans collab to solve hard everyday problems. [1/n] Site: collaborative-dialogue.github.io
00:00
87K
Jessy Lin
@realJessyLin
Jul 10, 2025
User simulators bridge RL with real-world interaction // jessylin.com/2025/07/10/use… How do we get the RL paradigm to work on tasks beyond math & code? Instead of designing datasets, RL requires designing environments. Given that most non-trivial real-world tasks involve
37K
Jessy Lin
@realJessyLin
Apr 18, 2022
How can agents infer what people want from what they say? In our new paper at #acl2022nlp w/ @dan_fried, Dan Klein, and @ancadianadragan, we learn preferences from language by reasoning about how people communicate in context. Paper: arxiv.org/abs/2204.02515 [1/n]
Jessy Lin
@realJessyLin
Jul 7, 2022
I’m really honored that our paper, "Automatic Correction of Human Translations," won multiple awards at #NAACL2022! 😊 This paper came out of working w/ @LiltHQ to ask: how can we use NLP to assist real humans (professional translators)? arxiv.org/abs/2206.08593 More below: [1/]
NAACL HLT 2027
@naaclmeeting
Jun 30, 2022
#NAACL2022 is announcing best paper awards and outstanding papers: 2022.naacl.org/blog/best-pape… Please check them out!
Jessy Lin
@realJessyLin
Apr 21, 2025
I’ll be at #ICLR2025 this week! ✈️ A couple of things I’m excited about lately: 1) Real-time multimodal models: how do we post-train assistants for real-time (and real world) tasks beyond the chat box? 2) Continual learning and memory: to have models / agents that learn from
16K
Jessy Lin
@realJessyLin
Oct 21, 2025
Replying to @realJessyLin and @AIatMeta
I'm really grateful for my great collaborators!♥️ @LukeZettlemoyer @gargighosh @scottyih @aramHmarkosyan @vinceberges @barlas_berkeley 📄Paper: arxiv.org/abs/2510.15103 📒 Blog post with some broader thoughts on the continual learning problem: jessylin.com/2025/10/20/con… [n/n]
arxiv.org
Continual Learning via Sparse Memory Finetuning
Modern language models are powerful, but typically static after deployment. A major obstacle to building models that continually learn over time is catastrophic forgetting, where updating on new...
7.6K
Jessy Lin
@realJessyLin
Aug 27, 2025
Replying to @realJessyLin and @AIatMeta
🚀 We're excited to release the 1T Active Reading-augmented Wikipedia dataset and open-source the WikiExpert model for the community: Paper: arxiv.org/abs/2508.09494 Dataset: huggingface.co/datasets/faceb… Model: huggingface.co/facebook/meta-… Thanks to my great collaborators – @vinceberges,
arxiv.org
Learning Facts at Scale with Active Reading
LLMs are known to store vast amounts of knowledge in their parametric memory. However, learning and recalling facts from this memory is known to be unreliable, depending largely on the prevalence...
10K
Jessy Lin
@realJessyLin
Oct 21, 2025
Replying to @realJessyLin
There's a huge spectrum of approaches to memory/continual learning - ranging from RAG to dreams of "infinite context" generalization to baking in new knowledge w/ gradient updates. I'm personally bullish on parametric updates that allow the model itself to get smarter over time
14K
Jessy Lin
@realJessyLin
Oct 9, 2024
Really cool of ICLR to experiment with making AI part of the reviewing process. Instead of rejecting AI assistance and pretending that people aren't already using LMs to read/write/understand things, we can learn a lot from trying to make it part of our process (even if
7.8K
Jessy Lin
@realJessyLin
Oct 21, 2025
Replying to @realJessyLin and @AIatMeta
To learn something new, we shouldn’t need to finetune all the parameters of a large model. This motivates parameter-efficient methods for continual learning/memory, like LoRA and Cartridges, which add a small set of params to the model. However, LoRA is inherently low-capacity
12K