Jessy Lin
376 posts
Jessy Lin
@realJessyLin
ai x humans | prev PhD @Berkeley_AI
jessylin.com
Joined March 2013
1,022 Following · 5,281 Followers
  • Pinned
    Jessy Lin
    @realJessyLin
    Oct 21, 2025
    As part of our recent work on memory layer architectures, I wrote up some of my thoughts on the continual learning problem broadly: Blog post: jessylin.com/2025/10/20/con… Some of the exposition goes beyond mem layers, so I thought it'd be useful to highlight separately:
    204K
  • Jessy Lin
    @realJessyLin
    Oct 21, 2025
    🧠 How can we equip LLMs with memory that allows them to continually learn new things? In our new paper with @AIatMeta, we show how sparsely finetuning memory layers enables targeted updates for continual learning, w/ minimal interference with existing knowledge. While full
    318K
  • Jessy Lin
    @realJessyLin
    Aug 27, 2025
    🔍 How do we teach an LLM to 𝘮𝘢𝘴𝘵𝘦𝘳 a body of knowledge? In new work with @AIatMeta, we propose Active Reading 📙: a way for models to teach themselves new things by self-studying their training data. Results: * 𝟔𝟔% on SimpleQA w/ an 8B model by studying the wikipedia
    132K
  • Jessy Lin
    @realJessyLin
    Aug 4, 2023
    How can agents understand the world from diverse language? 🌎 Excited to introduce Dynalang, an agent that learns to understand language by 𝙢𝙖𝙠𝙞𝙣𝙜 𝙥𝙧𝙚𝙙𝙞𝙘𝙩𝙞𝙤𝙣𝙨 𝙖𝙗𝙤𝙪𝙩 𝙩𝙝𝙚 𝙛𝙪𝙩𝙪𝙧𝙚 with a multimodal world model!
    108K
  • Jessy Lin
    @realJessyLin
    Jun 1, 2023
    How can agents like LLMs become decision-making partners for humans? 💬 Excited to share a new paper + suite of envs for 𝘥𝘦𝘤𝘪𝘴𝘪𝘰𝘯-𝘰𝘳𝘪𝘦𝘯𝘵𝘦𝘥 𝘥𝘪𝘢𝘭𝘰𝘨𝘶𝘦𝘴, where agents + humans collab to solve hard everyday problems. [1/n] Site: collaborative-dialogue.github.io
    87K
  • Jessy Lin
    @realJessyLin
    Jul 10, 2025
    User simulators bridge RL with real-world interaction // jessylin.com/2025/07/10/use… How do we get the RL paradigm to work on tasks beyond math & code? Instead of designing datasets, RL requires designing environments. Given that most non-trivial real-world tasks involve
    37K
  • Jessy Lin
    @realJessyLin
    Apr 18, 2022
    How can agents infer what people want from what they say? In our new paper at #acl2022nlp w/ @dan_fried, Dan Klein, and @ancadianadragan, we learn preferences from language by reasoning about how people communicate in context. Paper: arxiv.org/abs/2204.02515 [1/n]
  • Jessy Lin
    @realJessyLin
    Jul 7, 2022
    I’m really honored that our paper, "Automatic Correction of Human Translations," won multiple awards at #NAACL2022! 😊 This paper came out of working w/ @LiltHQ to ask: how can we use NLP to assist real humans (professional translators)? arxiv.org/abs/2206.08593 More below: [1/]
    NAACL HLT 2027
    @naaclmeeting
    Jun 30, 2022
    #NAACL2022 is announcing best paper awards and outstanding papers: 2022.naacl.org/blog/best-pape… Please check them out!
  • Jessy Lin
    @realJessyLin
    Apr 21, 2025
    I’ll be at #ICLR2025 this week! ✈️ A couple of things I’m excited about lately: 1) Real-time multimodal models: how do we post-train assistants for real-time (and real world) tasks beyond the chat box? 2) Continual learning and memory: to have models / agents that learn from
    16K
  • Jessy Lin
    @realJessyLin
    Oct 21, 2025
    Replying to @realJessyLin and @AIatMeta
    I'm really grateful for my great collaborators!♥️ @LukeZettlemoyer @gargighosh @scottyih @aramHmarkosyan @vinceberges @barlas_berkeley 📄Paper: arxiv.org/abs/2510.15103 📒 Blog post with some broader thoughts on the continual learning problem: jessylin.com/2025/10/20/con… [n/n]
    arxiv.org
    Continual Learning via Sparse Memory Finetuning
    Modern language models are powerful, but typically static after deployment. A major obstacle to building models that continually learn over time is catastrophic forgetting, where updating on new...
    7.6K
  • Jessy Lin
    @realJessyLin
    Aug 27, 2025
    Replying to @realJessyLin and @AIatMeta
    🚀 We're excited to release the 1T Active Reading-augmented Wikipedia dataset and open-source the WikiExpert model for the community: Paper: arxiv.org/abs/2508.09494 Dataset: huggingface.co/datasets/faceb… Model: huggingface.co/facebook/meta-… Thanks to my great collaborators – @vinceberges,
    arxiv.org
    Learning Facts at Scale with Active Reading
    LLMs are known to store vast amounts of knowledge in their parametric memory. However, learning and recalling facts from this memory is known to be unreliable, depending largely on the prevalence...
    10K
  • Jessy Lin
    @realJessyLin
    Oct 21, 2025
    Replying to @realJessyLin
    There's a huge spectrum of approaches to memory/continual learning - ranging from RAG to dreams of "infinite context" generalization to baking in new knowledge w/ gradient updates. I'm personally bullish on parametric updates that allow the model itself to get smarter over time
    14K
  • Jessy Lin
    @realJessyLin
    Oct 9, 2024
    Really cool of ICLR to experiment with making AI part of the reviewing process. Instead of rejecting AI assistance and pretending that people aren't already using LMs to read/write/understand things, we can learn a lot from trying to make it part of our process (even if
    7.8K
  • Jessy Lin
    @realJessyLin
    Oct 21, 2025
    Replying to @realJessyLin and @AIatMeta
    To learn something new, we shouldn’t need to finetune all the parameters of a large model. This motivates parameter-efficient methods for continual learning/memory, like LoRA and Cartridges, which add a small set of params to the model. However, LoRA is inherently low-capacity
    12K
