Log inSign up
Danqi Chen
451 posts
Image
user avatar
Danqi Chen
@danqi_chen
Associate professor @princetonPLI @PrincetonCS MoTS @thinkymachines Previously: @facebookai, @stanfordnlp, @Tsinghua_Uni
Princeton, NJ
cs.princeton.edu/~danqic/
Joined December 2009
831
Following
18.4K
Followers
  • user avatar
    Danqi Chen
    @danqi_chen
    Oct 8, 2025
    I am going to present two papers at #COLM2025 tomorrow from 4:30-6:30pm, as none of our leading authors can attend due to visa issues. Haven't done poster presentations for years 🤣🤣 .... so I will do my best! #76: LongProc #80: Goedel-Prover v1
    Image
    Image
    Image
    user avatar
    Chi Jin
    @chijinML
    Oct 8, 2025
    Our Goedel-Prover V1 will be presented at COLM 2025 in Montreal this Wednesday afternoon! I won’t be there in person, but my amazing and renowned colleague @danqi_chen will be around to help with the poster — feel free to stop by!
    49K
  • user avatar
    Danqi Chen
    @danqi_chen
    Nov 12, 2019
    An article written about me :)
    user avatar
    Princeton Computer Science
    @PrincetonCS
    Nov 12, 2019
    Danqi Chen works in natural language processing or #NLP, a fast-moving field that uses #AI to create machines that not only read documents written by humans but also assimilate and manipulate the knowledge that the documents contain. Read more here: t.ly/Vwg0V
    Image
  • user avatar
    Danqi Chen
    @danqi_chen
    Apr 15, 2023
    New center at Princeton on large language models research. Come join us! 😍😍😍
    user avatar
    Sanjeev Arora
    @prfsanjeevarora
    Apr 14, 2023
    Princeton has a new Center for Language and Intelligence, researching LLMs + large AI models, as well as their interdisciplinary applications. Looking for postdocs/research scientists/engineers; attractive conditions. nlp.cs.princeton.edu/center-languag…
    52K
  • user avatar
    Danqi Chen
    @danqi_chen
    May 25, 2022
    I am super excited about this paper. A new training approach for LMs with memory augmentation! * A simple and (maybe) better training objective for LMs? * With clever memory construction and data batching, better than kNN-LM, Transformer-XL etc.
    user avatar
    Zexuan Zhong
    @ZexuanZhong
    May 25, 2022
    Very excited to share a preprint “Training Language Models with Memory Augmentation”! t.ly/6n2l We propose a new training objective TRIME for language modeling—inspired by contrastive learning—which aligns with both token embeddings and *in-batch memories*. 1/n
    Image
    Image
  • user avatar
    Danqi Chen
    @danqi_chen
    Jul 10, 2022
    #NAACL2022 I am already in Seattle. This is my first conference since I became a faculty😂..... Let's catch up of course :) Oh, and all my students are here!
  • user avatar
    Danqi Chen
    @danqi_chen
    Apr 18, 2021
    Very surprised and excited by this result. Contrastive learning can go a loooong way in NLP!
    user avatar
    Tianyu Gao
    @gaotianyu1350
    Apr 18, 2021
    💥 to share “SimCSE: Simple Contrastive Learning of Sentence Embeddings”. We show that a contrastive objective can be VERY effective with right *augmentation* or *datasets*. Large gains on STS tasks and unsup. SimCSE matches previous supervised results! bit.ly/3gqgh0d
    Image
  • user avatar
    Danqi Chen
    @danqi_chen
    Oct 4, 2024
    Today, we released - ProLong: A set of long-context models (512K context ). Only trained on 5% of Llama-3.1 budget but strong results. - Helmet: A comprehensive eval for LCLMs. Important to get the evaluation right first! Kudos to team @gaotianyu1350 @_awettig @HowardYen1
    user avatar
    Tianyu Gao
    @gaotianyu1350
    Oct 4, 2024
    Very proud to introduce two of our recent long-context works: HELMET (best long-context benchmark imo): shorturl.at/JnBHD ProLong (a cont’d training & SFT recipe + a SoTA 512K 8B model): shorturl.at/XQV7a Here is a story of how we arrived there
    Image
    28K
  • user avatar
    Danqi Chen
    @danqi_chen
    Dec 13, 2024
    I’ve just arrived in Vancouver and am excited to join the final stretch of #NeurIPS2024! This morning, we are presenting 3 papers 11am-2pm: - Edge pruning for finding Transformer circuits (#3111, spotlight) @AdithyaNLP - SimPO (#3410) @yumeng0818 @xiamengzhou - CharXiv (#5303)
    Image
    Image
    Image
    15K
  • user avatar
    Danqi Chen
    @danqi_chen
    May 24, 2024
    Glad this SimPO paper is finally out. I am intrigued by its simplicity and effectiveness. The team has done a very impressive job in various experimental settings (and careful hyper-parameter tuning!) and in-depth analysis. Kudos to @yumeng0818 @xiamengzhou
    user avatar
    Yu Meng
    @yumeng0818
    May 24, 2024
    Introducing SimPO: Simpler & more effective Preference Optimization!🎉 Significantly outperforms DPO w/o a reference model!📈 Llama-3-8B-SimPO ranked among top on leaderboards!💪 ✅44.7% LC win rate on AlpacaEval 2 ✅33.8% win rate on Arena-Hard arxiv.org/abs/2405.14734 🧵[1/n]
    Image
    99K
  • user avatar
    Danqi Chen
    @danqi_chen
    Nov 14, 2018
    We are planning the 2nd workshop on Machine Reading for Question Answering (MRQA): mrqa.github.io. This year we are adding a new shared task focusing on generalization of MRQA systems. Also features awesome speakers. Check it out and vote for us!
  • user avatar
    Danqi Chen
    @danqi_chen
    Feb 19, 2025
    V. happy with this work! We’ve explored domain mixtures and quality filtering (including Alex’s previous work!), but what is even a “domain” in Common Crawl? Can we use these domains to better understand quality filters, and combine them for data curation? Cool visuals too!
    user avatar
    Alex Wettig
    @_awettig
    Feb 18, 2025
    🤔 Ever wondered how prevalent some type of web content is during LM pre-training? In our new paper, we propose WebOrganizer which *constructs domains* based on the topic and format of CommonCrawl web pages 🌐 Key takeaway: domains help us curate better pre-training data! 🧵/N
    Image
    15K
  • user avatar
    Danqi Chen
    @danqi_chen
    Dec 4, 2024
    Mengzhou is on the job market this year, and she is awesome :-)
    user avatar
    Sanjeev Arora
    @prfsanjeevarora
    Dec 3, 2024
    SimPO is a new method from @PrincetonPLI for improving AI models using preference data. It is simpler than last year's DPO and often outperforms it. Within a few months it has been widely adopted in models that have hit the top of the chatbot arena leaderboard in their
    25K
  • user avatar
    Danqi Chen
    @danqi_chen
    Jun 23, 2020
    Check it out at
    user avatar
    Google AI
    @GoogleAI
    Jun 23, 2020
    Announcing the EfficientQA competition and #NeurIPS2020 workshop, a collaborative effort with @Princeton and @UW that challenges developers to create end-to-end open-domain question answering systems that are small, yet robust. Learn all about it ↓ goo.gle/2AVm3Vg
    efficientqa.github.io
    Efficient Open-Domain Question Answering
    The official website for the open domain question answering challenge at NeurIPS 2020.
  • user avatar
    Danqi Chen
    @danqi_chen
    Dec 13, 2023
    I am at #NeurIPS2023 today! Students are presenting two oral papers: - @danfriedman0 Transformer Programs (Oral 3B / poster 3 #1509) - @SadhikaMalladi @gaotianyu1350 Memory-efficient zerothorder optimizer MeZO (Oral 4A / poster 4 #514) Come find us! More from Princeton 👇
    user avatar
    Princeton PLI
    @PrincetonPLI
    Dec 12, 2023
    Look at the breadth of Princeton research being presented at @NeurIPSConf (happening now) - not just in computer science, but also from a range of other departments. PLI blog post for details: bit.ly/3RFioiX
    Logo for Neural Information Processing Systems (NeurIPS) conference.
    25K

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Terms·Privacy·Cookies·Accessibility·Ads Info·© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up
Advertisement
Advertisement