Image

Image Duolingo Research

Science powers our mission to make language education free and accessible to everyone.

About Us

Publications

Data & Tools

  • 2020 Notification Bandit Data

    Replication data for our KDD 2020 paper, "A Sleeping, Recovering Bandit Algorithm for Optimizing Recurring Notifications." Includes 200 million examples of Duolingo practice reminder push notifications sent to Duolingo users over a 35 day period, including which template was used, whether the user converted within 2 hours, and other metadata.

  • 2020 STAPLE Shared Task Data

    Data for the 2020 Shared Task on Simultaneous Translation And Paraphrase for Language Education (STAPLE). This corpus contains more than 3 million pairs of English sentences with multiple possible translations into Portuguese, Hungarian, Japanese, Korean, and Vietnamese.

  • 2018 SLAM Shared Task Data

    Data for the 2018 Shared Task on Second Language Acquisition Modeling (SLAM). This corpus contains 7 million words produced by learners of English, Spanish, and French. It includes user demographics, morph-syntactic metadata, response times, and longitudinal errors for 6k+ users over 30 days.

  • Spaced Repetition Data

    Data used to develop our half-life regression (HLR) spaced repetition algorithm. This is a collection of 13 million user-word pairs for learners of several languages with a variety of language backgrounds. It includes practice recall rates, lag times between practices, and other morpho-lexical metadata.

Our Team

  • Image
    André Horie AI + Machine Learning
  • Image
    Bożena Pająk Learning + Curriculum
  • Image
    Erin Gustafson Data Science + Analytics
  • Image
    Cindy Berger Learning + Curriculum
  • Image
    Angela DiCostanzo Learning + Curriculum
  • Image
    Cindy Blanco Learning + Curriculum
  • Image
    Lisa Bromberg Learning + Curriculum
  • Image
    Klinton Bicknell AI + Machine Learning
  • Image
    Will Monroe AI + Machine Learning
  • Image
    Geoff LaFlair Assessment + Psychometrics
  • Image
    Hope Wilson Learning + Curriculum
  • Image
    Kevin Yancey AI + Machine Learning
  • Image
    Xiangying Jiang Learning + Curriculum
  • Image
    Jessica Becker Learning + Curriculum
  • Image
    Stephen Mayhew AI + Machine Learning
  • Image
    Meredith McDermott UX Research
  • Image
    Andrew Runge AI + Machine Learning
  • Image
    Connor Brem AI + Machine Learning
  • Image
    Emily Moline Learning + Curriculum
  • Image
    Elizabeth Strong Learning + Curriculum
  • Image
    Cory Wheeler Learning + Curriculum
  • Image
    Lauren Bilsky AI + Machine Learning
  • Image
    Emma Gibson Learning + Curriculum
  • Image
    James Leow Learning + Curriculum
  • Image
    Danchen Yang Learning + Curriculum
  • Image
    Isabel Deibel Learning + Curriculum
  • Image
    Elizabeth Onstwedder Learning + Curriculum
  • Image
    Kevin Lenzo AI + Machine Learning
  • Image
    Mancy Liao Assessment + Psychometrics
  • Image
    Nora Gordon Learning + Curriculum
  • Image
    Sharon Wilkinson Learning + Curriculum
  • Image
    Naveen Shankar Data Science + Analytics
  • Image
    Antony Kunnan Assessment + Psychometrics
  • Image
    Jackie Bialostozky Learning + Curriculum
  • Image
    Lucy Portnoff Data Science + Analytics
  • Image
    Ramsey Cardwell Assessment + Psychometrics
  • Image
    Alina von Davier Assessment + Psychometrics
  • Image
    Yigal Attali Assessment + Psychometrics
  • Image
    Audrey Kittredge Learning + Curriculum
  • Image
    Ben Reuveni Learning + Curriculum
  • Image
    J.R. Lockwood Assessment + Psychometrics
  • Image
    Rich Forest Learning + Curriculum
  • Image
    Mark Lock Data Science + Analytics
  • Image
    Will Belzak Assessment + Psychometrics
Image
Image