Welcome to the Advanced Analytics & AI for Communication Science Group!
Directed by Joshua Hartshorne at MGH IHP
About AI4CommSci
The rapid advances in artificial intelligence don’t just change how we interact with
our speakers, choose TV shows to watch, or search for information online. They also change
what kind of science is possible and expand what it knowable. Established in 2024,
our group’s vision is to push the envelope of what is possible, making breakthroughs
in theory and practice for the communication sciences. In addition to our own research,
we seek to enhance the productivity of the field as a whole, through training junior
scholars and by disseminating methods, algorithms, and technology. Below, we highlight
a few major threads in our research. Space considerations mean leaving out a lot of cool
stuff; a full list of publications can be found here. Another good source is the extensive
media coverage of our work, compiled here.
Critical Periods for Learning a Second Language
A plot of the percentage of a language’s grammar a monolingual someone can learn as a function
of age, assuming that they are speaking only that language. (For instance, a 1-year-old is estimated
to learn about 16% of the language’s grammar in a year.) Most people learning a second language still
speak their first language much of the time, so will learn more slowly. From Chen & Hartshorne (2021).
People who learned a second language in childhood are difficult to distinguish from native speakers,
whereas those who began in adulthood are often saddled with an accent and conspicuous
grammatical errors. Despite being one of the oldest findings in the communication sciences,
the reasons remain a mystery. For one thing, until very recently, it was unclear at exactly
what age language learning becomes more difficult. By combining a dataset of unprecedented
size (>600,000 people) and a novel analytic model, we showed that the rate of grammar learning
declines dramatically in late adolescence.
This is despite the fact that older learners have the advantage of having already learned a
first language. In a recent study, we showed that older children actually learn a new language
more rapidly than younger children — but in proportion to how similar their first language is
to the new language. (We do not yet know how this “linguistic transfer” affects the rate of
learning in adults.)
We are currently trying to piece together exactly what is going on through a combination of
neuroimaging studies, computational modeling, and new behavioral studies with different types
of learners (for example, refugees or learners in immersion schools).
In the neuroimaging work, we are taking advantage of a machine learning-based method we developed
with our colleague Stefano Anzellotti at Boston College called “synthetic twin analysis”.
This allows for quantifying variation in neuroanatomy with far more precision than was
previously possible. Initial findings suggest that the fundamental nature of the neural basis
of second language acquisition does not change until at least late adolescence – consistent
with our behavioral findings. We are working to expand this work to look at adults.
A panel from our 2022 paper in Science that introduced synthetic twin analysis. For details, see the original paper.
This work has been supported by a Simons Foundation SFARI and an NIH NRSA.
What Makes Humans Such Efficient Language Learners?
Humans are shockingly good at learning language. This has been clear to scientists for some
time, as one learning algorithm after another has been shown to be insufficient. Recent Large
Language Models only reinforce this point. We learn much faster than they do:
Llama 2 – an open-source competitor to ChatGPT – is trained on more language than a small city
of humans would encounter in a lifetime. In addition, modern Large Language Models require the
equivalent of vast amounts of drilling on language – something that humans do not need.
The question is why. Answering this question is key to helping individuals who struggle with language,
whether due to a learning disability or to injury or stroke.
Together with Jesse Snedeker at Harvard, we have proposed an account (Conceptual Nativism) that
treats language-learning as a kind of code-breaking. Unlike Llama or ChatGPT, learners have thoughts
and are trying to learn how to express them through language. This is obvious for any adult learner,
but work in developmental psychology shows that this is likely true for babies as well. We argue –
and are currently trying to show computationally – that this puts strong constraints on learning,
vastly increasing the efficiency at which one can learn. Much of our work to date has focused on
testing a central prediction of Conceptual Nativism, which is that even though grammar is often
thought to be a set of arbitrary and opaque rules, in fact the grammars of the world’s languages are
tightly constrained by meaning. (Grammar seems opaque because we only really notice the parts that
are arbitrary, though in fact they actually rare.) In one case study after another, we show that this
is in fact the case.
Some additional evidence comes from the observation that children acquiring two languages are even more
efficient than those acquiring only one. Naively, one might expect a bilingual-acquiring child to learn each
language at half the rate of their monolingual peers, but in fact on a per-language basis, they are much
faster. One possibility, which we are currently exploring, is that bilinguals are able to use what they
have learned about each language to learn the other.
By elementary school, learning curves for grammar (A) are nearly indistinguishable for simultaneous
bilinguals (N=30,397) and monolinguals (N=246,497; Hartshorne et al., 2018). By middle childhood,
learning curves for vocabulary (B) are entirely indistinguishable for simultaneous bilinguals (N=4,207)
and monolinguals (N=48,162) (Hua & Hartshorne, in prep). During initial learning, English vocabulary
learning by bilinguals (N=646) lags monolinguals (N=11,040) as a function of age (C), but exceeds it as
a function of input (D; Hua & Hartshorne, in prep).
In current and planned work, we are building out computational models of Conceptual Nativism. This is an ambitious
task, as modeling how people learn to talk about their thoughts about the world requires good models of both
thought and the world.
This work has been supported by NSF #2033938, #1606285, #1551834, an NSF CAREER grant (#2238912), and an NIH NRSA.
Scaling the Cognitive and Behavioral Sciences
Research in the communication and other cognitive/behavioral sciences is slow, because collecting data
is slow. While statisticians routinely report that robust & reliable results require collecting
orders of magnitude more data than is typical, researchers often struggle to collect even that much.
We were one of a handful of research groups that pioneered collecting vast datasets through online
games, quizzes, and citizen science. While many researchers want to do such studies, there is a
significant barrier to entry.
Left: We asked cognitive scientists how they would collect data if they needed one thousand, ten thousand,
or one hundred thousand subjects. Note that most of our studies involve at least ten thousand. Right: The vast
majority of these scientists reported they wanted to conduct massive online studies (>10,000 subjects), but
hardly any had. In contrast, while many had done smaller studies using Amazon Mechanical Turk or Prolific,
a smaller number actually wanted to. Data: 322 cognitive scientists surveyed in 2022.
In order to address that barrier, we have developed free-and-open-source software (Pushkin) for running
massive citizen science experiments in the cognitive and behavioral sciences. This work has been supported by NSF
#2229631, #2318474, #2029637, #1551834.
Latest News
Summer 2025
The end of a successful summer internship program!
4.22.25
Hunter was quoted in the Wall Street Journal! Read the article, "Are Your Data Private Anymore?" here.
3.21.25
AI4CommSci was awarded an NSF Research Experiences for Undergraduates supplement that will fund two undergraduate researchers this summer.
2.28.25
Josh and Meng publish in Nature Review Psychology. Read the article here.
2.18.25
PI Josh Hartshorne gave a talk for the Emergentism, Ecosystem, and Expertise series via Zoom on conceptual nativism. Watch the recording here: recording
1.6.25
Emily and Josh were in Taiwan to meet with our collaborator Li-May Sung of National Taiwan University.
Josh, Emily, Éric, and Omar had a poster accepted at the 9th International Conference on Language Documentation & Conservation on their work digitizing archival Formosan documents.
Joshua Hartshorne was awarded a new grant from the National Science Foundation, titled "An open-source ecosystem for massive online experiments and citizen science."
6.7.22
Congratulations to our postdoc Aidas who had a paper accepted in Science! Read about it here.
5.9.22
Congratulations to Ethan Amato for completing his honors thesis and being part of the graduating class of 2022!
10.8.21
Joshua Hartshorne has published a new paper in the Annual Review of Linguistics:
Joshua Hartshorne, along with former undergrad Lily Feinberg and former lab manager Lauren Skorb, has published two papers in Advances in Methods and Practices in Psychological Science.[1][2] He has also published a solo paper in Current Opinion in Behavioral Sciences.
10.30.20
Parker Robbins presented findings from the KidTalk project at the USC Center for Economic and Social Research's COVID-19
Work in Progress Conference.
L3 welcomes two new Northeastern Co-ops: Alex Ichimura and Megan Rest!
5.18.20
A special congratulations to Jaq Pyle, Tony Chen, and Wendy Uelk for being part of the graduating class of 2020!
4.22.20
L3 is now studying the effects of social distancing on language development! "Have COVID-19 affected young children's daily lives? Researchers at Boston College want to know! If you live with a child 1 to 9 years old, please fill out this 3-minute survey.
1.6.20
Anna Petti, a co-op research assistant from Northeastern University, joins L3!
11.21.19
Joshua Hartshorne and Mariela Jennings were co-authors in a new paper in Science on cultural universals in music.
9.18.19
The lab was awarded a new three-year grant from the National Science Foundation to study the role of unsupervised learning in early language acquisition.
9.17.19
Check out this interview, where Josh discusses language acquisition, immersion, and the critical period!
5.13.19
Read about our work on lifespan development in the Boston Globe!
5.06.19
Three of our lab members presented their work at PURC! Congrats Juliani Vidal, Lily Feinberg, & Tony Chen!
Josh and Mariela published a paper about software for massive online experients! Check it out here.
1.18.19
We have an opening for a post-doctoral researcher! Check out the details here.
1.18.19
Applications for our Summer 2019 research assistant internship program are now open! Learn more here!
1.17.19
Joshua Hartshorne's class on Language Acquisition published a paper replicating Saffran, Newport, and Aslin (1996)'s experiment on adult statistical word segmentation. Additional authors include L3's Juliani Vidal and Caitlin Garcia.
12.17.18
Joshua Hartshorne's work on critical periods was featured on NPR's Innovation Hub!
9.10.18
As the fall semester begins, we welcome many new research assistants to our team. View the people page here and see who we are!
9.8.18
Mariela presents her poster titled "How Much Does Verb Semantics Determine Verb Syntax?" at the AMLaP conference in Berlin, Germany. View the poster here!
9.8.18
Mariela presents her poster titled "Pushkin: An Open-source Engine for Social Science at Scale" at the AMLaP conference in Berlin, Germany. View the poster here!
9.10.18
Another great summer progam draws to a close, and we thank all of the research assistants who made the past few months fun a blast! The lab made progress in our online studies production, our field testing excursions, and web development efforts. RAs working on their personal and thesis projects made great use of their time. Go L3!
7.28.18
Tiwa presents his poster titled "Grammatical Accents: Using Machine Learning to Quantify Language Transfer" at the CogSci conference in Madison, Wisconsin. View the poster here!
6.18.18
The lab welcomes new RAs Rudmila, Taylor and Kate as they start their summer internship! They will be assisting with kid testing studies in both the Artisani and Boston Common parks along with the Acton Discovery Museum.
6.7.18
Joshua Hartshorne and Mariela Jennings are co-organizing a workshop on massive online experiments at this year's CogSci.
5.23.18
Read our newsletter to catch up on the lab's latest accomplishments during the Spring 2018 term!
5.4.2018
Media coverage surrounding Joshua Hartshorne's latest critical period study continues! Visit our Media Page to read articles from Scientific American, Newsweek, Time, and Daily Mail. Haven't read the paper? Find it here!
5.2.2018
Joshua K. Hartshorne, Joshua B. Tenenbaum, and Steven Pinker just had an article accepted in Cognition:
A critical period for second language acquisition: Evidence from 2/3 million English speakers.
L3 takes on candlepin bowling! It was right up our alley!
2.4.2018
Today is the Language Learning Lab's first time testing at the Boston Children's Museum! We will be there 1:30 to 4:30 and running studies for 3 to 6 year olds!
1.21.2018
The Language Learning Lab welcomes its newest addition! Dr. Hartshorne and his wife welcome their first child into the world!
1.16.2018
L3 welcomes the new undergraduate research assistants for the semester: Ning Duan, Rachel Duquette, Seung Kim, and David Kocen!
1.3.2018
Hayley Greenough, a co-op research assistant from Northeastern University, joins L3!
1.1.2018
Juliani Vidal is awarded the Frontier Fellowship. The Frontier Fellowship seeks to assist Montserrat students in expanding their experience in areas outside of their major or educational aspirations.
12.23.2017
The Language Learning Lab wraps up a successful semester. Check out our newsletter to see what we were up to this semester!
7.6.2017
The "Psycho Linguists" win the interlab kickball tournament 25-4 against the "Kickball Team in Memory of the Spanish Republic."
7.3.2017
The Language Learning Lab is excited to announce that our high school interns have now joined us for the summer. They will assist lab members with our current research projects. Please join us in welcoming all four high school summer interns Zoë, Jay, Ali, and Heather!
7.2.2017
Lauren Skorb's paper Food For Thought: Family Food Routines and Literacy in Latino Kindergarteners appears in the Journal of Applied Developmental Psychology.
6.26.2017
Joshua Hartshorne gives the keynote address at DETEC 2017 at the Max Planck Institute for Psycholinguistics in the Netherlands.
5.30.2017
L3 alumn Jesse Mu received the John J. Neuhauser award, which is given to a senior who has demonstrated outstanding achievement in computer science.
5.30.2017
L3 alumn Jesse Mu received the Thomas I. Gasson, S.J., President of Boston College (1907-1914), award, which is given to the graduating senior with a distinguished academic record over a four-year period.
5.24.2017
The lab has received a two-year Academic Technology Innovation Grant (ATIG) from the Academic Technology Advisory Board (ATAB) at Boston College.
5.22.2017
Congratulations to our graduating seniors: William Ades, Maria Valdivia Cox, Cora Ivesco, Liana Llado, and Jesse Mu!
The lab has received a three-year NSF grant entitled, "CompCog: Large-scale, empirically based, publicly accessible database of argument structure to support experimental and computational research."
8.10.2016
Joshua Hartshorne co-organized a workshop "Learning to Talk about Events: Grounding Language Acquisition in Intuitive Theories and Event Cognition" at the 2016 CogSci.
7.15.2016
Lauren Skorb joins as a coordinator.
7.1.2016
Joshua Hartshorne publishes a commentary on Lakoff's theory of metaphor.
Laura Niemi's paper on implicit causality was accepted for a talk at the 2016 CogSci.
4.11.2016
Joshua Hartshorne will be co-organizing a workshop "Learning to Talk about Events: Grounding Language Acquisition in Intuitive Theories and Event Cognition" at the next CogSci.
3.14.2016
Jon Ravid joins the lab as a Citizen Science developer/coordinator.