Omar Shaikh (@oshaikh13) / X

Omar Shaikh

845 posts

Omar Shaikh

@oshaikh13

member of sociotechnical staff @Stanford

🇸🇦→🇨🇦→🇺🇸→🇸🇦→🇺🇸

Joined December 2012

Pinned
Omar Shaikh
@oshaikh13
Mar 10
What’s the point of a “helpful assistant” if you have to always tell it what to do next? In a new paper, we introduce a reasoning model that predicts what you’ll do next over long contexts (LongNAP 💤). We trained it on 1,800 hours of computer use from 20 users. 🧵
00:00
123K
Omar Shaikh
@oshaikh13
Jun 9, 2025
What if LLMs could learn your habits and preferences well enough (across any context!) to anticipate your needs? In a new paper, we present the General User Model (GUM): a model of you built from just your everyday computer use. 🧵
00:00
71K
Omar Shaikh
@oshaikh13
Jul 27, 2025
BREAKING NEWS! Most people aren’t prompting models with IMO problems :) They’re prompting with tasks that need more context, like “plz make talk slides.” In an ACL oral, I’ll cover challenges in human-LM grounding (in 60K+ real interactions) & introduce a benchmark: RIFTS. 🧵
38K
Omar Shaikh
@oshaikh13
Aug 14, 2025
If you thought referencing past chats was cool, we built an MCP that lets Claude use *anything you see or do on your computer* as context.
00:00
00:34
Claude
@claudeai
Aug 11, 2025
Claude can now reference past chats, so you can easily pick up from where you left off.
64K
Omar Shaikh
@oshaikh13
Jun 10, 2024
LLMs sound homogeneous *because* feedback modalities like rankings, principles, and pairs cater to group-level preferences. Asking an individual to rank ~1K outputs or provide accurate principles takes effort. What if we relied on a few demos to elicit annotator preferences?
00:00
52K
Omar Shaikh
@oshaikh13
Mar 4, 2024
Before taking it out on your roommate for leaving dirty dishes out, you probably want to practice your conflict resolution skills first. Expert conflict resolution trainers, however, are EXPENSIVE. What if we practiced with an angry LLM instead? 😈 #CHI24 arxiv.org/abs/2309.12309
53K
Omar Shaikh
@oshaikh13
Dec 19, 2022
Chain of Thought reasoning prompts—like "Let's think step by step"—make large language models more performant. Including, it turns out, at spewing out toxic and biased content. In our preprint, we evaluate zero-shot CoT on harmful questions & stereotypes: arxiv.org/abs/2212.08061
32K
Omar Shaikh
@oshaikh13
Nov 16, 2023
Humans are strategic in establishing common ground: we ask clarification and follow-up questions, something LLMs rarely do. How should we measure this grounding alignment? Check out our preprint: “Grounding or Guesswork? LLMs are Presumptive Grounders” arxiv.org/pdf/2311.09144…
23K
Omar Shaikh
@oshaikh13
Apr 23, 2025
Hi! I'm gonna be presenting this at #ICLR2025 during the Thursday poster session (4/24; 3 p.m - 5:30 p.m, Hall 3 + Hall 2B #208). Come by if you want to talk about making ice cream!! (and also human-computer grounding, interacting with LMs, user models, etc.)
00:30
Omar Shaikh
@oshaikh13
Jun 10, 2024
LLMs sound homogeneous *because* feedback modalities like rankings, principles, and pairs cater to group-level preferences. Asking an individual to rank ~1K outputs or provide accurate principles takes effort. What if we relied on a few demos to elicit annotator preferences?
14K
Omar Shaikh
@oshaikh13
Jul 17, 2025
bro is cheating on this too💀
01:06
Roy
@im_roy_lee
Jul 15, 2025
in the past week at @cluely, we've been kicking off our most ambitious project ever. the models of today are great at answering questions. the models at @cluely will be really good at predicting which questions you have. this is a fundamentally different user experience than
9.7K
Omar Shaikh
@oshaikh13
Nov 18, 2023
i guess @openai waited till after the anonymity deadline
6.4K
Omar Shaikh
@oshaikh13
Apr 11, 2024
There’s been a lot of talk on using LLMs to entirely REPLACE people. We’ve been asking ourselves a different question: how can we use LLMs to make us better people? We think social skill training is a *very* promising answer!
Diyi Yang
@Diyi_Yang
Apr 9, 2024
Learning social skills is out of reach for most people🙁 How can we make social skill training more accessible? We introduce 🌟APAM🌟 (AI Partner and AI Mentor) that leverages LLMs for social skill training via realistic practice and tailored feedback!
17K
Omar Shaikh
@oshaikh13
Apr 16, 2024
Tired of your language model "delving" into things? Or maybe you like delving! We're working on a new interaction & method to customize language models, and we're looking for participants! If you're interested, please fill out the Google Form below.
docs.google.com
Interest in Customized Language Model Study
Hi there! We're looking for folks to try out a language model that's been customized to some of your tasks. If you're interested in the study, please reach out by providing your email below and...
30K
Omar Shaikh
@oshaikh13
Oct 15, 2020
“We published a paper. We used a model. We had result X.” Trying to write a persuasive message? The ordering of your rhetorical strategies matter! Check out our work at Findings of #emnlp2020 (with @jiaao_chen, Jon Saad-Falcon, @Diyi_Yang, @PoloChau). @ICatGT @mlatgt #nlproc