Skip to content
@OSU-NLP-Group

OSU Natural Language Processing

Popular repositories Loading

  1. HippoRAG HippoRAG Public

    [NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + Personali…

    Python 3.2k 322

  2. Mind2Web Mind2Web Public

    [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist web agents

    Jupyter Notebook 941 119

  3. SeeAct SeeAct Public

    [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

    Python 822 107

  4. GUI-Agents-Paper-List GUI-Agents-Paper-List Public

    Building a comprehensive and handy list of papers for GUI agents

    Python 626 33

  5. TravelPlanner TravelPlanner Public

    [ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"

    Python 469 71

  6. MagicBrush MagicBrush Public

    [NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".

    Python 400 14

Repositories

Showing 10 of 63 repositories
  • cobalt Public

    Code and data for the paper "Bridging Online and Offline RL: Contextual Bandit Learning for Multi-Turn Code Generation"

    OSU-NLP-Group/cobalt’s past year of commit activity
    Python 2 MIT 0 0 0 Updated Feb 4, 2026
  • AutoElicit Public

    When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents

    OSU-NLP-Group/AutoElicit’s past year of commit activity
    0 0 0 0 Updated Feb 4, 2026
  • saev Public

    Sparse autoencoders for vision

    OSU-NLP-Group/saev’s past year of commit activity
    Python 55 MIT 6 7 4 Updated Feb 2, 2026
  • GUI-Drag Public
    OSU-NLP-Group/GUI-Drag’s past year of commit activity
    Python 2 0 0 0 Updated Feb 1, 2026
  • Online-Mind2Web Public

    An Illusion of Progress? Assessing the Current State of Web Agents

    OSU-NLP-Group/Online-Mind2Web’s past year of commit activity
    Python 142 MIT 9 2 1 Updated Jan 2, 2026
  • SciNav Public
    OSU-NLP-Group/SciNav’s past year of commit activity
    0 0 0 0 Updated Dec 21, 2025
  • Mind2Web-2 Public

    [NeurIPS'25 D&B] Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge

    OSU-NLP-Group/Mind2Web-2’s past year of commit activity
    Python 98 MIT 7 1 0 Updated Dec 18, 2025
  • TravelPlanner Public

    [ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"

    OSU-NLP-Group/TravelPlanner’s past year of commit activity
    Python 469 MIT 71 0 3 Updated Nov 7, 2025
  • Mind2Web Public

    [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist web agents

    OSU-NLP-Group/Mind2Web’s past year of commit activity
    Jupyter Notebook 941 MIT 119 7 6 Updated Nov 5, 2025
  • AgentSafety Public
    OSU-NLP-Group/AgentSafety’s past year of commit activity
    174 7 0 0 Updated Oct 31, 2025

Most used topics

Loading…