Please note: we are currently updating the code and adding scripts!
IDEAlign is a framework for evaluating alignment between LLM-generated and expert annotations on open-ended, interpretive tasks. It consists of three stages:

1. **Benchmarking** expert similarity judgments via *odd-one-out* tasks.
2. **Validating** automated (model) similarity methods (e.g., lexical, embedding-based, topic-based, and LLM-as-a-judge) against the expert benchmark by comparing answer distributions (see the sketch below).
3. **Deploying** the best-validated model to assess the similarity of ideas generated by LLMs and domain experts at scale.
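
The odd-one-out comparison behind stages 1–2 can be pictured with a minimal sketch. This is **not** the IDEAlign implementation or API: the helper names (`odd_one_out`, `token_overlap`), the example triplet, and the expert vote counts are illustrative assumptions only.

```python
# Minimal sketch (not the IDEAlign API): scoring an automated similarity
# method on an odd-one-out triplet and comparing it to expert judgments.
from collections import Counter

def odd_one_out(items, similarity):
    """Return the index of the item least similar to the other items."""
    scores = []
    for i, item in enumerate(items):
        others = [x for j, x in enumerate(items) if j != i]
        scores.append(sum(similarity(item, o) for o in others))
    # The lowest total-similarity item is the model's "odd one out".
    return min(range(len(items)), key=lambda i: scores[i])

def token_overlap(a, b):
    """Toy lexical similarity: Jaccard overlap of lowercased tokens."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    return len(ta & tb) / len(ta | tb) if ta | tb else 0.0

# Hypothetical triplet of annotations and hypothetical expert votes.
triplet = [
    "students struggle to transfer skills across contexts",
    "learners have difficulty applying knowledge in new settings",
    "class sizes limit individualized feedback",
]
expert_votes = Counter({2: 9, 0: 1})  # e.g., 9 of 10 experts picked index 2

model_choice = odd_one_out(triplet, token_overlap)
expert_dist = [expert_votes[i] / sum(expert_votes.values()) for i in range(3)]
print(f"model picks index {model_choice}; expert distribution {expert_dist}")
```

In this sketch, any similarity function (lexical, embedding-based, topic-based, or LLM-as-a-judge) can be dropped in for `token_overlap`, and its answer distribution over many triplets would then be compared against the expert distribution to select the best-validated method.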