QAPyramid

This repo contains the data and code for "QAPyramid: Fine-grained Evaluation of Content Selection for Text Summarization".

Data

QAPyramid data can be downloaded from the Hugging Face Hub under shiyue/QAPyramid. We provide a notebook, data.ipynb, that demonstrates basic usage of the dataset.
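
As a rough sketch of what loading looks like outside the notebook, the snippet below uses the `datasets` library with the Hub id given above. The helper name `load_qapyramid` is hypothetical, and the split and column names are not assumed here; if the dataset defines several configurations, `load_dataset` will ask for a config name, which can be passed through `**kwargs`. data.ipynb remains the reference for actual usage.

```python
DATASET_ID = "shiyue/QAPyramid"  # Hub id stated in this README


def load_qapyramid(**kwargs):
    """Download (or load from the local cache) the QAPyramid dataset.

    Hypothetical helper: extra keyword arguments (e.g. a config name or
    split) are forwarded unchanged to datasets.load_dataset.
    """
    # Imported lazily so this file can be imported without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(DATASET_ID, **kwargs)


if __name__ == "__main__":
    data = load_qapyramid()
    # Printing the returned object lists the available splits and columns.
    print(data)
```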

raw_data contains the raw annotations we collected from Amazon Mechanical Turk, including the annotations from the different annotators.

Annotation UIs

htmls contains the annotation UIs we used for data collection.

AutoQAPyramid

Requirements:

nltk
openai
datasets
qasem_parser
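
A quick way to check the environment before running the script is sketched below. It assumes that each package is importable under the same name as listed above; the helper `missing_modules` is illustrative and not part of this repo.

```python
from importlib.util import find_spec

# Module names taken from the list above (assumed to match the import names).
REQUIRED_MODULES = ["nltk", "openai", "datasets", "qasem_parser"]


def missing_modules(names=REQUIRED_MODULES):
    """Return the names that cannot be found in the current environment."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages are importable.")
```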

Script:

The code for running AutoQAPyramid on our benchmark is provided in run_autoqapyramid.py. This script leverages QASem for QA generation and GPT-4o for QA presence detection. The zero-shot variant of the approach is included by default.

The script first loads the dataset and then evaluates it using the AutoQAPyramid metric. The main function, run(), accepts the following inputs:

A list of reference summaries
A list of system-generated summaries
An optional list of QAs, where each QA is a tuple in the format (verb, question, answer)
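
The inputs listed above can be sketched as plain Python data. This is an illustration only: the example sentences and QA tuples are invented, the helper `check_inputs` is hypothetical, and the grouping of QA tuples into one list per reference summary is an assumption. See run_autoqapyramid.py for the exact argument names and structure that run() expects.

```python
from typing import List, Optional, Tuple

QA = Tuple[str, str, str]  # (verb, question, answer), as described above


def check_inputs(
    references: List[str],
    summaries: List[str],
    qas: Optional[List[List[QA]]] = None,
) -> None:
    """Sanity-check inputs before passing them to the metric.

    Assumption: QAs are grouped as one list of tuples per reference summary.
    """
    if len(references) != len(summaries):
        raise ValueError("references and summaries must have the same length")
    if qas is None:
        return  # QAs are optional; the metric generates them when absent
    if len(qas) != len(references):
        raise ValueError("expected one list of QAs per reference summary")
    for per_reference in qas:
        for qa in per_reference:
            if len(qa) != 3 or not all(isinstance(x, str) for x in qa):
                raise ValueError(f"not a (verb, question, answer) tuple: {qa!r}")


# Invented example data, for illustration only.
references = ["The company acquired a startup in 2020."]
summaries = ["A startup was bought by the company."]
qas = [[
    ("acquired", "Who acquired something?", "The company"),
    ("acquired", "What did someone acquire?", "a startup"),
]]

check_inputs(references, summaries, qas)
```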

If QA tuples are not provided, the metric generates them automatically using QASem. Pass in OPENAI_API_KEY={your_api_key} to run GPT-4o.
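
Because GPT-4o calls fail without a key, it can help to check for it up front. The sketch below assumes the key is supplied through the OPENAI_API_KEY environment variable; the helper `require_openai_key` is hypothetical and not part of run_autoqapyramid.py.

```python
import os


def require_openai_key(env=os.environ):
    """Return the API key from the environment, or raise a clear error."""
    key = env.get("OPENAI_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "OPENAI_API_KEY is not set; export it before running the script."
        )
    return key


if __name__ == "__main__":
    try:
        require_openai_key()
        print("OPENAI_API_KEY found.")
    except RuntimeError as err:
        print(err)
```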
