METAL: A Multi-Agent Framework for Chart Generation with Test-Time Scaling

ACL-2025 (Main Conference)

Bingxuan Li, Yiwei Wang, Jiuxiang Gu, Kai-Wei Chang, Nanyun Peng

Introduction

Direct prompting of current VLMs (e.g. GPT- 4O) often fails togenerate charts that accurately replicate reference charts, resulting in errors in structure, color, and text alignment. We propose METAL to tackle this challenge with iterative refinement through generation, critique, and revision. Emperical result shows that METAL exhibits the phenomenon of test-time scaling: its performance increases monotonically as the logarithmic computational budget grows from 512 to 8192 tokens.

Quick Start

1. Environment Setup

Create your Conda environment using the provided YAML file:

conda env create -f environment.yml

2. Configure Environment Variables

Edit the file src/.env to set up the required environment variables:

PROJECT_PATH={PATH_TO_CHARTMIMIC_DIR}
EASYOCR_MODEL_PATH={PATH_TO_EASYOCR_MODEL}
OPENAI_ORG={YOUR_OPENAI_ORG}
OPENAI_API_KEY={YOUR_OPENAI_API_KEY}

Replace the placeholders with your specific paths and keys.

3. Download the Dataset

Navigate to the dataset directory and run the download script:

cd dataset
sh get_dataset.sh

4. Run METAL and Its Variations

Before running the script, modify the shell script scripts/run_metal.sh with the appropriate paths and settings. Then, run:

sh scripts/run_metal.sh

Inside the script, ensure you set the following variables:

DATASET_DIR: Path to the dataset (e.g., ../../dataset)
WORKING_HOME_DIR: Directory where results and working files will be stored
DATA_RANGE: Data range (e.g., 0-100)
MODEL: Supported models (e.g., llama3_2, gpt-4o)
SYSTEM: Supported systems (e.g., Metal, Metal-v, Metal-c, Metal-s)
MAX_ITER: Maximum number of iterations
N_PROCESS: Number of parallel processes (typically set to 1)
CUDA_DEVICES: CUDA device IDs (e.g., 0,1)

5. Run Baselines

Similarly, modify the shell script scripts/run_baselines.sh with your specific paths and settings. Then, run:

sh scripts/run_baselines.sh

Set the following variables in the script:

DATASET_DIR: e.g., ../../dataset
WORKING_HOME_DIR: Your desired working directory
DATA_RANGE: e.g., 0-100
MODEL: Supported models (e.g., llama3_2, gpt-4o)
SYSTEM: Baseline systems (e.g., HintEnhanced, Best-of-N, SelfRevision)
MAX_ITER: Maximum iterations
N_PROCESS: Recommended to be 8 for baselines
CUDA_DEVICES: e.g., 0,1

Notes

Adjust all placeholder values (e.g., {YOUR_MODEL}, {PATH_TO_DATASET_DIR}) to suit your local setup.
Error logs are stored at the specified LOG_FILE_PATH. Monitor these for debugging purposes.

Acknowledgements

The dataset, direct generation code, and evaluation code are adapted from ChartMIMIC.

BibTex

If you find our work helpful, please kindly cite our work :)

@misc{li2025metalmultiagentframeworkchart,
          title={METAL: A Multi-Agent Framework for Chart Generation with Test-Time Scaling}, 
          author={Bingxuan Li and Yiwei Wang and Jiuxiang Gu and Kai-Wei Chang and Nanyun Peng},
          year={2025},
          eprint={2502.17651},
          archivePrefix={arXiv},
          primaryClass={cs.CV},
          url={https://arxiv.org/abs/2502.17651}, 
    }

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
dataset		dataset
src		src
.gitignore		.gitignore
README.md		README.md
environment.yml		environment.yml
metal.png		metal.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

METAL: A Multi-Agent Framework for Chart Generation with Test-Time Scaling

Table of Contents

Introduction

Quick Start

1. Environment Setup

2. Configure Environment Variables

3. Download the Dataset

4. Run METAL and Its Variations

5. Run Baselines

Notes

Acknowledgements

BibTex

About

Uh oh!

Releases

Packages

Languages

metal-chart-generation/metal

Folders and files

Latest commit

History

Repository files navigation

METAL: A Multi-Agent Framework for Chart Generation with Test-Time Scaling

Table of Contents

Introduction

Quick Start

1. Environment Setup

2. Configure Environment Variables

3. Download the Dataset

4. Run METAL and Its Variations

5. Run Baselines

Notes

Acknowledgements

BibTex

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages