Tianze Yang†, Yucheng Shi†, Mengnan Du, Xuansheng Wu, Qiaoyu Tan, Jin Sun, Ninghao Liu
† Equal contribution
This is the official repository for our paper "Concept-Centric Token Interpretation for Vector-Quantized Generative Models", accepted at the International Conference on Machine Learning (ICML) 2025.
We propose a novel framework, CORTEX, for interpreting tokens in vector-quantized generative models through a concept-centric lens.
Figure: Our pipeline for token-level concept interpretation.
| Requirement | Value |
|---|---|
| Python | 3.12.3 |
| Conda env | CORTEX |
# Option A (preferred): use the YAML file
conda env create -f environment.yml  # creates env named "CORTEX"
# Option B: use the requirements file
conda create -n CORTEX python=3.12.3
conda activate CORTEX
pip install -r requirements.txt

CORTEX/
├── VQGAN_explanation/ # Experiments & analyses based on VQGAN
├── Dalle_explanation/ # Experiments & analyses based on DALLE
├── environment.yml # Conda environment specification (preferred)
├── requirements.txt # Pip fallback dependency list
└── README.md # Repository overview (you are here)
cd VQGAN_explanation

This subfolder contains the implementation of CORTEX to explain the VQGAN model.
CORTEX/VQGAN_explanation/
├── checkpoints/ # Model checkpoints (download required)
├── datasets/ # Datasets (download required)
├── eval/ # Evaluation scripts
│ ├── codebook_level_explanation.py
│ ├── sample_concept_level_explanation.py
│   └── sample_image_level_explanation.py
├── logs/ # Training logs
├── results/ # Results directory
├── model.py # IEM architecture
├── new_vqgan.py # Modified file to place into the VQGAN repository
├── dataset.py # Dataset loader
├── train.py # Training script for IEM
├── test.py # Evaluation script
├── TIS_computation.py # Token Importance Score computation
├── TIS_analysis.py # TIS analysis for concept-level explanations
└── generate_freq_based_tokens.py # Generate frequency-based baseline
- Clone the VQGAN repository.
- Place the `new_vqgan.py` file into the VQGAN repository under the `taming-transformers/taming/models` directory (required if you want to run `eval/codebook_level_explanation.py`).
- Download the datasets, or generate your own dataset, and replace the `datasets` directory. (The dataset was generated using the VQGAN model.)
- Download pre-trained checkpoints, or train your own IEMs, and place them in the `checkpoints` directory.
📥 Data and Checkpoints Download:
You can download our generated dataset from Download Datasets and our pre-trained checkpoints from Download Checkpoints.
⚠️ Note: The dataset is quite large. For efficiency, we recommend generating only the required subset for your task instead of downloading the entire dataset.
You can train your own Interpretable Explanation Model (IEM) on different Vector-Quantized Generative Models (VQGMs).
The model input is a token-based embedding with shape (256, 16, 16). To train IEMs on other VQGMs, you need to first generate the required dataset:
- For each image, save its token-based embedding (of shape 256 × 16 × 16).
- During generation, record the corresponding token indices and label.
- Save this metadata in a `.csv` file following the format of train_embeddings.csv.
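The steps above can be sketched as follows. This is a hypothetical illustration, not the repository's actual generation script: `encode_to_tokens` stands in for your VQGM's encoder (here it just produces random data), and the CSV column names are assumptions modeled loosely on train_embeddings.csv — check that file for the exact format.

```python
import csv
import os
import numpy as np

os.makedirs("datasets", exist_ok=True)

def encode_to_tokens(image):
    """Hypothetical stand-in for a VQGM encoder: a real encoder returns a
    (256, 16, 16) token-based embedding plus the codebook indices used."""
    rng = np.random.default_rng(0)
    embedding = rng.standard_normal((256, 16, 16)).astype(np.float32)
    indices = rng.integers(0, 1024, size=16 * 16)  # one index per spatial token
    return embedding, indices

rows = []
for i, (image, label) in enumerate([(None, 3), (None, 7)]):  # dummy image/label pairs
    embedding, indices = encode_to_tokens(image)
    np.save(f"datasets/embedding_{i}.npy", embedding)  # IEM input, shape (256, 16, 16)
    rows.append({
        "file": f"embedding_{i}.npy",
        "token_indices": " ".join(map(str, indices)),
        "label": label,
    })

# Metadata CSV in the spirit of train_embeddings.csv (column names are assumed)
with open("datasets/train_embeddings.csv", "w", newline="") as f:
    writer = csv.DictWriter(f, fieldnames=["file", "token_indices", "label"])
    writer.writeheader()
    writer.writerows(rows)
```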
python train.py --model {model_name}

where `model_name` ∈ {1, 2, 3, 4}.
python test.py --model {model_name}

To compute Token Importance Scores:

python TIS_computation.py --model {model_name} --data_type {data_type} --batch_size {batch_size} --gpu {gpu_number}

- `model_name`: 1, 2, 3, or 4
- `data_type`: `train` for `eval/sample_concept_level_explanation.py`; `test` for `eval/sample_image_level_explanation.py`
- `batch_size`: integer batch size
- `gpu_number`: GPU device index
Example:
python TIS_computation.py --model 1 --data_type train --batch_size 25 --gpu 1
⚠️ This process may take considerable time depending on the dataset size and GPU.
To generate the frequency-based baseline:

python generate_freq_based_tokens.py

To run the TIS analysis for concept-level explanations:

python TIS_analysis.py --model {model_name}

To produce sample-level explanations:

cd eval
python sample_image_level_explanation.py --model {model_name}
python sample_concept_level_explanation.py --model {model_name} --top_n {top_n_value} --token_num {token_num}

- `top_n`: select the top-n tokens per image
- `token_num`: number of tokens to use
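As a purely illustrative toy of what concept-level token ranking looks like (not the paper's actual TIS formula, which is implemented in TIS_computation.py), tokens can be ranked by their mean importance across samples:

```python
import numpy as np

# Toy illustration only: rank tokens by mean per-sample importance score.
scores = np.array([
    [0.1, 0.9, 0.3],   # importance of three tokens for sample 1
    [0.2, 0.8, 0.4],   # importance of the same tokens for sample 2
])
mean_tis = scores.mean(axis=0)            # average importance per token
top_tokens = np.argsort(mean_tis)[::-1]   # most important tokens first
print(top_tokens.tolist())  # → [1, 2, 0]
```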
Replace the line inside codebook_level_explanation.py:
VQGAN_directory = {Your VQGAN directory}

with your actual VQGAN repo path.
Run:
python codebook_level_explanation.py --model {model_name} --steps {optimization_steps} --lr {learning_rate} --optimization_type {token_selection or embedding}

Example:
python codebook_level_explanation.py --model 1 --optimization_type token

cd Dalle_explanation

This subfolder contains the implementation of CORTEX to explain the DALL·E-mini model.
CORTEX/Dalle_explanation/
├── checkpoints/ # Model checkpoints (download required)
├── datasets/ # Datasets (download required)
├── bias_detection.py # Bias detection using TIS
├── dataset.py # Dataset loader
├── model.py # IEM architecture
├── test.py # Evaluation script
├── train.py # Training script for IEM
├── TIS_computation.py # Token Importance Score computation
└── TIS_analysis.py # TIS analysis
- Download the datasets generated by DALL·E-mini and replace the `datasets` directory.
  📥 Dataset Download: Datasets
- Download the pre-trained checkpoints and place them in the `checkpoints` directory.
  📥 Checkpoints Download: Checkpoints
⚠️ Note: In this experiment, we only pretrained the CNN-based model; you can also train IEMs with other architectures.
You can train an Interpretable Explanation Model (IEM) on DALL·E-mini embeddings using:
python train.py --model 1

python test.py --model 1 --bias_type doctor_color  # or doctor_gender

python TIS_computation.py --model 1 --bias_type doctor_color  # or doctor_gender

python TIS_analysis.py --model 1 --bias_type doctor_color  # or doctor_gender

python bias_detection.py --model 1 --bias_type doctor_color --top_n {top_n_value} --token_num {token_num_value}  # or use doctor_gender

This project is licensed under the Apache License 2.0.
You may use, modify, and distribute this code under the terms of the license.
For full license details, please refer to the LICENSE file included in the repository.
