Generates synthetic animal size sorting tasks. The goal is to sort animal faces by size (from smallest to largest or largest to smallest) and align them horizontally at the bottom baseline.
Each sample pairs a task (first frame + prompt describing what needs to happen) with its ground truth solution (final frame showing the result + video demonstrating how to achieve it). This structure enables both model evaluation and training.
| Property | Value |
|---|---|
| Task ID | O-65 |
| Task | Animal Size Sorting |
| Category | Perception |
| Resolution | 1024×1024 px |
| FPS | 16 fps |
| Duration | ~3-5 seconds |
| Output | PNG images (no video by default) |
# Clone the repository
git clone https://github.com/Jiaqi-Gong/Gong_VBVR_Data.git
cd Gong_VBVR_Data/O-65_animal_size_sorting_data-generator
# Install dependencies
pip install -r requirements.txt
# Generate 100 samples
python examples/generate.py --num-samples 100
# Generate with specific seed
python examples/generate.py --num-samples 100 --seed 42
# Custom output directory
python examples/generate.py --num-samples 100 --output data/my_output

| Argument | Type | Description | Default |
|---|---|---|---|
| `--num-samples` | int | Number of samples to generate | Required |
| `--seed` | int | Random seed for reproducibility | Random |
| `--output` | str | Output directory | `data/questions` |
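The argument table above could be mirrored by a small `argparse` setup. This is only a sketch of how such a command line might be parsed, not the actual contents of `examples/generate.py`; the function name `build_parser` and the help strings are assumptions made for illustration.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser matching the documented arguments."""
    parser = argparse.ArgumentParser(
        description="Generate animal size sorting samples (illustrative sketch)."
    )
    # Required: how many samples to generate.
    parser.add_argument("--num-samples", type=int, required=True,
                        help="Number of samples to generate")
    # Optional: None means no fixed seed, so each run differs.
    parser.add_argument("--seed", type=int, default=None,
                        help="Random seed for reproducibility")
    # Optional: documented default output directory.
    parser.add_argument("--output", type=str, default="data/questions",
                        help="Output directory")
    return parser


# Parse an explicit argument list so the sketch runs without a real CLI call.
args = build_parser().parse_args(["--num-samples", "100", "--seed", "42"])
print(args.num_samples, args.seed, args.output)
```

Leaving `--seed` unset is represented here as `None`, which matches the documented "Random" default only by assumption.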
Animal faces of different sizes are scattered randomly on the canvas. Sort them by size from smallest to largest and align them horizontally at the bottom baseline.
| Initial Frame | Animation | Final Frame |
|---|---|---|
| Animal faces scattered randomly | Sorting and aligning process | Animals sorted by size and aligned |
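One plausible way to produce the animation between the initial and final frames is to interpolate each animal's position linearly over the clip. The sketch below assumes linear motion and a 4-second clip at the documented 16 fps; the function `interpolate_positions` is hypothetical, and the generator may use a different motion model or easing.

```python
def interpolate_positions(start, end, duration_s=4.0, fps=16):
    """Return per-frame (x, y) positions moving linearly from start to end.

    start, end: lists of (x, y) tuples, one per animal, in the same order.
    """
    n_frames = int(round(duration_s * fps))
    frames = []
    for i in range(n_frames):
        # t runs from 0.0 on the first frame to 1.0 on the last frame.
        t = i / (n_frames - 1) if n_frames > 1 else 1.0
        frames.append([
            (sx + (ex - sx) * t, sy + (ey - sy) * t)
            for (sx, sy), (ex, ey) in zip(start, end)
        ])
    return frames


frames = interpolate_positions([(200, 300)], [(60, 964)])
print(len(frames))  # 64 frames for 4 seconds at 16 fps
```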
Sort animal faces by size (ascending or descending order) and align them horizontally at the bottom baseline, demonstrating size comparison and spatial organization reasoning.
- Number of Animals: 3-5 animals
- Size Range: 30-110 pixels (minimum 25-40, maximum 70-100). The task requires comparing each animal's size attribute and repositioning the animals accordingly.
- Sort Order: Ascending (smallest to largest) or descending (largest to smallest)
- Layout: Animals scattered randomly initially, aligned horizontally at bottom baseline in final state
- Visual Clarity: Clear size differences between animals
- Size comparison: Tests ability to compare and order objects by size
- Spatial organization: Requires aligning objects in a specific layout
- Sorting logic: Must understand ascending vs descending order
- Visual reasoning: Must identify size relationships between objects
- Baseline alignment: All animals aligned at bottom baseline
- Random initial positions: Animals start in random scattered positions
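The sorting and baseline alignment described in the lists above can be sketched as follows. This is a minimal illustration under stated assumptions, not the generator's actual layout code: the `Animal` dataclass, the `sort_and_align` function, the 40-pixel margin, and the even spacing rule are all hypothetical. Only the 1024-pixel canvas comes from the specification table.

```python
from dataclasses import dataclass

CANVAS = 1024  # canvas side in pixels, from the specification table


@dataclass
class Animal:
    name: str
    size: int   # face width and height in pixels (assumed square)
    x: float    # center x
    y: float    # center y


def sort_and_align(animals, ascending=True, margin=40, canvas=CANVAS):
    """Sort by size and place faces left to right on a shared bottom baseline."""
    ordered = sorted(animals, key=lambda a: a.size, reverse=not ascending)
    total_width = sum(a.size for a in ordered)
    # Spread the leftover horizontal space evenly between neighbours.
    n = len(ordered)
    gap = (canvas - 2 * margin - total_width) / (n - 1) if n > 1 else 0.0
    baseline = canvas - margin  # y coordinate of every face's bottom edge
    placed, left = [], float(margin)
    for a in ordered:
        placed.append(Animal(a.name, a.size, left + a.size / 2, baseline - a.size / 2))
        left += a.size + gap
    return placed


scattered = [Animal("cat", 80, 500, 300), Animal("dog", 40, 100, 700),
             Animal("fox", 60, 800, 200)]
for a in sort_and_align(scattered, ascending=True):
    print(a.name, a.size, a.x, a.y)
```

Aligning bottom edges rather than centers is what makes the size differences easy to read along the baseline, since every face rises from the same line.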
data/questions/animal_size_sorting_task/animal_size_sorting_00000000/
├── first_frame.png # Initial state (animals scattered)
├── final_frame.png # Goal state (animals sorted and aligned)
├── prompt.txt # Task instructions
├── ground_truth.mp4 # Solution video (if generated)
└── question_metadata.json # Task metadata
File specifications: Images are 1024×1024 PNG. Videos are MP4 at 16 fps (if generated), approximately 3-5 seconds long.
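A generated sample directory can be checked and read with the standard library alone. The sketch below relies only on the file names listed above; the function `load_sample` is hypothetical, and because the schema of `question_metadata.json` is not documented here, the metadata is returned as parsed JSON without interpreting its fields.

```python
import json
from pathlib import Path

REQUIRED = ("first_frame.png", "final_frame.png", "prompt.txt",
            "question_metadata.json")


def load_sample(sample_dir):
    """Read one sample directory and return its contents as a dict."""
    sample_dir = Path(sample_dir)
    missing = [name for name in REQUIRED if not (sample_dir / name).is_file()]
    if missing:
        raise FileNotFoundError(f"{sample_dir} is missing: {', '.join(missing)}")
    video = sample_dir / "ground_truth.mp4"
    return {
        "prompt": (sample_dir / "prompt.txt").read_text(encoding="utf-8").strip(),
        "metadata": json.loads(
            (sample_dir / "question_metadata.json").read_text(encoding="utf-8")
        ),
        "first_frame": sample_dir / "first_frame.png",
        "final_frame": sample_dir / "final_frame.png",
        # The video is optional, since it is not generated by default.
        "video": video if video.is_file() else None,
    }
```

A call such as `load_sample("data/questions/animal_size_sorting_task/animal_size_sorting_00000000")` would return the prompt text, the metadata, and paths to both frames, with `video` set to `None` when no MP4 was generated.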
animal-sorting size-comparison spatial-organization sorting-logic visual-reasoning logic symbol-manipulation


