Generates synthetic visual reasoning tasks involving spatial puzzle solving. A structure has a missing piece outlined with a dashed line, and the task is to identify which of the four candidate pieces correctly fills the gap.
Each sample pairs a task (first frame + prompt describing what needs to happen) with its ground truth solution (final frame showing the result + video demonstrating how to achieve it). This structure enables both model evaluation and training.
| Property | Value |
|---|---|
| Task ID | O-21 |
| Task | Construction Blueprint |
| Category | Abstraction |
| Resolution | 1024×1024 px |
| FPS | 16 fps |
| Duration | varies |
| Output | PNG images + MP4 video |
# Clone the repository
git clone https://github.com/VBVR-DataFactory/O-21_construction_blueprint_data-generator.git
cd O-21_construction_blueprint_data-generator
# Install dependencies
pip install -r requirements.txt# Generate 100 samples
python examples/generate.py --num-samples 100
# Generate with specific seed
python examples/generate.py --num-samples 100 --seed 42
# Generate without videos
python examples/generate.py --num-samples 100 --no-videos
# Custom output directory
python examples/generate.py --num-samples 100 --output data/my_output| Argument | Type | Description | Default |
|---|---|---|---|
--num-samples |
int | Number of samples to generate | 100 |
--seed |
int | Random seed for reproducibility | Random |
--output |
str | Output directory | data |
--no-videos |
flag | Skip video generation | False |
In the scene, the upper structure has a missing piece outlined with a dashed line. There are 4 candidate pieces below. The video sequentially checks each candidate from left to right: highlights the current candidate being examined with a frame, previews how the piece fits in the gap, marks it with ✓ if the shape matches or ✗ if it doesn't, then moves to the next candidate. Once the matching piece is found, an animation demonstrates it moving into the gap to complete the structure.
![]() |
![]() |
![]() |
| Initial Frame Structure with gap and 4 candidates |
Animation Testing candidates, finding match, filling gap |
Final Frame Completed structure with correct piece |
Identify the correct piece from four candidates that perfectly fills the missing gap in a structure, then animate the piece moving into place to complete the blueprint.
- Incomplete Structure: Block-based structure displayed on a grid with one piece missing
- Missing Piece Indication: Gap outlined with dashed lines showing required shape
- Four Candidates: Four different shaped pieces presented below the structure
- Grid System: 21×21 grid for precise spatial positioning
- Structure Complexity: 12-20 blocks in the complete structure
- Spatial reasoning: Tests ability to match shapes and visualize how pieces fit together
- Sequential evaluation: Systematically checks each candidate from left to right
- Visual feedback: Highlights current candidate and shows preview of fit
- Binary marking: Uses ✓ for correct match and ✗ for incorrect shapes
- Shape matching: Requires precise geometric comparison of piece shapes with gap
- Completion animation: Demonstrates the correct piece moving into position
- Puzzle solving: Combines visual search, shape recognition, and spatial transformation
data/questions/construction_blueprint_task/construction_blueprint_00000000/
├── first_frame.png # Initial state (structure with gap + candidates)
├── final_frame.png # Final state (completed structure)
├── prompt.txt # Task instructions
├── ground_truth.mp4 # Solution video (16 fps)
└── question_metadata.json # Task metadata
File specifications: Images are 1024×1024 PNG. Videos are MP4 at 16 fps, duration varies based on number of candidates checked.
spatial-reasoning puzzle-solving shape-matching pattern-completion visual-search geometric-reasoning construction


