Skip to content

VBVR-DataFactory/O-21_construction_blueprint_data-generator

Repository files navigation

O-21: Construction Blueprint Data Generator

Generates synthetic visual reasoning tasks involving spatial puzzle solving. A structure has a missing piece outlined with a dashed line, and the task is to identify which of the four candidate pieces correctly fills the gap.

Each sample pairs a task (first frame + prompt describing what needs to happen) with its ground truth solution (final frame showing the result + video demonstrating how to achieve it). This structure enables both model evaluation and training.


📌 Basic Information

Property Value
Task ID O-21
Task Construction Blueprint
Category Abstraction
Resolution 1024×1024 px
FPS 16 fps
Duration varies
Output PNG images + MP4 video

🚀 Usage

Installation

# Clone the repository
git clone https://github.com/VBVR-DataFactory/O-21_construction_blueprint_data-generator.git
cd O-21_construction_blueprint_data-generator

# Install dependencies
pip install -r requirements.txt

Generate Data

# Generate 100 samples
python examples/generate.py --num-samples 100

# Generate with specific seed
python examples/generate.py --num-samples 100 --seed 42

# Generate without videos
python examples/generate.py --num-samples 100 --no-videos

# Custom output directory
python examples/generate.py --num-samples 100 --output data/my_output

Command-Line Options

Argument Type Description Default
--num-samples int Number of samples to generate 100
--seed int Random seed for reproducibility Random
--output str Output directory data
--no-videos flag Skip video generation False

📖 Task Example

Prompt

In the scene, the upper structure has a missing piece outlined with a dashed line. There are 4 candidate pieces below. The video sequentially checks each candidate from left to right: highlights the current candidate being examined with a frame, previews how the piece fits in the gap, marks it with ✓ if the shape matches or ✗ if it doesn't, then moves to the next candidate. Once the matching piece is found, an animation demonstrates it moving into the gap to complete the structure.

Visual

Image Image Image
Initial Frame
Structure with gap and 4 candidates
Animation
Testing candidates, finding match, filling gap
Final Frame
Completed structure with correct piece

📖 Task Description

Objective

Identify the correct piece from four candidates that perfectly fills the missing gap in a structure, then animate the piece moving into place to complete the blueprint.

Task Setup

  • Incomplete Structure: Block-based structure displayed on a grid with one piece missing
  • Missing Piece Indication: Gap outlined with dashed lines showing required shape
  • Four Candidates: Four different shaped pieces presented below the structure
  • Grid System: 21×21 grid for precise spatial positioning
  • Structure Complexity: 12-20 blocks in the complete structure

Key Features

  • Spatial reasoning: Tests ability to match shapes and visualize how pieces fit together
  • Sequential evaluation: Systematically checks each candidate from left to right
  • Visual feedback: Highlights current candidate and shows preview of fit
  • Binary marking: Uses ✓ for correct match and ✗ for incorrect shapes
  • Shape matching: Requires precise geometric comparison of piece shapes with gap
  • Completion animation: Demonstrates the correct piece moving into position
  • Puzzle solving: Combines visual search, shape recognition, and spatial transformation

📦 Data Format

data/questions/construction_blueprint_task/construction_blueprint_00000000/
├── first_frame.png      # Initial state (structure with gap + candidates)
├── final_frame.png      # Final state (completed structure)
├── prompt.txt           # Task instructions
├── ground_truth.mp4     # Solution video (16 fps)
└── question_metadata.json # Task metadata

File specifications: Images are 1024×1024 PNG. Videos are MP4 at 16 fps, duration varies based on number of candidates checked.


🏷️ Tags

spatial-reasoning puzzle-solving shape-matching pattern-completion visual-search geometric-reasoning construction


About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages