VBVR-DataFactory/G-200_circle_maximum_value_data-generator

G-200: Circle Maximum Value Data Generator

Generates synthetic tasks where multiple numbers are displayed. The goal is to identify the number with the largest value and circle it.

Each sample pairs a task (first frame + prompt describing what needs to happen) with its ground truth solution (final frame showing the result + video demonstrating how to achieve it). This structure enables both model evaluation and training.


📌 Basic Information

| Property | Value |
|---|---|
| Task ID | G-200 |
| Task | Circle Maximum Value |
| Category | Knowledge |
| Resolution | 1024×1024 px |
| FPS | 16 fps |
| Duration | ~3.1 seconds |
| Output | PNG images + MP4 video |

🚀 Usage

Installation

# Clone the repository
git clone https://github.com/VBVR-DataFactory/G-200_circle_maximum_value_data-generator.git
cd G-200_circle_maximum_value_data-generator

# Install dependencies
pip install -r requirements.txt

Generate Data

# Generate 100 samples
python examples/generate.py --num-samples 100

# Generate with specific seed
python examples/generate.py --num-samples 100 --seed 42

# Generate without videos
python examples/generate.py --num-samples 100 --no-videos

# Custom output directory
python examples/generate.py --num-samples 100 --output data/my_output

Command-Line Options

| Argument | Type | Description | Default |
|---|---|---|---|
| --num-samples | int | Number of samples to generate | 100 |
| --seed | int | Random seed for reproducibility | Random |
| --output | str | Output directory | data/questions |
| --no-videos | flag | Skip video generation | False |
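
For readers wiring these options into their own tooling, the table above maps naturally onto an `argparse` parser. The following is a minimal sketch based only on the documented flags and defaults; it is not the actual contents of `examples/generate.py`, and the function name `build_parser` is hypothetical.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the documented options (not the repo's code)."""
    parser = argparse.ArgumentParser(
        description="Generate Circle Maximum Value samples (illustrative sketch)."
    )
    parser.add_argument("--num-samples", type=int, default=100,
                        help="Number of samples to generate")
    # None stands in for the documented "Random" default: no fixed seed.
    parser.add_argument("--seed", type=int, default=None,
                        help="Random seed for reproducibility")
    parser.add_argument("--output", type=str, default="data/questions",
                        help="Output directory")
    # A flag: absent means False, present means True.
    parser.add_argument("--no-videos", action="store_true",
                        help="Skip video generation")
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(args)
```

Representing the "Random" seed default as `None` is an assumption; it lets downstream code decide whether to seed the generator at all.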

📖 Task Example

Prompt

There are multiple numbers on the screen, circle the one with the largest value

Visual

| Initial Frame | Animation | Final Frame |
|---|---|---|
| Multiple numbers displayed | Red circle expands around maximum value | Maximum value circled |

📖 Task Description

Objective

Identify the number with the largest numerical value among multiple displayed numbers and circle it.

Task Setup

  • Number Count: 4-6 numbers per image
  • Value Range: Numbers between 1 and 99
  • Number Types: Both single-digit and double-digit integers
  • Marking Method: Red circle expanding around the maximum value
  • Color Coding: Each number rendered in a distinct color for visual differentiation
  • Layout: Numbers positioned randomly with minimum spacing of 150px to prevent overlap
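
The setup above can be sketched with rejection sampling. This is an illustrative sketch, not the generator's actual code: it assumes the values are drawn without repetition (one simple way to guarantee a single maximum), that the 150px spacing is measured as Euclidean distance between number centers, and that a 100px margin keeps numbers away from the canvas edge. The names `make_task`, `sample_positions`, and `MARGIN` are hypothetical.

```python
import math
import random

CANVAS = 1024       # px, from the Basic Information table
MIN_SPACING = 150   # px, from the Layout bullet
MARGIN = 100        # px, hypothetical edge margin (not stated in the README)


def sample_positions(rng: random.Random, count: int, max_tries: int = 10_000):
    """Rejection-sample `count` centers that are pairwise >= MIN_SPACING apart."""
    positions = []
    for _ in range(max_tries):
        if len(positions) == count:
            break
        candidate = (rng.randint(MARGIN, CANVAS - MARGIN),
                     rng.randint(MARGIN, CANVAS - MARGIN))
        # Keep the candidate only if it is far enough from every accepted center.
        if all(math.dist(candidate, p) >= MIN_SPACING for p in positions):
            positions.append(candidate)
    if len(positions) < count:
        raise RuntimeError("could not place all numbers with the required spacing")
    return positions


def make_task(seed: int) -> dict:
    """Build one task: 4-6 distinct values in 1..99, their positions, and the answer."""
    rng = random.Random(seed)
    count = rng.randint(4, 6)
    # Assumption: sampling without replacement makes the maximum unique.
    values = rng.sample(range(1, 100), count)
    positions = sample_positions(rng, count)
    target_index = max(range(count), key=values.__getitem__)
    return {"values": values, "positions": positions, "target_index": target_index}


if __name__ == "__main__":
    print(make_task(seed=42))
```

Using a per-task `random.Random(seed)` instance keeps the sketch reproducible for a given seed without touching the global random state.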

Key Features

  • Numerical comparison: Tests ability to identify maximum value among multiple numbers
  • Visual search: Requires scanning all numbers to find the largest
  • Clear animation: Red circle expands smoothly over 20 frames
  • Color differentiation: Each number has a unique color to aid distinction
  • Unambiguous answer: Only one number is the maximum
  • Font consistency: All numbers use the same font size (120pt) for fair comparison
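
To make the animation timing concrete, here is a rough sketch of a per-frame radius schedule. Only the 16 fps rate, the 20-frame expansion, and the ~3.1 second duration come from this README; the total of 50 frames (50 / 16 = 3.125 s), the linear growth, and the 10-frame lead-in are assumptions, and `radius_schedule` is a hypothetical name.

```python
FPS = 16             # from the Basic Information table
EXPAND_FRAMES = 20   # from the "Clear animation" bullet
TOTAL_FRAMES = 50    # assumption: 50 frames at 16 fps is 3.125 s, i.e. "~3.1 seconds"


def radius_schedule(final_radius: float, lead_in: int = 10) -> list[float]:
    """Return one circle radius per frame: hold at 0, grow linearly, then hold."""
    hold_out = TOTAL_FRAMES - lead_in - EXPAND_FRAMES
    if hold_out < 0:
        raise ValueError("lead_in leaves no room for the expansion")
    radii = [0.0] * lead_in  # frames showing only the numbers
    # Linear growth; the last expansion frame reaches final_radius exactly.
    radii += [final_radius * (i + 1) / EXPAND_FRAMES for i in range(EXPAND_FRAMES)]
    radii += [final_radius] * hold_out  # frames holding the finished circle
    return radii


if __name__ == "__main__":
    schedule = radius_schedule(90.0)
    print(len(schedule), "frames,", len(schedule) / FPS, "seconds")
```

An eased curve would also satisfy "expands smoothly"; linear interpolation is used here only because it is the simplest schedule to verify.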

📦 Data Format

data/questions/circle_maximum_value_task/circle_maximum_value_00000000/
├── first_frame.png      # Initial state (numbers without marking)
├── final_frame.png      # Goal state (maximum circled)
├── prompt.txt           # Task instructions
├── ground_truth.mp4     # Solution video (16 fps)
└── question_metadata.json # Task metadata

File specifications: Images are 1024×1024 PNG. Videos are MP4 at 16 fps, approximately 3.1 seconds long.
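
A quick way to sanity-check generated output is to confirm that each sample directory contains the files listed above. The sketch below uses only the standard library; the helper names are hypothetical, and treating `ground_truth.mp4` as absent when `--no-videos` was used is an assumption about that flag's effect.

```python
from pathlib import Path

# File names taken from the directory listing above.
REQUIRED_FILES = ("first_frame.png", "final_frame.png",
                  "prompt.txt", "question_metadata.json")
VIDEO_FILES = ("ground_truth.mp4",)  # assumed missing for --no-videos runs


def missing_files(sample_dir, expect_video: bool = True) -> list[str]:
    """List expected file names that are not present in one sample directory."""
    sample_dir = Path(sample_dir)
    expected = REQUIRED_FILES + (VIDEO_FILES if expect_video else ())
    return [name for name in expected if not (sample_dir / name).is_file()]


def check_dataset(root="data/questions/circle_maximum_value_task",
                  expect_video: bool = True) -> dict[str, list[str]]:
    """Map each incomplete sample directory name to its missing files."""
    report = {}
    for sample_dir in sorted(p for p in Path(root).iterdir() if p.is_dir()):
        missing = missing_files(sample_dir, expect_video)
        if missing:
            report[sample_dir.name] = missing
    return report


if __name__ == "__main__":
    # An empty dict means every sample directory is complete.
    print(check_dataset())
```

Pass `expect_video=False` when checking a dataset generated with `--no-videos`.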


🏷️ Tags

counting recognition numerical-comparison maximum-value circling visual-search cognitive-reasoning
