A Very Big Video Reasoning Suite
We bet on a future that video reasoning is the next fundamental intelligence paradigm, after language reasoning, where spatiotemporal embodied world experiences could be more naturally captured.
circle_maximum_value
GitHub
Prompt
There are multiple numbers on the screen, circle the one with the largest value
First Frame
Last Frame
Video
maintain_object_identity_different_objects
GitHub
Prompt
The left object is blue and the right object is dark blue. The scene shows two objects, one on the left and one on the right. Swap the positions of the left and right objects. After the swap, draw an arrow below the object that was originally on the left, pointing up at it.
First Frame
Last Frame
Video
key_door_matching
GitHub
Prompt
The scene shows a maze with a green circular agent, colored diamond-shaped keys, and colored hollow rectangular doors. Find the Blue key and then navigate to the matching Blue door, showing the complete movement process step by step.
First Frame
Last Frame
Video
symbol_substitute
GitHub
Prompt
Substitute ◯ at position 1 with a orange ◇. The animation shows the old symbol fading out completely, then the new symbol gradually fading in at the same position.
First Frame
Last Frame
Video
multi_object_placement
GitHub
Prompt
The scene contains multiple colored objects and star markers. Keep all star markers unchanged in position. Move each colored object to the star marker with the same color using straight paths, aligning the center of each object with the center of its matching star marker.
First Frame
Last Frame
Video
Domino Chain Prediction - Samples
00
01
02
03
04
Prompt
Loading...
Ground Truth
First
Final
Model Outputs
1/
VBVR-Wan2.2
VBVR-Wan2.2
CogVideoX 1.5
Kling 2.6
LTX-2
Runway Gen-4
Sora 2
Veo 3
Wan 2.2 I2V
Hunyuan I2V
Seedance 2.0
Leaderboard
Modality
Split
Type
Category