dWorldEval: Scalable Robotic Policy Evaluation via Discrete Diffusion World Model

Yaxuan Li1 Zhongyi Zhou1 Yefei Chen1 Yaokai Xue1 Yichen Zhu2
1Current Robotics 2University of Toronto

Real Execution vs. World Model Rollouts

1. LIBERO Tasks

Pick up the black bowl on the cookie box and place it on the plate.
3rd-person
Wrist
3rd-person
Wrist
Pick up the black bowl on the cookie box and place it on the plate.
3rd-person
Wrist
3rd-person
Wrist
Pick up bowl from center place on plate.
3rd-person
Wrist
3rd-person
Wrist
Pick up bowl next to plate place on plate.
3rd-person
Wrist
3rd-person
Wrist

2. RoboTwin Tasks

Use one arm to pick up the can and move it to beside the pot.
Top
Left
Right
Top
Left
Right
If there is one bread on the table, use one arm to grab the bread and put it into the skillet.
Top
Left
Right
Top
Left
Right
If there is one bread on the table, use one arm to grab the bread and put it in the basket.
Top
Left
Right
Top
Left
Right
Use two arms to simultaneously grab two breads and put them in the basket.
Top
Left
Right
Top
Left
Right
Place the container onto the plate.
Top
Left
Right
Top
Left
Right
Use an arm to place the empty cup on the coaster.
Top
Left
Right
Top
Left
Right
Stack the bowls together using two arms.
Top
Left
Right
Top
Left
Right
Use one arm to pick up the can and move it to beside the pot.
Top
Left
Right
Top
Left
Right

3. Real-World Tasks

Pick up the hammer, then strike the red block.
Top
Left
Right
Top
Left
Right
Pass the red block to the right arm to place it on the blue mat.
Top
Left
Right
Top
Left
Right
Place the empty blue cup to the cup mat.
Top
Left
Right
Top
Left
Right
Pick up one bottle with one arm, and pick up another bottle with the other arm.
Top
Left
Right
Top
Left
Right
Clean the table.
Top
Left
Right
Top
Left
Right
Clean the table.
Top
Left
Right
Top
Left
Right
Clean the table.
Top
Left
Right
Top
Left
Right
Clean the table.
Top
Left
Right
Top
Left
Right
Clean the table.
Top
Left
Right
Top
Left
Right
Clean the table.
Top
Left
Right
Top
Left
Right
Clean the table.
Top
Left
Right
Top
Left
Right
Clean the table.
Top
Left
Right
Top
Left
Right

Experimental Results