ECCV Poster Understanding Physical Dynamics with Counterfactual World Modeling

Poster

Understanding Physical Dynamics with Counterfactual World Modeling

Rahul Mysore Venkatesh · Honglin Chen · Kevin Feigelis · Daniel M Bear · Khaled Jedoui · Klemen Kotar · Felix J Binder · Wanhee Lee · Sherry Liu · Kevin Smith · Judith E. Fan · Daniel Yamins

[ Abstract ] [ Paper PDF ]

[ Poster] [ Supplemental]

2024 Poster

Abstract:

The ability to understand physical dynamics is critical for agents to act in the world. Here, we use Counterfactual World Modeling (CWM) to extract vision structures for dynamics understanding. CWM uses a temporally-factored masking policy for masked prediction of video data without annotations. This policy enables highly effective ``counterfactual prompting'' of the predictor, allowing a spectrum of visual structures to be extracted from a single pre-trained predictor in a zero-shot manner. We demonstrate that these structures are useful for physical dynamics understanding, allowing CWM to achieve the state-of-the-art performance on the Physion benchmark.

Chat is not available.