Skip to content

Kidrauh/flow3r

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Flow3r: Factored Flow Prediction for Scalable Visual Geometry Learning

Zhongxiao Cong    Qitao Zhao   Minsik Jeon   Shubham Tulsiani

Carnegie Mellon University

arXiv Project Page
overview

Overview

Flow3r augments visual geometry learning with dense 2D correspondences (`flow') as supervision, enabling scalable training from unlabeled monocular videos. Flow3r achieves state-of-the-art results across eight benchmarks spanning static and dynamic scenes, with its largest gains on in-the-wild dynamic videos where labeled data is most scarce.

Quick Start

1. Create the environment

conda create -n flow3r python=3.11
conda activate flow3r

pip install -r requirements.txt

2. Download and place checkpoint

  • flow3r.bin: Flow3r trained on ~834k video sequences.

Please fetch the checkpoint manually from Google Drive and drop the file into checkpoints/.

3. Launch the Gradio app

python gradio_app.py 

Acknowledgements

  • Our work builds upon several fantastic open-source projects. We would like to acknowledge and thank the authors of:
  • We also thank the members of the Physical Perception Lab at CMU for their valuable discussions.

Citation

If you find our work useful, please cite:

@inproceedings{cong2026flow3r,
    title={Flow3r: Factored Flow Prediction for Scalable Visual Geometry Learning},
    author={Cong, Zhongxiao and Zhao, Qitao and Jeon, Minsik and Tulsiani, Shubham},
    booktitle={CVPR},
    year={2026}
}

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages