Paper Detail

FlowIt: Global Matching for Optical Flow with Confidence-Guided Refinement

Sadra Safadoust, Fabio Tosi, Matteo Poggi, Fatma Güney

arxiv Score 3.8

Published 2026-03-30 · First seen 2026-03-31

General AI

Abstract

We present FlowIt, a novel architecture for optical flow estimation designed to robustly handle large pixel displacements. At its core, FlowIt leverages a hierarchical transformer architecture that captures extensive global context, enabling the model to effectively model long-range correspondences. To overcome the limitations of localized matching, we formulate the flow initialization as an optimal transport problem. This formulation yields a highly robust initial flow field, alongside explicitly derived occlusion and confidence maps. These cues are then seamlessly integrated into a guided refinement stage, where the network actively propagates reliable motion estimates from high-confidence regions into ambiguous, low-confidence areas. Extensive experiments across the Sintel, KITTI, Spring, and LayeredFlow datasets validate the efficacy of our approach. FlowIt achieves state-of-the-art results on the competitive Sintel and KITTI benchmarks, while simultaneously establishing new state-of-the-art cross-dataset zero-shot generalization performance on Sintel, Spring, and LayeredFlow.

Workflow Status

Review status
pending
Role
unreviewed
Read priority
later
Vote
Not set.
Saved
no
Collections
Not filed yet.
Next action
Not filled yet.

Reading Brief

No structured notes yet. Add `summary_sections`, `why_relevant`, `claim_impact`, or `next_action` in `papers.jsonl` to enrich this view.

Why It Surfaced

No ranking explanation is available yet.

Tags

No tags.

BibTeX

@article{safadoust2026flowit,
  title = {FlowIt: Global Matching for Optical Flow with Confidence-Guided Refinement},
  author = {Sadra Safadoust and Fabio Tosi and Matteo Poggi and Fatma Güney},
  year = {2026},
  abstract = {We present FlowIt, a novel architecture for optical flow estimation designed to robustly handle large pixel displacements. At its core, FlowIt leverages a hierarchical transformer architecture that captures extensive global context, enabling the model to effectively model long-range correspondences. To overcome the limitations of localized matching, we formulate the flow initialization as an optimal transport problem. This formulation yields a highly robust initial flow field, alongside explicit},
  url = {https://arxiv.org/abs/2603.28759},
  keywords = {cs.CV},
  eprint = {2603.28759},
  archiveprefix = {arXiv},
}

Metadata

{}