Paper Detail

Benchmarking Composed Image Retrieval for Applied Earth Observation

Bill Psomas, Dionysis Christopoulos, Thanasis Petropoulos, Nikos Efthymiadis, Ioannis Kakogeorgiou, Ondřej Chum, Yannis Avrithis, Giorgos Tolias, Konstantinos Karantzalos

huggingface Score 9.0

Published 2026-05-23 · First seen 2026-06-01

General AI

Abstract

Remote sensing composed image retrieval (RSCIR) enables search in large satellite image archives using composed queries that combine a reference image with a textual modifier. Although RSCIR offers a flexible interface for expressing targeted retrieval intent, the transferability of modern composition methods to Earth observation (EO) imagery and their relevance to operational EO workflows remain underexplored. We address this gap through a unified benchmark and an application-oriented study. First, we systematically adapt and evaluate representative composed image retrieval methods with six vision-language backbones on PatternCom under a standardized protocol, analyzing their behavior across backbones, composition strategies, and query types. Second, we introduce xView2-CIR, a change-centric dataset for disaster and damage monitoring, where retrieval is conditioned on scene identity and a target post-event state. Our results show that training-free composition methods provide strong and scalable baselines for EO retrieval, while change-centric retrieval presents different challenges from attribute-based retrieval, particularly due to the need to preserve scene identity. Overall, this study establishes a practical benchmark for RSCIR and positions composed retrieval as a complementary tool for remote sensing image retrieval, archive exploration, and change analysis. The dataset and code are available at https://github.com/billpsomas/rscir.

Workflow Status

Review status
pending
Role
unreviewed
Read priority
soon
Vote
Not set.
Saved
no
Collections
Not filed yet.
Next action
Not filled yet.

Reading Brief

No structured notes yet. Add `summary_sections`, `why_relevant`, `claim_impact`, or `next_action` in `papers.jsonl` to enrich this view.

Why It Surfaced

No ranking explanation is available yet.

Tags

No tags.

BibTeX

@misc{psomas2026benchmarking,
  title = {Benchmarking Composed Image Retrieval for Applied Earth Observation},
  author = {Bill Psomas and Dionysis Christopoulos and Thanasis Petropoulos and Nikos Efthymiadis and Ioannis Kakogeorgiou and Ondřej Chum and Yannis Avrithis and Giorgos Tolias and Konstantinos Karantzalos},
  year = {2026},
  abstract = {Remote sensing composed image retrieval (RSCIR) enables search in large satellite image archives using composed queries that combine a reference image with a textual modifier. Although RSCIR offers a flexible interface for expressing targeted retrieval intent, the transferability of modern composition methods to Earth observation (EO) imagery and their relevance to operational EO workflows remain underexplored. We address this gap through a unified benchmark and an application-oriented study. Fi},
  url = {https://huggingface.co/papers/2605.24442},
  keywords = {remote sensing composed image retrieval, vision-language backbones, PatternCom, xView2-CIR, change-centric retrieval, attribute-based retrieval, scene identity, compositional methods, huggingface daily},
  eprint = {2605.24442},
  archiveprefix = {arXiv},
}

Metadata

{}