Paper Detail

CABLE: Cloud-Assisted Bandwidth-efficient LMM-based Encoding for V2X Systems

Haohua Que, Zhipeng Bao, Qianyi Wu, Handong Yao

Browse

Workflow Queues

arxiv Score 6.3

Published 2026-06-17 · First seen 2026-06-18

General AI

Open paper source

Abstract

Cloud-hosted large multimodal models (LMMs) can provide strong open-vocabulary perception for Vehicle-to-Everything systems, but naively transmitting full-resolution frames from edge to cloud causes severe communication overhead and high cloud-side prefill latency. We present CABLE, a cloud-assisted bandwidth-efficient LMM-based encoding framework for edge-cloud perception. CABLE propagates the previous cloud segmentation mask on the edge using ego-motion compensation, refines it with residual-motion cues, and consolidates disconnected regions via a corridor envelope to form a robust region of interest (ROI). Only ROI-masked images are uploaded, while the cloud segmentation output is fed back as the prior for the next frame, forming a mask-to-ROI-to-LMM feedback loop. Experiments on five datasets (nuScenes, WOD-ZB, Waymo, KITTI, and CADC) show consistent communication savings while largely preserving perception, achieving $73$--$87\%$ ROI pixel-coverage reduction with $5$--$8\times$ estimated LMM prefill speedup at a modest detection-quality trade-off relative to full-frame inference.

Workflow Status

Review status: pending
Role: unreviewed
Read priority: soon
Vote: Not set.
Saved: no
Collections: Not filed yet.
Next action: Not filled yet.

Reading Brief

No structured notes yet. Add `summary_sections`, `why_relevant`, `claim_impact`, or `next_action` in `papers.jsonl` to enrich this view.

Why It Surfaced

No ranking explanation is available yet.

BibTeX

@article{que2026cable,
  title = {CABLE: Cloud-Assisted Bandwidth-efficient LMM-based Encoding for V2X Systems},
  author = {Haohua Que and Zhipeng Bao and Qianyi Wu and Handong Yao},
  year = {2026},
  abstract = {Cloud-hosted large multimodal models (LMMs) can provide strong open-vocabulary perception for Vehicle-to-Everything systems, but naively transmitting full-resolution frames from edge to cloud causes severe communication overhead and high cloud-side prefill latency. We present CABLE, a cloud-assisted bandwidth-efficient LMM-based encoding framework for edge-cloud perception. CABLE propagates the previous cloud segmentation mask on the edge using ego-motion compensation, refines it with residual-m},
  url = {https://arxiv.org/abs/2606.19258},
  keywords = {cs.CV, cs.RO},
  eprint = {2606.19258},
  archiveprefix = {arXiv},
}

Metadata

{}