Paper Detail

Skim: Speculative Execution for Fast and Efficient Web Agents

Mike Wong, Kevin Hsieh, Suman Nath, Ravi Netravali

Browse

Workflow Queues

arxiv Score 11.5

Published 2026-05-15 · First seen 2026-05-25

Research Track B · General AI

Open paper source

Abstract

Skim is a speculative execution framework for web agents that exploits the predictable structure of purpose-built websites. Today's web-agent expense is not intrinsic to the tasks but a property of how agents are composed: frontier-model inference, browser rendering, and ReAct-style planning are applied to every step of every task regardless of complexity. Skim's key observation is that websites enforce stable URL patterns, answer formats, and task-to-trajectory mappings across queries of the same type, so most queries can bypass these heavyweight components entirely. An offline profiler captures these patterns once per site. At runtime, Skim matches each query to a template, synthesizes the destination URL, and extracts the answer with a small model. A lightweight verifier gates each fast-path output against the query and schema; rare misspeculations cascade to the full agent, warm-started by the fast path's final URL to preserve upstream trajectory progress. Across standard web-agent benchmarks paired with three backboneagents (WebVoyager, AgentOccam, BrowserUse), Skim reduces median per-task cost by 1.9x and latency by 33.4% with no accuracy loss.

Workflow Status

Review status: pending
Role: unreviewed
Read priority: now
Vote: Not set.
Saved: no
Collections: Not filed yet.
Next action: Not filled yet.

Reading Brief

No structured notes yet. Add `summary_sections`, `why_relevant`, `claim_impact`, or `next_action` in `papers.jsonl` to enrich this view.

Why It Surfaced

No ranking explanation is available yet.

BibTeX

@article{wong2026skim,
  title = {Skim: Speculative Execution for Fast and Efficient Web Agents},
  author = {Mike Wong and Kevin Hsieh and Suman Nath and Ravi Netravali},
  year = {2026},
  abstract = {Skim is a speculative execution framework for web agents that exploits the predictable structure of purpose-built websites. Today's web-agent expense is not intrinsic to the tasks but a property of how agents are composed: frontier-model inference, browser rendering, and ReAct-style planning are applied to every step of every task regardless of complexity. Skim's key observation is that websites enforce stable URL patterns, answer formats, and task-to-trajectory mappings across queries of the sa},
  url = {https://arxiv.org/abs/2605.16565},
  keywords = {cs.AI, cs.OS},
  eprint = {2605.16565},
  archiveprefix = {arXiv},
}

Metadata

{}