Paper Detail

Beyond Uniform Tokens: Adaptive Compression for Time Series Language Models

Jialin Gan, Xin Qiu, Guangzhe Chen, Xue Wang

Browse

Workflow Queues

arxiv Score 7.3

Published 2026-06-11 · First seen 2026-06-13

Research Track A · General AI

Open paper source

Abstract

Large language models (LLMs) have enabled time series (TS) analysis by jointly modeling numerical observations and textual context through a shared token interface. However, TS tokens and prompt tokens exhibit fundamentally different information structures, making uniform token processing inefficient. In this paper, we study token efficiency in TS language modeling from an asymmetric-token perspective. We show that TS tokens have highly uneven spectral contributions, where many tokens share redundant frequency patterns while a small subset preserves critical temporal evidence. We also observe that prompt-token influence attenuates with model depth, suggesting that full prompt retention across all layers is unnecessary. Based on these findings, we develop an adaptive token budgeting framework that compresses TS tokens via frequency-domain structure and progressively reduces prompt tokens across layers. Experiments across forecasting, classification, imputation, and anomaly detection demonstrate up to \textit{\textbf{7.68$\times$}} inference acceleration and performance gains in \textit{\textbf{78\%}} of evaluated settings, showing the effectiveness of asymmetric token compression for scalable TS foundation models.

Workflow Status

Review status: pending
Role: unreviewed
Read priority: soon
Vote: Not set.
Saved: no
Collections: Not filed yet.
Next action: Not filled yet.

Reading Brief

No structured notes yet. Add `summary_sections`, `why_relevant`, `claim_impact`, or `next_action` in `papers.jsonl` to enrich this view.

Why It Surfaced

No ranking explanation is available yet.

BibTeX

@article{gan2026beyond,
  title = {Beyond Uniform Tokens: Adaptive Compression for Time Series Language Models},
  author = {Jialin Gan and Xin Qiu and Guangzhe Chen and Xue Wang},
  year = {2026},
  abstract = {Large language models (LLMs) have enabled time series (TS) analysis by jointly modeling numerical observations and textual context through a shared token interface. However, TS tokens and prompt tokens exhibit fundamentally different information structures, making uniform token processing inefficient. In this paper, we study token efficiency in TS language modeling from an asymmetric-token perspective. We show that TS tokens have highly uneven spectral contributions, where many tokens share redu},
  url = {https://arxiv.org/abs/2606.13624},
  keywords = {cs.CL},
  eprint = {2606.13624},
  archiveprefix = {arXiv},
}

Metadata

{}