Paper Detail

Bayesian Sparse Low-Rank Adaptation for Large Language Model Uncertainty Estimation

Jijie Zhang, Zhe Ren, Quan Zhang, Dandan Guo

Browse

Workflow Queues

arxiv Score 12.6

Published 2026-07-02 · First seen 2026-07-03

General AI

Open paper source

Abstract

Large language models (LLMs) exhibit remarkable reasoning capabilities, but their task-specific fine-tuning is notoriously plagued by overconfidence, severely hindering trustworthy deployment. We propose Data-Adaptive Lower-Rank Adaptation (DALorRA), a simple and effective variational Bayesian sparse framework that shifts the paradigm of uncertainty quantification from the dense parameter space to the lightweight rank level of low-rank adaptation (LoRA). With the insight that LoRA essentially aggregates multiple rank-one components that may provide superfluous model capacity, DALorRA imposes stochastic masking on rank dimensions, enabling Bayesian regularization of model capacity during training and ensemble-like calibration during inference. Extensive experiments demonstrate DALorRA's excellent calibration of LLMs without compromising reasoning accuracy.

Workflow Status

Review status: pending
Role: unreviewed
Read priority: now
Vote: Not set.
Saved: no
Collections: Not filed yet.
Next action: Not filled yet.

Reading Brief

No structured notes yet. Add `summary_sections`, `why_relevant`, `claim_impact`, or `next_action` in `papers.jsonl` to enrich this view.

Why It Surfaced

No ranking explanation is available yet.

BibTeX

@article{zhang2026bayesian,
  title = {Bayesian Sparse Low-Rank Adaptation for Large Language Model Uncertainty Estimation},
  author = {Jijie Zhang and Zhe Ren and Quan Zhang and Dandan Guo},
  year = {2026},
  abstract = {Large language models (LLMs) exhibit remarkable reasoning capabilities, but their task-specific fine-tuning is notoriously plagued by overconfidence, severely hindering trustworthy deployment. We propose Data-Adaptive Lower-Rank Adaptation (DALorRA), a simple and effective variational Bayesian sparse framework that shifts the paradigm of uncertainty quantification from the dense parameter space to the lightweight rank level of low-rank adaptation (LoRA). With the insight that LoRA essentially ag},
  url = {https://arxiv.org/abs/2607.02182},
  keywords = {cs.LG, cs.CL},
  eprint = {2607.02182},
  archiveprefix = {arXiv},
}

Metadata

{}