Paper Detail

Tadabur: A Large-Scale Quran Audio Dataset

Faisal Alherran

huggingface Score 3.5

Published 2026-04-21 · First seen 2026-04-23

General AI

Abstract

Despite growing interest in Quranic data research, existing Quran datasets remain limited in both scale and diversity. To address this gap, we present Tadabur, a large-scale Quran audio dataset. Tadabur comprises more than 1,400 hours of recitation audio from over 600 distinct reciters, providing substantial variation in recitation styles, vocal characteristics, and recording conditions. This diversity makes Tadabur a comprehensive and representative resource for Quranic speech research and analysis. By significantly expanding both the total duration and variability of available Quran data, Tadabur aims to support future research and facilitate the development of standardized Quranic speech benchmarks.

Workflow Status

Review status: pending
Role: unreviewed
Read priority: later
Vote: Not set.
Saved: no
Collections: Not filed yet.
Next action: Not filled yet.

BibTeX

@misc{alherran2026tadabur,
  title = {Tadabur: A Large-Scale Quran Audio Dataset},
  author = {Faisal Alherran},
  year = {2026},
  abstract = {Despite growing interest in Quranic data research, existing Quran datasets remain limited in both scale and diversity. To address this gap, we present Tadabur, a large-scale Quran audio dataset. Tadabur comprises more than 1,400 hours of recitation audio from over 600 distinct reciters, providing substantial variation in recitation styles, vocal characteristics, and recording conditions. This diversity makes Tadabur a comprehensive and representative resource for Quranic speech research and anal},
  url = {https://huggingface.co/papers/2604.18932},
  keywords = {code available, huggingface daily},
  eprint = {2604.18932},
  archiveprefix = {arXiv},
}
