---
title: Arabic Transcriber Pro
emoji: π£οΈ
colorFrom: green
colorTo: red
sdk: streamlit
sdk_version: 1.48.0
app_file: app.py
pinned: true
---
# ποΈ Arabic Transcriber Pro
> **Convert Arabic speech to text with precision β powered by NVIDIA NeMo and Streamlit.**
> β¨ Live Demo: [https://huggingface.co/spaces/NightPrince/Arabic-ASR](https://huggingface.co/spaces/NightPrince/Arabic-ASR)
> π Portfolio: [https://nightprincey.github.io/Portfolio/](https://nightprincey.github.io/Portfolio/)

*Screenshot: Gloomy-elegant UI with real-time transcription and audio visualization*
---
## π Overview
**Arabic Transcriber Pro** is a sleek, AI-powered web application that converts spoken **Arabic audio** into accurate, readable text using **NVIDIAβs state-of-the-art NeMo ASR model**. Designed with a modern, **gloomy-elegant aesthetic**, this tool delivers fast, reliable transcription for podcasts, interviews, lectures, and more β all within a user-friendly Streamlit interface hosted on **Hugging Face Spaces**.
Built by **Yahya Alnwsany** β AI Engineer, NLP Specialist, and Hugging Face Ambassador β this project reflects a deep commitment to advancing Arabic NLP and making AI accessible for real-world applications.
π **Live Demo**: [https://huggingface.co/spaces/NightPrince/Arabic-ASR](https://huggingface.co/spaces/NightPrince/Arabic-ASR)
π€ **Developer Portfolio**: [https://nightprincey.github.io/Portfolio/](https://nightprincey.github.io/Portfolio/)
---
## π§ Features
- β
**High-Accuracy Arabic ASR** using `nvidia/stt_ar_fastconformer_hybrid_large_pcd_v1.0`
- π§ **Multi-Format Support**: WAV, MP3, OGG, FLAC, M4A
- π **Auto Audio Conversion**: Resamples to 16kHz mono WAV for optimal model input
- β‘ **Fast Processing** with real-time progress feedback
- πΎ **Downloadable Transcripts** in `.txt` format
- π **Web-Based UI** with Streamlit β no installation needed
- π¨ **Elegant Dark Theme** with RTL-ready Arabic text rendering
- π **Audio Metadata Display**: Duration, sample rate, channels
- π **Cached Model Loading** for improved performance
---
## πΌοΈ UI Design Highlights
- **Color Palette**: Deep navy (`#0b132b`, `#1c2541`) with teal (`#5bc0be`) and coral (`#e55934`) accents
- **Typography**: Clean, modern sans-serif with RTL support
- **Interactive Elements**: Smooth progress bars, hover effects, and responsive layout
- **Responsive Cards & Gradient Headers** for professional feel
---
## π οΈ Tech Stack
| Component | Technology |
|------------------|----------|
| Frontend | [Streamlit](https://streamlit.io) |
| ASR Engine | [NVIDIA NeMo](https://github.com/NVIDIA/NeMo) |
| Audio Processing | `pydub`, `soundfile` |
| Styling | Custom CSS (Dark Theme, RTL Support) |
| Hosting | [Hugging Face Spaces](https://huggingface.co/spaces) |
| Deployment | Docker / Streamlit / Git |
---
## βΆοΈ Try It Live
Visit the live app on Hugging Face:
π [https://huggingface.co/spaces/NightPrince/Arabic-ASR](https://huggingface.co/spaces/NightPrince/Arabic-ASR)
No setup required β just upload an Arabic audio file and get instant transcription.
---
## π¦ Project Structure
```python
Arabic-transcriber-pro/
β
βββ app.py # Main Streamlit application
βββ requirements.txt # Python dependencies
βββ README.md # This file
```
---
## π Supported Audio Formats
| Format | Extension | Notes |
|-------|----------|-------|
| WAV | `.wav` | Native support |
| MP3 | `.mp3` | Requires `ffmpeg` |
| OGG | `.ogg` | Vorbis/Opus |
| FLAC | `.flac` | Lossless |
| M4A | `.m4a` | AAC audio |
> π All files are automatically converted to **16kHz mono WAV** before transcription.
---
## About the Developer
### π€ [Yahya Alnwsany](https://nightprincey.github.io/Portfolio/)"# Arabic-Transcriber-Pro"