|
|
--- |
|
|
title: Arabic Transcriber Pro |
|
|
emoji: π£οΈ |
|
|
colorFrom: green |
|
|
colorTo: red |
|
|
sdk: streamlit |
|
|
sdk_version: 1.48.0 |
|
|
app_file: app.py |
|
|
pinned: true |
|
|
--- |
|
|
# ποΈ Arabic Transcriber Pro |
|
|
|
|
|
> **Convert Arabic speech to text with precision β powered by NVIDIA NeMo and Streamlit.** |
|
|
> β¨ Live Demo: [https://huggingface.co/spaces/NightPrince/Arabic-ASR](https://huggingface.co/spaces/NightPrince/Arabic-ASR) |
|
|
> π Portfolio: [https://nightprincey.github.io/Portfolio/](https://nightprincey.github.io/Portfolio/) |
|
|
|
|
|
<div align="center"> |
|
|
<img src="https://img.shields.io/badge/Python-3.8%2B-blue?style=for-the-badge&logo=python" /> |
|
|
<img src="https://img.shields.io/badge/Streamlit-1.28.0+-orange?style=for-the-badge&logo=streamlit" /> |
|
|
<img src="https://img.shields.io/badge/NVIDIA%20NeMo-ASR%20Model-blueviolet?style=for-the-badge&logo=nvidia" /> |
|
|
<img src="https://img.shields.io/badge/Hugging%20Face-Spaces-FF4B4B?style=for-the-badge&logo=huggingface" /> |
|
|
<img src="https://img.shields.io/badge/License-MIT-green?style=for-the-badge" /> |
|
|
</div> |
|
|
|
|
|
<br /> |
|
|
|
|
|
 |
|
|
*Screenshot: Gloomy-elegant UI with real-time transcription and audio visualization* |
|
|
|
|
|
--- |
|
|
|
|
|
## π Overview |
|
|
|
|
|
**Arabic Transcriber Pro** is a sleek, AI-powered web application that converts spoken **Arabic audio** into accurate, readable text using **NVIDIAβs state-of-the-art NeMo ASR model**. Designed with a modern, **gloomy-elegant aesthetic**, this tool delivers fast, reliable transcription for podcasts, interviews, lectures, and more β all within a user-friendly Streamlit interface hosted on **Hugging Face Spaces**. |
|
|
|
|
|
Built by **Yahya Alnwsany** β AI Engineer, NLP Specialist, and Hugging Face Ambassador β this project reflects a deep commitment to advancing Arabic NLP and making AI accessible for real-world applications. |
|
|
|
|
|
π **Live Demo**: [https://huggingface.co/spaces/NightPrince/Arabic-ASR](https://huggingface.co/spaces/NightPrince/Arabic-ASR) |
|
|
π€ **Developer Portfolio**: [https://nightprincey.github.io/Portfolio/](https://nightprincey.github.io/Portfolio/) |
|
|
|
|
|
--- |
|
|
|
|
|
## π§ Features |
|
|
|
|
|
- β
**High-Accuracy Arabic ASR** using `nvidia/stt_ar_fastconformer_hybrid_large_pcd_v1.0` |
|
|
- π§ **Multi-Format Support**: WAV, MP3, OGG, FLAC, M4A |
|
|
- π **Auto Audio Conversion**: Resamples to 16kHz mono WAV for optimal model input |
|
|
- β‘ **Fast Processing** with real-time progress feedback |
|
|
- πΎ **Downloadable Transcripts** in `.txt` format |
|
|
- π **Web-Based UI** with Streamlit β no installation needed |
|
|
- π¨ **Elegant Dark Theme** with RTL-ready Arabic text rendering |
|
|
- π **Audio Metadata Display**: Duration, sample rate, channels |
|
|
- π **Cached Model Loading** for improved performance |
|
|
|
|
|
--- |
|
|
|
|
|
## πΌοΈ UI Design Highlights |
|
|
|
|
|
- **Color Palette**: Deep navy (`#0b132b`, `#1c2541`) with teal (`#5bc0be`) and coral (`#e55934`) accents |
|
|
- **Typography**: Clean, modern sans-serif with RTL support |
|
|
- **Interactive Elements**: Smooth progress bars, hover effects, and responsive layout |
|
|
- **Responsive Cards & Gradient Headers** for professional feel |
|
|
|
|
|
--- |
|
|
|
|
|
## π οΈ Tech Stack |
|
|
|
|
|
| Component | Technology | |
|
|
|------------------|----------| |
|
|
| Frontend | [Streamlit](https://streamlit.io) | |
|
|
| ASR Engine | [NVIDIA NeMo](https://github.com/NVIDIA/NeMo) | |
|
|
| Audio Processing | `pydub`, `soundfile` | |
|
|
| Styling | Custom CSS (Dark Theme, RTL Support) | |
|
|
| Hosting | [Hugging Face Spaces](https://huggingface.co/spaces) | |
|
|
| Deployment | Docker / Streamlit / Git | |
|
|
|
|
|
--- |
|
|
|
|
|
## βΆοΈ Try It Live |
|
|
|
|
|
Visit the live app on Hugging Face: |
|
|
|
|
|
π [https://huggingface.co/spaces/NightPrince/Arabic-ASR](https://huggingface.co/spaces/NightPrince/Arabic-ASR) |
|
|
|
|
|
No setup required β just upload an Arabic audio file and get instant transcription. |
|
|
|
|
|
--- |
|
|
|
|
|
## π¦ Project Structure |
|
|
```python |
|
|
Arabic-transcriber-pro/ |
|
|
β |
|
|
βββ app.py # Main Streamlit application |
|
|
βββ requirements.txt # Python dependencies |
|
|
βββ README.md # This file |
|
|
``` |
|
|
|
|
|
--- |
|
|
|
|
|
## π Supported Audio Formats |
|
|
|
|
|
| Format | Extension | Notes | |
|
|
|-------|----------|-------| |
|
|
| WAV | `.wav` | Native support | |
|
|
| MP3 | `.mp3` | Requires `ffmpeg` | |
|
|
| OGG | `.ogg` | Vorbis/Opus | |
|
|
| FLAC | `.flac` | Lossless | |
|
|
| M4A | `.m4a` | AAC audio | |
|
|
|
|
|
> π All files are automatically converted to **16kHz mono WAV** before transcription. |
|
|
|
|
|
--- |
|
|
|
|
|
## About the Developer |
|
|
|
|
|
### π€ [Yahya Alnwsany](https://nightprincey.github.io/Portfolio/)"# Arabic-Transcriber-Pro" |
|
|
|