ghostai1 commited on
Commit
9dadfff
·
verified ·
1 Parent(s): 22e9940

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +49 -1
README.md CHANGED
@@ -10,5 +10,53 @@ pinned: false
10
  license: apache-2.0
11
  short_description: cpu image maker ai
12
  ---
 
 
 
 
 
13
 
14
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
10
  license: apache-2.0
11
  short_description: cpu image maker ai
12
  ---
13
+ # 🤖✨ AI Image Generator (CPU)
14
+ [![Hugging Face Space](https://img.shields.io/badge/HuggingFace-Spaces-blue?logo=huggingface)](https://huggingface.co/spaces/your-username/cpu-image-gen)
15
+ [![Gradio UI](https://img.shields.io/badge/Gradio-5.31.0-brightgreen?logo=gradio)]
16
+ [![Model](https://img.shields.io/badge/Model-StableDiffusion-orange)](https://huggingface.co/runwayml/stable-diffusion-v1-5)
17
+ [![License](https://img.shields.io/badge/License-MIT-lightgrey)](LICENSE)
18
 
19
+ ---
20
+
21
+ ## 🚀 Overview
22
+ Leverage **Generative AI** and **latent diffusion** to turn text prompts into stunning images—**entirely on CPU**. No GPUs, no paid APIs—just open-source models and your browser.
23
+
24
+ > **Key AI buzzwords:**
25
+ > • Latent Diffusion • Denoising U-Net • Cross-Attention • Text-to-Image Generation • CLIP-Guided Prompting • Zero-Shot Creativity • Edge Inference • Cloud-Native Deployment
26
+
27
+ ---
28
+
29
+ ## ✨ Features
30
+
31
+ | 🔑 Feature | 🔍 Description |
32
+ |-------------------------------|-------------------------------------------------------------------------|
33
+ | **🌌 Creative Prompts** | Supports any imaginative text input—landscapes, portraits, abstract art |
34
+ | **⚙️ Adjustable Steps** | Control fidelity vs. speed (1–50 inference steps) |
35
+ | **💻 CPU-Only Inference** | Runs on free-tier Spaces (2 vCPU / 16 GB RAM), no CUDA required |
36
+ | **🎛️ Sleek UI** | Gradio Blocks for intuitive prompt entry and result display |
37
+ | **🔄 Stateless** | Each request is independent—no session state or logging overhead |
38
+ | **🔧 Modular** | Swap pipeline ID to any other text-to-image model with minimal change |
39
+
40
+ ---
41
+
42
+ ## 🏗️ Architecture & Workflow
43
+
44
+ 1. **Prompt Encoding**
45
+ The text prompt is tokenized and mapped into the model’s latent space.
46
+ 2. **Denoising Loop**
47
+ A U-Net repeatedly denoises latent noise over *N* steps guided by cross-attention to the prompt.
48
+ 3. **Decoder**
49
+ The final latent is decoded into an RGB image via the model’s VAE.
50
+ 4. **Rendering**
51
+ Gradio streams the generated image back to the browser for instant viewing.
52
+
53
+ ---
54
+
55
+ ## 🛠️ Local Development
56
+
57
+ ```bash
58
+ git clone https://github.com/your-username/cpu-image-gen.git
59
+ cd cpu-image-gen
60
+ python3 -m venv venv && source venv/bin/activate
61
+ pip install -r requirements.txt
62
+ python app.py