Spaces:
Running
on
Zero
Running
on
Zero
alex
commited on
Commit
Β·
585777e
1
Parent(s):
a921995
different layout and different examples
Browse files
README.md
CHANGED
|
@@ -1,6 +1,6 @@
|
|
| 1 |
---
|
| 2 |
title: Ovi
|
| 3 |
-
emoji:
|
| 4 |
colorFrom: yellow
|
| 5 |
colorTo: green
|
| 6 |
sdk: gradio
|
|
|
|
| 1 |
---
|
| 2 |
title: Ovi
|
| 3 |
+
emoji: π₯
|
| 4 |
colorFrom: yellow
|
| 5 |
colorTo: green
|
| 6 |
sdk: gradio
|
app.py
CHANGED
|
@@ -240,7 +240,7 @@ with gr.Blocks(css=css) as demo:
|
|
| 240 |
"""
|
| 241 |
<div style="text-align: center;">
|
| 242 |
<p style="font-size:26px; display: inline; margin: 0;">
|
| 243 |
-
<strong
|
| 244 |
</p>
|
| 245 |
<a href="https://huggingface.co/chetwinlow1/Ovi" style="display: inline-block; vertical-align: middle; margin-left: 0.5em;">
|
| 246 |
[model]
|
|
@@ -257,7 +257,7 @@ with gr.Blocks(css=css) as demo:
|
|
| 257 |
with gr.Row():
|
| 258 |
with gr.Column():
|
| 259 |
# Image section
|
| 260 |
-
image = gr.Image(type="filepath", label="Image", height=
|
| 261 |
|
| 262 |
video_text_prompt = gr.Textbox(label="Video Prompt",
|
| 263 |
lines=5,
|
|
@@ -269,7 +269,7 @@ with gr.Blocks(css=css) as demo:
|
|
| 269 |
maximum=100,
|
| 270 |
step=1.0
|
| 271 |
)
|
| 272 |
-
run_btn = gr.Button("
|
| 273 |
|
| 274 |
with gr.Accordion("π¬ Video Generation Options", open=False, visible=False):
|
| 275 |
video_height = gr.Number(minimum=128, maximum=1280, value=512, step=32, label="Video Height")
|
|
@@ -289,35 +289,30 @@ with gr.Blocks(css=css) as demo:
|
|
| 289 |
|
| 290 |
|
| 291 |
with gr.Column():
|
| 292 |
-
output_path = gr.Video(label="Generated Video", height=
|
| 293 |
|
| 294 |
gr.Examples(
|
| 295 |
examples=[
|
| 296 |
|
| 297 |
[
|
| 298 |
-
"
|
| 299 |
-
"example_prompts/pngs/
|
| 300 |
50,
|
| 301 |
],
|
| 302 |
|
| 303 |
[
|
| 304 |
-
"
|
| 305 |
-
"example_prompts/pngs/
|
| 306 |
50,
|
| 307 |
],
|
| 308 |
|
| 309 |
[
|
| 310 |
-
"
|
| 311 |
-
"example_prompts/pngs/
|
| 312 |
-
100,
|
| 313 |
-
],
|
| 314 |
-
|
| 315 |
-
[
|
| 316 |
-
"The scene opens on a dimly lit stage where three men are positioned. On the left, a bald man in a dark suit with a partially visible colorful shirt stands behind a clear acrylic podium, which features a tree logo. He looks towards the center of the stage. In the center, a man wearing a blue and white striped long-sleeved shirt and dark pants actively gestures with both hands as he speaks, looking straight ahead. <S>Circuits choose conquest, not service.<E>, he explains, holding his hands out in front of him. To the right, and slightly behind him, a younger individual in a light-colored, patterned short-sleeved shirt and white shorts stands holding a rolled-up white document or poster. A large wooden cross draped with flowing purple fabric dominates the center-right of the stage, surrounded by several artificial rocks and dark steps. A large screen is visible in the background, slightly out of focus. The stage is bathed in selective lighting.. <AUDCAP>Male voice speaking clearly, consistent with a presentation or sermon, with a slight echo suggesting a large room or stage.<ENDAUDCAP>",
|
| 317 |
-
"example_prompts/pngs/13.png",
|
| 318 |
50,
|
| 319 |
],
|
| 320 |
|
|
|
|
| 321 |
],
|
| 322 |
inputs=[video_text_prompt, image, sample_steps],
|
| 323 |
outputs=[output_path],
|
|
|
|
| 240 |
"""
|
| 241 |
<div style="text-align: center;">
|
| 242 |
<p style="font-size:26px; display: inline; margin: 0;">
|
| 243 |
+
<strong>π₯ Ovi</strong> β Twin Backbone Cross-Modal Fusion for Audio-Video Generation
|
| 244 |
</p>
|
| 245 |
<a href="https://huggingface.co/chetwinlow1/Ovi" style="display: inline-block; vertical-align: middle; margin-left: 0.5em;">
|
| 246 |
[model]
|
|
|
|
| 257 |
with gr.Row():
|
| 258 |
with gr.Column():
|
| 259 |
# Image section
|
| 260 |
+
image = gr.Image(type="filepath", label="Image", height=360)
|
| 261 |
|
| 262 |
video_text_prompt = gr.Textbox(label="Video Prompt",
|
| 263 |
lines=5,
|
|
|
|
| 269 |
maximum=100,
|
| 270 |
step=1.0
|
| 271 |
)
|
| 272 |
+
run_btn = gr.Button("Action π¬", variant="primary")
|
| 273 |
|
| 274 |
with gr.Accordion("π¬ Video Generation Options", open=False, visible=False):
|
| 275 |
video_height = gr.Number(minimum=128, maximum=1280, value=512, step=32, label="Video Height")
|
|
|
|
| 289 |
|
| 290 |
|
| 291 |
with gr.Column():
|
| 292 |
+
output_path = gr.Video(label="Generated Video", height=360)
|
| 293 |
|
| 294 |
gr.Examples(
|
| 295 |
examples=[
|
| 296 |
|
| 297 |
[
|
| 298 |
+
"The video opens with a close-up of a woman with vibrant reddish-orange, shoulder-length hair and heavy dark eye makeup. She is wearing a dark brown leather jacket over a grey hooded top. She looks intently to her right, her mouth slightly agape, and her expression is serious and focused. The background shows a room with light green walls and dark wooden cabinets on the left, and a green plant on the right. She speaks, her voice clear and direct, saying, <S>doing<E>. She then pauses briefly, her gaze unwavering, and continues, <S>And I need you to trust them.<E>. Her mouth remains slightly open, indicating she is either about to speak more or has just finished a sentence, with a look of intense sincerity.. <AUDCAP>Tense, dramatic background music, clear female voice.<ENDAUDCAP>",
|
| 299 |
+
"example_prompts/pngs/8.png",
|
| 300 |
50,
|
| 301 |
],
|
| 302 |
|
| 303 |
[
|
| 304 |
+
"Two women, one with long dark hair and the other with long blonde hair, are illuminated by a blue and purple ambient light, suggesting a nightclub setting. They are seen in a close embrace, sharing a passionate kiss. The blonde-haired woman then slightly pulls away, her right hand gently touching the dark-haired woman's cheek as they exchange soft smiles, looking into each other's eyes. Moments later, they lean back in to kiss again, with the blonde-haired woman's finger delicately touching the dark-haired woman's lower lip. They remain in a tender, intimate embrace, their eyes closed as they share the kiss.. <AUDCAP>Upbeat electronic dance music with a driving beat and synth melodies plays throughout.<ENDAUDCAP>",
|
| 305 |
+
"example_prompts/pngs/80.png",
|
| 306 |
50,
|
| 307 |
],
|
| 308 |
|
| 309 |
[
|
| 310 |
+
"A bearded man wearing large dark sunglasses and a blue patterned cardigan sits in a studio, actively speaking into a large, suspended microphone. He has headphones on and gestures with his hands, displaying rings on his fingers. Behind him, a wall is covered with red, textured sound-dampening foam on the left, and a white banner on the right features the ""CHOICE FM"" logo and various social media handles like ""@ilovechoicefm"" with ""RALEIGH"" below it. The man intently addresses the microphone, articulating, <S>is talent. It's all about authenticity. You gotta be who you really are, especially if you're working<E>. He leans forward slightly as he speaks, maintaining a serious expression behind his sunglasses.. <AUDCAP>Clear male voice speaking into a microphone, a low background hum.<ENDAUDCAP>",
|
| 311 |
+
"example_prompts/pngs/5.png",
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 312 |
50,
|
| 313 |
],
|
| 314 |
|
| 315 |
+
|
| 316 |
],
|
| 317 |
inputs=[video_text_prompt, image, sample_steps],
|
| 318 |
outputs=[output_path],
|