SVI2.0 PRO:Automate Your AI Movie: From 1 Image to 5 Scenes with Gemini Prompt Engineering & ComfyUI

Coding & Dev Tools6 months ago

SVI2.0 PRO:Automate Your AI Movie: From 1 Image to 5 Scenes with Gemini Prompt Engineering & ComfyUI

Descriptions:

Veteran AI presents a complete pipeline for generating long-form, multi-scene AI video from a single reference image, combining SVI 2.0 Pro with the Smooth Mix model and Gemini 1.5 Pro for automated prompt generation. The approach goes beyond simple video extension—it produces narrative sequences with distinct scene changes, camera cuts, and environmental transitions rather than looping a single motion.

The key model swap here is from the standard Wan 2.2 Image-to-Video model to Smooth Mix, which is natively accelerated (no separate LoRA needed) and solves two problems from the original: color shifting across extended clips and failure to complete key actions within the allotted frames. The tradeoff is reduced character consistency compared to vanilla Wan 2.2, which the host demonstrates with direct examples.

The Gemini-based prompt engineering system is a standout feature. The host provides a bilingual (English and Chinese) system instruction template that, when loaded into Gemini 1.5 Pro, takes a reference image and outputs five structured shot prompts: a character/narrative analysis, per-shot motion and focus notes, and the final clean prompts ready to paste into the ComfyUI workflow. The first prompt drives base video generation; the remaining four feed into a loop node for sequential extension. A “Motion Latent Count” setting of 1 vs. 2 controls how much continuity is maintained between scenes. The full workflow is hosted on RunningHub for immediate testing.

📺 Source: Veteran AI · Published January 06, 2026
🏷️ Format: Hands On Build

1 Item

Channels

No Image Available

Veteran AI

Tags

ComfyUI runningHub

Prev

Claude Agent SDK [Full Workshop] — Thariq Shihipar, Anthropic

Claude Agent SDK [Full Workshop] — Thariq Shihipar, Anthropic

Next

I Built a New AI System in 3 Hours (and got paid $1650)

I Built a New AI System in 3 Hours (and got paid $1650)

18 Related Posts

Related Posts

09:39

Coding & Dev Tools

DeepSeek DFlash on Gemma 12B Locally: Up To 5x Faster

24 hours ago

15:45

Coding & Dev Tools

Every AI Agent Demo Stops at Email. I Pointed Mine at the Bills That Cost You Money.

24 hours ago

24:28

Coding & Dev Tools

Fable 5 is WILD…

2 days ago

08:08

Coding & Dev Tools

I Embedded Whisper.cpp Into a Real App

2 days ago

21:09

Coding & Dev Tools

I Built a Real AI Jarvis That Controls My Computer

3 days ago

13:29

Coding & Dev Tools

Control What Your AI Agents Can Do: Archestra + Ollama Hands-On

4 days ago