Shipmas Day 12: AI Music Video Generator App

Shipmas Day 12: AI Music Video Generator App

More

Descriptions:

The All About AI Shipmas Day 12 episode walks through building a fully automated AI music video generator, chaining four distinct services into a single Next.js web application. The pipeline starts with Nana Banana Pro for character image generation, uses Gemini to write lyrics, hands the lyrics to Suno API for music production, transcribes the resulting MP3 with Whisper to locate the chorus, clips 10 seconds with ffmpeg, and finally feeds the image and audio into ByteDance’s OmniHuman model to animate the still photo into a singing music video.

The build is done live in Cursor using a plan-first approach — the AI drafts a 12-step implementation plan before writing any code — which gives viewers a clear view of how a complex multi-API application gets structured. Debugging is shown in real time, including fixing a Fal client initialization error and iterating on the Streamlit-style UI layout.

The finished demo, tested with a Taylor Swift piano ballad prompt, successfully detects and clips the chorus section and renders a synchronized music video available for in-browser playback and download. A follow-up feature added during the video lets users bring their own images and audio clips rather than generating everything from scratch. The result is a replicable template for creators who want to produce AI music videos without deep technical expertise.


📺 Source: All About AI · Published December 16, 2025
🏷️ Format: Hands On Build

1 Item

Channels