Shipmas Day 1: Autonomous AI Social Media Video Converter App

Coding & Dev Tools7 months ago

Shipmas Day 1: Autonomous AI Social Media Video Converter App

Descriptions:

All About AI walks through building a landscape-to-vertical video converter from scratch using Claude Code (Opus 4.5), YOLO face detection, MediaPipe for lip-movement tracking, and FFmpeg for the final crop and encode. The creator uses Claude Code’s plan mode to scaffold the entire application before execution, letting the model coordinate multiple specialized Python libraries and generate FFmpeg commands dynamically based on detected face coordinates.

The build follows a real debugging cycle: the first output suffered from rapid flickering as the model switched between two detected speakers. The creator describes the fix in natural language to Claude Code — enforce higher-confidence speaker selection and hold the crop until probability shifts — and the second render is noticeably smoother. A Linus Tech Tips interview clip serves as the test source, with speaking-face detection confirmed across 1,900 frames.

The second phase of the project extends the workflow to longer input videos, adding clip selection logic so only the most relevant segments are extracted before conversion. This makes the pipeline suitable for repurposing long-form content into short vertical clips for TikTok or Instagram Reels. Developers and content creators will find it a practical reference for combining computer vision libraries with AI-orchestrated FFmpeg automation under a Claude Code agentic workflow.

📺 Source: All About AI · Published December 05, 2025
🏷️ Format: Hands On Build

1 Item

Channels

No Image Available

All About AI

Tags

Claude Claude Opus 4.5 FFmpeg

Prev

n8n Tutorial for Beginners 2026: How to Build AI Agents

n8n Tutorial for Beginners 2026: How to Build AI Agents

Next

World Models & General Intuition: Khosla’s largest bet since LLMs & OpenAI

World Models & General Intuition: Khosla’s largest bet since LLMs & OpenAI

18 Related Posts

Related Posts

09:39

Coding & Dev Tools

DeepSeek DFlash on Gemma 12B Locally: Up To 5x Faster

24 hours ago

15:45

Coding & Dev Tools

Every AI Agent Demo Stops at Email. I Pointed Mine at the Bills That Cost You Money.

24 hours ago

24:28

Coding & Dev Tools

Fable 5 is WILD…

2 days ago

08:08

Coding & Dev Tools

I Embedded Whisper.cpp Into a Real App

2 days ago

21:09

Coding & Dev Tools

I Built a Real AI Jarvis That Controls My Computer

3 days ago

13:29

Coding & Dev Tools

Control What Your AI Agents Can Do: Archestra + Ollama Hands-On

4 days ago