Descriptions:
Google’s Gemini Omni brings avatar-based video generation directly into the Gemini app, and this hands-on walkthrough by Paul J Lipsky explores what the tool can actually do in practice. The feature lets users set up a personal avatar — captured through a brief face-scanning process similar to what OpenAI’s Sora offered before its shutdown — and then insert themselves into virtually any scene using text prompts or predefined templates.
Lipsky tests a range of templates including “Metallic,” “Meme Me,” and “Indie Pastel” (which he describes as carrying a Wes Anderson aesthetic), and also experiments with fully custom prompts such as placing himself at a restaurant or teaching a college math course. Notable findings include consistent 10-second clip length, roughly sub-2-minute generation times, 720p download quality, and solid performance on targeted edits — changing a single detail like vest color while leaving the rest of the scene intact.
The video also covers Omni’s ability to accept uploaded footage and apply stylistic changes, such as adding an active volcano to a driving clip, as well as Google’s stated intent to ground Omni outputs in real-world search data for location accuracy. Limitations noted include weak support for using a prior frame as a video starting point. For anyone curious about the current state of consumer-facing AI video generation, this gives a practical, example-rich overview of where Gemini Omni stands.
📺 Source: Paul J Lipsky · Published May 19, 2026
🏷️ Format: Tutorial Demo







