Full body waifus, AI dreams, realtime AI music, open-source Gemini Omni: AI NEWS

Business & Strategy2 months ago

Full body waifus, AI dreams, realtime AI music, open-source Gemini Omni: AI NEWS

Descriptions:

AI Search delivers a packed weekly roundup covering more than a dozen model and tool releases across video, image, 3D, audio, and language domains. The top items include ByteDance’s Bernini, an open-source video editing model that accepts text, image, and video references for flexible scene manipulation — compared to an “open-source Gemini Omni” — available now on Hugging Face at roughly 84 GB per branch. NVIDIA’s Deja View is a 3D scene reconstruction model at just 117 million parameters that matches the performance of Depth Anything Three (approximately 10 times larger) by reusing the same transformer block in repeated passes rather than stacking layers, and is already open-sourced.

Google’s Gemma 4 12B earns significant coverage: a new encoder-free multimodal architecture that accepts text, images, and audio directly without a separate encoder step. It runs offline on 16 GB of VRAM, sits between the 4B phone-scale variant and the 26B mixture-of-experts model in the Gemma 4 family, matches the 24B variant on several benchmarks, and is released under Apache 2.0 for commercial use. Alibaba’s Qwen 3.7 Plus, a new frontier open model from Minimax, Baidu’s video model with natively baked-in audio, and Alibaba’s real-time streaming video generator round out the model news.

Additional coverage includes Google’s open-source real-time music generator, two new frontier image models, ChatGPT’s new “dream” feature, NVIDIA’s open-source world model, and humanoid robot demos. The video functions as a fast-scan orientation to a particularly active week in open-source and multimodal AI development.

📺 Source: AI Search · Published June 07, 2026
🏷️ Format: Roundup

1 Item

Channels

No Image Available

AI Search

5 Items

Companies

No Image Available

Baidu

No Image Available

Google

No Image Available

Microsoft

No Image Available

Nvidia

No Image Available

OpenAI

Tags

Prev

Anthropic Files $965B IPO, Trump Signs AI Executive Order, and ChatGPT Crosses 1B Users | EP #262

Next

Master Ideogram 4 Layouts: Pro Poster Design with Visual Prompt Builder

18 Related Posts

Related Posts

24:48

Business & Strategy

5 AI Engineering Trends That Non Engineers Should Know About

24 hours ago

32:10

Business & Strategy

Alexandr Wang: “This is a Once-in-a-Civilization Opportunity”

24 hours ago

54:58

Business & Strategy

I’m disappointed

24 hours ago

52:51

Business & Strategy

OpenAI JUST revealed the truth about it’s “Rogue Agent”

24 hours ago

05:00

Business & Strategy

Kimi K3 Just Broke The Economics Of AI

24 hours ago

10:15

Business & Strategy

Ilya Sutskever Just Found Something New In AI

24 hours ago