Self-evolving AI, robot fights, new GPT voice, new local image model, Gemma upgrade: AI NEWS

Self-evolving AI, robot fights, new GPT voice, new local image model, Gemma upgrade: AI NEWS

More

Descriptions:

AI Search rounds up a dense week of AI releases, covering new models and research across 3D reconstruction, image generation, voice, robotics, and open-source infrastructure. The video moves quickly through each item with benchmark comparisons, example outputs, and links to code and papers.

Standout announcements include RecGen, a 3D object reconstruction model trained on 200,000 synthetic assets that outperforms SAM-3D on occluded scene reconstruction and ships with open-source code; HyDream-01 Image by Vivago AI, an open-source image model generating 2K resolution output with strong text rendering and multi-reference composition in a single end-to-end architecture without a VAE; and OpenAI’s updated GPT-4o Realtime v2 voice model, which supports real-time translation and simultaneous multilingual dialogue, with per-model pricing now available via API. The video also covers Genesis AI’s GR-2.6.5 robotics foundation model, an AMD-trained model that outperforms Nvidia-trained equivalents above its weight class, and a new open-source video generator from a new lab producing 2K video that beats upscaler-based methods.

The format is rapid-fire but grounds each item in quantitative results where available, including pose estimation scores for RecGen and audio benchmark comparisons for GPT Realtime v2. A practical weekly reference for anyone tracking the leading edge of model releases across modalities.


📺 Source: AI Search · Published May 10, 2026
🏷️ Format: News Analysis

1 Item

Channels

2 Items

Companies