Descriptions:
Nerdy Rodent covers two ComfyUI node packs for OmniVoice, a newly released high-quality voice cloning and text-to-speech model, within days of its public availability. The video demonstrates practical integration workflows for each option, notes their differences in features and usability, and includes audio samples generated live using the tool.
The first node pack (Comic NDR nodes) provides a loader and main OmniVoice node with optional reference voice input for cloning. The tutorial covers voice cloning from as little as 3 seconds of reference audio, voice design using comma-separated descriptors (with caveats about a limited vocabulary for accent specification), multilingual output across 600+ languages, and markdown-based pronunciation control for ambiguous words. Recommended settings are shared to avoid common artifact problems, particularly around the shift parameter. The second node pack (from Saganaki22) adds built-in Whisper transcription, four node variants including a multi-speaker mode, auto model downloading, and slightly different parameter ranges, though that convenience comes at the cost of visible transcription text.
The video is structured as a side-by-side comparison with audio demonstrations of both nodes using the same reference voice, making it straightforward to judge quality differences. For anyone building voice-enabled ComfyUI pipelines or exploring local TTS alternatives, this serves as a practical starting point for OmniVoice integration.
📺 Source: Nerdy Rodent · Published April 04, 2026
🏷️ Format: Tutorial Demo







