Nemotron-Personas Collection A collection of multilingual, region-specific synthetic persona datasets that support sovereign AI development across many countries and regions. • 5 items • Updated about 8 hours ago • 17
GLiNER-decoder Collection A joint encoder-decoder GLiNER model for a scalable open-ontology entity recognition • 3 items • Updated about 19 hours ago • 17
X-Talk: On the Underestimated Potential of Modular Speech-to-Speech Dialogue System Paper • 2512.18706 • Published Dec 21, 2025 • 1
view article Article Introducing Waypoint-1: Real-time interactive video diffusion from Overworld +3 10 days ago • 33
AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents Paper • 2512.23343 • Published Dec 29, 2025 • 28
Nemotron Speech Collection Open, state-of-the-art, production‑ready enterprise speech models from the NVIDIA Speech research team for ASR, TTS, Speaker Diarization and S2S • 9 items • Updated about 8 hours ago • 36
MIDAS: Multimodal Interactive Digital-human Synthesis via Real-time Autoregressive Video Generation Paper • 2508.19320 • Published Aug 26, 2025 • 29
Sana Collection ⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 22 items • Updated 10 days ago • 98
SANA-1.5 Collection SANA-1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer • 6 items • Updated Sep 13, 2025 • 10
LongLive: Real-time Interactive Long Video Generation Paper • 2509.22622 • Published Sep 26, 2025 • 186
LongAI Collection Boost AI's Long ability, while keeping Efficient. Models in this collection includes LongVILA, LongVILA-R1, LongLive. • 8 items • Updated Nov 6, 2025 • 2