Thinking Machines Lab has unveiled “interaction models,” a new kind of multimodal AI built to communicate in real time. Unlike systems that wait for separate inputs, these models process audio and visuals together, enabling continuous responses and sharply lower latency. The goal: make human AI collaboration feel more natural, especially for time sensitive enterprise and industrial use cases.
NVIDIA has unveiled Nemotron-3 Nano Omni, a unified multimodal AI designed to connect sight, sound, and language within a single system. The company says it enables faster, more intuitive interactions and a more human-like grasp of real-world context in real time, signaling a shift from separate capabilities to integrated understanding.
Your news, in seconds
Get the Beige app — every story in 60 words, updated hourly. Free on iOS & Android.
Swipe through stories, personalise your feed, and save articles for later — all on the app.