3 Minutes
Most AI systems still behave like polite walkie talkies. You speak, they wait. They answer, you stop. That rhythm has shaped nearly every chatbot people use today, and it is exactly what Thinking Machines Lab now wants to break.
The startup, founded by former OpenAI CTO Mira Murati, has unveiled what it calls interaction models, a new approach designed to make AI feel less like a delayed message thread and more like an actual conversation. The core idea is simple, but ambitious: instead of treating listening and responding as separate turns, the model handles both at once.
That shift matters. In real human dialogue, people do not communicate in perfectly queued blocks. They pause, react, jump in, adjust their tone, and sometimes interrupt. Thinking Machines is betting that AI should work the same way from the ground up, rather than relying on add-on voice layers that only imitate natural exchange.
More phone call than chatbot
The company’s research preview centers on a model called TML Interaction Small. Technically, it is built for full duplex communication, meaning it can process incoming speech while generating a reply in parallel. In plain English, the system is meant to listen while it talks.
According to Thinking Machines, the model responds in around 0.40 seconds, a speed close to natural human back and forth. If that figure holds up outside the lab, it could mark a meaningful step forward for conversational AI, especially in voice assistants, live support tools, and digital agents that need to feel less robotic and more present.
That is also where the broader industry race gets interesting. OpenAI, Google, and other major players have all pushed toward faster, more fluid voice interaction, but the experience can still feel staged. A model designed with interactivity as a native behavior, rather than something layered on later, could change the texture of those exchanges in a noticeable way.
For now, though, this is still a research story, not a consumer launch. The company is not opening the model to the public yet. A limited research preview is expected in the coming months, with a broader release planned later this year.
So, is this the start of a more genuinely conversational AI era? Maybe. The benchmark claims are promising, and the premise is hard to ignore. But slick latency numbers and real world conversation are not always the same thing. Until people can test how the model handles interruptions, messy speech, hesitation, and unpredictability, some skepticism is healthy.
Still, the idea lands at the right moment. The AI industry has spent years teaching machines to answer. Now the pressure is shifting toward something harder: teaching them how to interact. That is a very different challenge, and Thinking Machines clearly wants to be early to it.
Source: techcrunch
Comments
Armin
Is this even true? 0.40s in lab ok, but real people are messy, accents, interruptions... won't be that simple, right?
mechbyte
Whoa, this could make voice AI feel alive. Game changer? maybe, if it handles overlap, stutters, ppl talking over each other...
Leave a Comment