OpenAI, the company behind ChatGPT, is set to announce a new audio language model in the first quarter of 2026. According to a report by The Information, the model is intended as a step toward a physical device centered on audio.
The Information cites multiple sources, including current and former OpenAI employees, who say the company is combining its engineering, product, and research teams to improve its audio models. According to company researchers, these models have lagged behind text models in both accuracy and speed.
Currently, most ChatGPT users prefer text to the voice interface. OpenAI aims to shift that preference by significantly improving its audio models, which could also pave the way for their use in a range of devices, such as automobiles.
OpenAI is planning a series of physical devices, starting with one focused on audio. Internal discussions have covered various potential devices, such as smart speakers and eyeglasses, but the main emphasis is on audio interfaces rather than screen-based ones.