OpenAI Set to Launch New Audio Language Model as Foundation for Future Hardware Devices

OpenAI, the company behind the development of ChatGPT's models and products, is gearing up to unveil a new audio language model in the first quarter of 2026. According to The Information, this move is a strategic step toward creating an audio-centric hardware device.

The report, which cites sources familiar with OpenAI's plans—including both current and former employees—suggests that the company is consolidating various teams from engineering, product, and research divisions. This unified initiative is dedicated to advancing audio models, which are purportedly behind their text counterparts in terms of accuracy and speed.

Currently, a majority of ChatGPT users prefer text interactions over voice. With enhanced audio models, OpenAI hopes to pivot user behavior towards voice interfaces, facilitating broader deployment of these models and products into devices like in-car systems.

Looking ahead, OpenAI intends to roll out a series of physical devices, beginning with one focused on audio. Internal discussions have explored numerous potential forms for these devices, including smart speakers and glasses, but the central theme throughout the lineup is prioritizing audio interfaces over screen-based solutions.