You Can Use ChatGPT With Your Voice Now

By Admin On Sep 25, 2023

LAHORE MIRROR — OpenAI, the artificial intelligence company that introduced ChatGPT to the world last November, is moving steadily closer to feature parity with other advanced AI assistants.

This step comes through an upgrade that adds voice and image recognition to the chatbot.

The upgraded ChatGPT mobile apps for iOS and Android were unveiled recently. Users can now speak their queries to the chatbot and hear its responses in a synthesized voice. The latest version also adds image recognition: users can upload or capture an image within the app, and ChatGPT will describe it in detail and supply additional context, much like Google Lens.

These enhancements reflect OpenAI’s approach of treating its existing artificial intelligence models as evolving products that receive frequent, iterative improvements. ChatGPT, which has attracted significant attention and success, is gradually becoming a consumer-oriented application akin to Apple’s Siri or Amazon’s Alexa.

The ability to engage in spoken conversations with ChatGPT relies on the integration of two separate models. First, Whisper, OpenAI’s existing speech-to-text model, converts spoken language into text, which is then processed by the chatbot. Second, a newly introduced text-to-speech model transforms ChatGPT’s responses into spoken language.
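For readers who want to picture the flow, the three stages described above (speech to text, chatbot, text to speech) can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not OpenAI's implementation: the `VoicePipeline` class and the `fake_stt`, `fake_chat` and `fake_tts` functions are hypothetical stand-ins, and "audio" is represented as plain UTF-8 bytes so the sketch runs without any model.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class VoicePipeline:
    """Chains the three stages from the article. All names are hypothetical."""

    speech_to_text: Callable[[bytes], str]  # stands in for a Whisper-like model
    chat: Callable[[str], str]              # stands in for the chatbot
    text_to_speech: Callable[[str], bytes]  # stands in for the text-to-speech model

    def respond(self, audio_in: bytes) -> bytes:
        # Stage 1: spoken query -> text
        text_query = self.speech_to_text(audio_in)
        # Stage 2: text query -> text reply
        text_reply = self.chat(text_query)
        # Stage 3: text reply -> spoken reply
        return self.text_to_speech(text_reply)


# Toy stand-ins (assumptions for illustration only): "audio" is UTF-8 bytes.
def fake_stt(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_chat(prompt: str) -> str:
    return f"You said: {prompt}"


def fake_tts(text: str) -> bytes:
    return text.encode("utf-8")


pipeline = VoicePipeline(fake_stt, fake_chat, fake_tts)
reply_audio = pipeline.respond(b"hello")
print(reply_audio)  # b'You said: hello'
```

Because each stage is just a function passed in, any one of them could be swapped for a real model without touching the other two, which mirrors the article's point that the voice feature combines separate models.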

During a company demonstration, product manager Joanne Jang showcased ChatGPT’s range of synthetic voices, which were created by training the text-to-speech model on recordings from hired actors. OpenAI is also considering letting users create their own custom voices in the future, with priority given to voices that are pleasant to listen to every day.