OpenAI’s Chatbot Becomes More Human-Like, Now Capable Of Conversing And Visual Recognition
Summary: The organization behind the widely-used ChatGPT service has introduced enhancements enabling users to engage with its AI bot through voice and images.
OpenAI, the AI giant headquartered in San Francisco, has unveiled an updated version of its chatbot, ChatGPT, which now possesses the ability to both converse verbally and process images by users.
The latest iteration of ChatGPT, launched on Monday, introduces two significant enhancements, bringing it closer to human-like interactions. Firstly, it can now engage users with synthetic voices that are said to be more human-sounding compared to other digital assistants. Users have the option to select from five distinct voice choices, including both male and female voices. Secondly, it has gained the ability to provide responses based on images submitted by users. As an example, users can share a picture of their refrigerator's contents, and ChatGPT can suggest recipes based on the available ingredients.
ChatGPT utilizes a vast language model (LLM) that has acquired the ability to produce natural language by studying vast amounts of internet text. While the introduction of voice support might make ChatGPT appear similar to voice assistants like Siri and Alexa, it stands apart due to its LLM technology. This distinction allows ChatGPT to handle a broad spectrum of subjects and assignments without requiring pre-programming. It can spontaneously compose and, now, even vocalize emails, poems, academic papers, and jokes.
To enable voice, users can visit Settings > New Features in the mobile app. As for images, ChatGPT can describe and answer questions about uploaded images, benefiting those with visual impairments or seeking information from visuals.
The latest ChatGPT version is now accessible to ChatGPT Plus and Enterprise subscribers over the next two weeks. Voice functionality is limited to iPhones, iPads, and Android devices, while the image feature is available on both web and mobile platforms. OpenAI has been rapidly unveiling AI tools, including the integration of its DALL-E image generator into ChatGPT, allowing users to request image creation from the chatbot.
Also Read: Amazon Enhances Alexa's Capabilities with Advanced Generative AI Model
ChatGPT, introduced in November last year, has amassed millions of users and inspired competitors like Google Bard and Microsoft Bing. With its latest version, OpenAI takes the lead in the conversational AI field, challenging established technologies such as Alexa and Siri.
Related: Google Ramps Up Competition Against ChatGPT with Gemini AI Software