
WATCH | OpenAI introduces voice and image prompts to ChatGPT

  • OpenAI is bringing audio and image capabilities to ChatGPT. 
  • This new development has earned mixed reactions online. 


OpenAI is bringing audio and image capabilities to ChatGPT.

The platform, which has long been limited to written prompts, will add the new features to paid versions of the app over the next two weeks, OpenAI announced in a blog post on Monday.

Everyone else will be receiving the features “soon after”.

What can you do with ChatGPT’s update?

Users can have voice conversations with the chatbot, bringing it closer to popular AI assistants such as Apple’s Siri and Amazon’s Alexa.

ChatGPT’s new voice feature can also narrate bedtime stories, settle debates at the dinner table and read users’ text input aloud.

The technology behind it is being used by Spotify for the platform’s podcasters to translate their content into different languages, OpenAI said.

Users can also upload one or more images to the interface and use the drawing tool to highlight specific parts of an image.

The vision feature can be used to “troubleshoot why your grill won’t start, explore the contents of your fridge to plan a meal, or analyze a complex graph for work-related data”.

How have people responded? 

OpenAI’s announcement has invited a range of reactions on X, formerly Twitter. While some users have celebrated the new update, others have raised concerns.

One X user, Christopher CSI (@CSI9ja), wrote: 

As intriguing as this may sound, I certainly hope that the rapid advancements in technology and artificial intelligence do not lead to a situation reminiscent of the Y2K scare or a potential machine uprising. It’s essential for us to responsibly develop and manage these…

In a conversation with WIRED, Trevor Darrell, professor at UC Berkeley and a co-founder of Prompt AI, said that the fear of AI becoming too human-like is described as the “uncanny valley gap”.

While the added functions might make the chatbot feel more natural, some research suggests that complex interfaces that fail to mimic human interaction can feel strange, which might make the product harder to use.

Some users have raised concerns about recent lawsuits alleging that OpenAI violated copyright laws and infringed intellectual property rights, and have advised others not to use ChatGPT.

Others have also brought up how the updates might replace smaller AI startups, software engineers, and even educators in the future.

Malicious use of AI voice generators, in which AI mimics the voice of a real person and calls their relatives to ask for money, is on the rise. A McAfee report suggests that 77% of people targeted by an AI voice scam lost money as a result.

The addition of voice recognition might also make the feature less accessible to people who do not speak with mainstream accents, said Joel Fischer, who studies human-computer interaction at the University of Nottingham in the UK.

Since the image function allows the AI to recognise images, users are concerned that the bot might be able to bypass image verification CAPTCHA tests on websites.

These tests, which require users to prove that they are not bots by transcribing distorted text and recognising images, are designed to limit access.

A recent study that has yet to be peer reviewed shows that AI bots can solve CAPTCHA tests faster and more accurately than humans.

OpenAI has acknowledged that malicious actors could use the voice feature in the new update to commit fraud and impersonate others.

To avoid this, the company said it is “using this technology to power a specific use case”.

That use case is voice chat, created with voice actors the company worked with directly.

The company has also acknowledged the limitations of using images in AI, including image hallucinations where the AI generates false information about the image.

To counter this, OpenAI has taken technical measures to limit ChatGPT’s ability to analyse and make direct statements about people.


