ChatGPT: Enhanced Vision, Audio, and Speech

What's Inside

New Voice and Image Capabilities in ChatGPT
Amazon Plans $4 Billion Investment in AI Startup Anthropic
Getty Images Generative AI
Spotify to mimic podcaster voices, translate into different languages
Sam Altman on AI Regulations

New Voice and Image Capabilities in ChatGPT

https://twitter.com/i/status/1706280618429141022

ChatGPT has gained the ability to see, hear, and speak. Over the next two weeks, Plus users will be granted the privilege of engaging in voice conversations with ChatGPT on both iOS and Android platforms. Additionally, they will also be able to incorporate images into their conversations, regardless of the platform.

Users can now utilize their voices to partake in interactive exchanges with ChatGPT. This newfound capability allows for seamless interactions while on the move, whether it's for a requested bedtime story or to settle a debate around the dinner table.

Furthermore, users can present one or more images to ChatGPT for assistance. Whether troubleshooting a non-starting grill, perusing the contents of their fridge to plan a meal, or dissecting a complex graph for work-related data, ChatGPT is poised to lend its expertise.

Amazon Plans $4 Billion Investment in AI Startup Anthropic

Amazon & Anthropic

As part of the deal with Amazon, Anthropic will use AWS as a primary cloud provider for mission-critical workloads, including safety research and future foundation model development, the e-commerce group said.

The deal with Anthropic gives Amazon, which is increasingly flexing its muscles around AI, a front-row seat at one of the fastest-growing AI startups.

“Customers are quite excited about Amazon Bedrock, AWS’s new managed service that enables companies to use various foundation models to build generative AI applications on top of, as well as AWS Trainium, AWS’s AI training chip, and our collaboration with Anthropic should help customers get even more value from these two capabilities.”

Getty Images uses GenAI, learns from its own licensed pictures

Getty Images is partnering with Nvidia to launch Generative AI by Getty Images, a new tool that lets people create images using Getty’s library of licensed photos.

Getty said any photos created with the tool will not be included in the Getty Images and iStock content libraries. The Company said it will pay creators if it uses their AI-generated image to train the current and future versions of the model.

Spotify to mimic podcaster voices, translate into different languages

That’s the premise behind Spotify’s new AI-powered voice translation feature, which reproduces podcasts in other languages using the podcaster’s voice.

The AI company is making a few announcements this morning, including the launch of a tool that can create “human-like audio from just text and a few seconds of sample speech.” OpenAI says it’s intentionally limiting how widely this tool will be available due to concerns around safety and privacy.

OpenAI CEO Sam Altman says it's possible to make mistakes with regulation, but we shouldn't be afraid of it!

"Regulation has been not a pure good, but it's been good in a lot of ways. I don't want to have to make an opinion about every time I step on an airplane how safe it's going to be, but I trust that they're pretty safe and I think regulation has been a positive good there," he said.

"It is possible to get regulation wrong, but I don't think we sit around and fear it. In fact we think some version of it is important."

For sharing any interesting details, please reach out to us through a direct message on Twitter: Saran

ChatGPT's New Abilities: Seeing, Listening, and Talking

What's Inside

New Voice and Image Capabilities in ChatGPT

Amazon Plans $4 Billion Investment in AI Startup Anthropic

Getty Images uses GenAI, learns from its own licensed pictures

Spotify to mimic podcaster voices, translate into different languages

OpenAI CEO Sam Altman says it's possible to make mistakes with regulation, but we shouldn't be afraid of it!

Comments

More from this blog

Unveiling Innovations at CES 2024: From AI to EVs and Beyond

SG Tech Scene : AI Design Studio, RobinAI, Smart Contracts

Advanced Robots by Google DeepMind for Performing Daily Activities

Singapore's Tech Scene: Crypto Scams, Deepfakes, Jailbroken AI, and 3D Mapping Innovations

Copyright Infringement: New York Times Takes Legal Action Against Microsoft and ChatGPT

Command Palette

What's Inside

New Voice and Image Capabilities in ChatGPT

Amazon Plans $4 Billion Investment in AI Startup Anthropic

Getty Images uses GenAI, learns from its own licensed pictures

Spotify to mimic podcaster voices, translate into different languages

OpenAI CEO Sam Altman says it's possible to make mistakes with regulation, but we shouldn't be afraid of it!

Comments

More from this blog