top of page

latest stuff in ai, directly in your inbox. 🤗

Thanks for submitting!

A Sentient (Almost)?

In a groundbreaking announcement, OpenAI revealed that ChatGPT, already a leading figure in the generative AI assistant landscape, will soon be conversational not just through text but also via voice. The enhancement comes amidst an escalating competition among tech giants to dominate the generative AI space.


More Than Just Text


ChatGPT, which began as a text-based assistant, is now more versatile and interactive. The new voice-enabled features allow users to ask questions and request stories verbally. ChatGPT will respond in kind, offering spoken answers and narratives. Alongside this, users can also leverage image-based searches, uploading pictures for ChatGPT to identify or explain.


How It Works


Powering the voice feature is a state-of-the-art text-to-speech model capable of producing human-like voices from text and sampled speech. In a unique collaboration, OpenAI enlisted established voice actors to create five distinct voices for the assistant. OpenAI's proprietary Whisper speech recognition system is utilized to convert verbal prompts into text.


Strategic Partnerships


In an interesting turn, Spotify has teamed up with OpenAI for the launch. The music streaming platform is introducing a feature allowing podcasters to translate their shows into Spanish, French, or German while preserving their original voices. The technology will initially be available to specific podcasters, steering clear of any undue scrutiny or ethical concerns.


Security and Ethical Considerations


OpenAI remains cautious about the powerful capabilities it is introducing. While the voice technology paves the way for new creative and accessibility applications, it also brings forth risks, including the potential for malicious impersonations.


Limited Rollout


These new features will be first available to paying Plus and Enterprise subscribers. To access the voice capabilities, users have to opt-in via the app’s settings. Initially, voice features will be confined to the ChatGPT Android and iOS apps, while image search will be universally available.


The Competitive Landscape


The enhancements in ChatGPT come on the same day as Amazon’s $4 billion investment in AI startup Anthropic, signaling heated competition in the generative AI domain. Tech giants like Google, Meta, and Microsoft are also actively investing and developing their own AI solutions.


Closing Thoughts


With these new features, ChatGPT is tiptoeing the line between an advanced utility and something that feels almost sentient. As the realm of generative AI continues to evolve, ChatGPT’s new capabilities not only offer a look into what the future holds for AI but also invite important ethical and philosophical questions that society will soon have to answer.

The advent of voice-enabled ChatGPT serves as both an achievement in AI technology and a prompt for us to consider the responsible usage and limits of increasingly sentient AI systems. Time will tell how these developments reshape our interactions with technology and, by extension, with each other.

3 views0 comments

Comments

Couldn’t Load Comments
It looks like there was a technical problem. Try reconnecting or refreshing the page.

TOP AI TOOLS

snapy.ai

Snapy allows you to edit your videos with the power of ai. Save at least 30 minutes of editing time for a typical 5-10 minute long video.

- Trim silent parts of your videos
- Make your content more interesting for your audience
- Focus on making more quality content, we will take care of the editing

Landing AI

A platform to create and deploy custom computer vision projects.

SupaRes

An image enhancement platform.

MemeMorph

A tool for face-morphing and memes.

SuperAGI

SuperAGI is an open-source platform providing infrastructure to build autonomous AI agents.

FitForge

A tool to create personalized fitness plans.

FGenEds

A tool to summarize lectures and educational materials.

Shortwave

A platform for emails productivity.

Publer

An all-in-one social media management tool.

Typeface

A tool to generate personalized content.

Addy AI

A Google Chrome Exntesion as an email assistant.

Notability

A telegrambot to organize notes in Notion.

bottom of page