• AI SAMOSA
  • Posts
  • ChatGPT can See, Hear, and Speak - An Exciting Week In The AI Space

ChatGPT can See, Hear, and Speak - An Exciting Week In The AI Space

Howdy, Awesome People!

Welcome back to AI Samosa, your favorite AI Newsletter!

If this is your first time, having you here is great! Be sure to sign up to stay updated with the latest happenings in AI.

AI Bytes 📝 

1. Dall-E 3 Takes The Lead

DallE 3 is OpenAI's latest AI art generation model, offering enhanced prompt understanding, creativity promotion by avoiding copying living artists, and integration with ChatGPT for user interaction. Users retain full commercial rights to their creations. However, it also raises concerns about simplifying creativity. DallE 3 is expected to drive innovation in AI art generation, highlighting AI as a creative tool, not a replacement for human creativity. Read more…

2. ChatGPT can see, hear, and speak

OpenAI is introducing voice and image capabilities in ChatGPT, allowing users to engage in voice conversations and share images with the AI. The voice feature uses text-to-speech models created with professional voice actors and Whisper speech recognition. Image recognition is powered by multimodal GPT models. OpenAI is deploying these capabilities gradually to ensure safe and responsible usage, addressing challenges like impersonation and privacy concerns. They aim to make these features useful and safe for everyday tasks but caution users about the model's limitations, especially with non-English languages. Access will be expanded beyond Plus and Enterprise users in the future. Read more

3. ChatGPT has browsing capabilities, again!

OpenAI has introduced a web browsing feature for ChatGPT, expanding its access to information beyond its previous knowledge cutoff in September 2021. Initially available to Plus and Enterprise users, this feature allows users to browse the web by selecting "Browse with Bing" under GPT-4. OpenAI has also recently updated ChatGPT to enable voice conversations and interactions using images. Previously, they had tested a feature that allowed users to access information through Bing but later disabled it due to concerns about bypassing paywalls. ChatGPT has experienced rapid growth with 100 million monthly active users and has attracted investor interest, including discussions about a potential sale of shares at a higher valuation. Read more…

4. Amazon to invest $4 Billion in Anthropic

Amazon is investing up to $4 billion in Anthropic to collaborate on developing reliable and high-performing AI models. Amazon Web Services (AWS) will become Anthropic's primary cloud provider, offering access to AWS Trainium and Inferentia chips for model development. They will also expand support for Amazon Bedrock, allowing secure customization and fine-tuning of AI models. Amazon developers will use Anthropic's models to enhance applications. Various organizations are already using Anthropic models for tasks like legal search, financial analysis, and travel recommendations. Both Amazon and Anthropic are committed to responsible AI development and will conduct pre-deployment tests to manage AI risks. Amazon's investment will support AI research and safety efforts at Anthropic. Read more…

5. New AI Generator Uses Licensed Images

Getty Images has partnered with Nvidia to launch "Generative AI by Getty Images," an AI tool that allows users to create images using Getty's vast library of licensed photos. This tool is trained on Getty's extensive image collection, offering users legal protection for commercially published images. It excels at generating realistic human figures and is based on actual photos from the library. However, it has limitations when it comes to generating images involving real people. Generated images won't be added to Getty Images' content libraries. Creators will receive compensation if Getty uses their AI-generated image for training. The tool is available on the Getty Images website, priced separately from standard subscriptions, and grants users perpetual, worldwide, and unlimited image rights. Getty plans to expand the tool's capabilities to allow users to train the model with their own data and generate images in their brand style in the future. Read more…

This week’s how-to guide 🤯 

Generate Images without Prompts in Midjourney

Do you know what kind of images you want to create but don't know how to give a prompt? Midjourney's 'describe' feature is your saviour; simply provide reference images and Midjourney will generate a prompt; you may choose between four different prompts to generate an image and use features like custom zoom and custom vary to change elements in your generated image. (but more on that later!)

AI Art 🎨 

October 2nd, commemorates the birth anniversary of Mahatma Gandhi, the iconic leader of India's nonviolent struggle for independence, and serves as a global reminder of his enduring message of peace, unity, and social justice.

Narrative Nook📔

AI-generated story

Arctic Odyssey

Captain Archibald ventured to the Antarctic, driven by a vision of endless white. The frozen expanse tested his crew's unity. Amidst the icy crucible, Archibald's resolve burned brighter. He charted uncharted waters, facing krakens of doubt and storms of fear. One eve, they glimpsed a spectral light dancing upon icebergs. Archibald knew it as a beacon, guiding them to triumph. The frozen wasteland yielded its secrets, but the captain's unyielding spirit was the true treasure.

Mind Teasers: Trivia Time! 🧠 

Among these cloud images, can you spot the sneaky AI-generated one? Time to put your cloud-spotting skills to the test!

A

B

Last week’s trivia answer: A - AI-generated and B - Real photograph

Tool Spotlight 🛠️ 

ChattyDocs - Your Smart AI Assistant

ChattyDocs is a user-friendly platform that helps you interact with your documents and data using AI. You can easily upload files from your device or link to websites. Once your data is in, you can customize how the AI responds, making it more creative or focused. ChattyDocs also keeps track of the sources, so you know where the information is coming from. You can use it on your computer, phone, or even through Telegram. It's like having a smart assistant that makes your documents more accessible and easier to work with.

Editor's Note ✍️ 

This newsletter comes straight from the desk of John V. Jayakumar, CEO of Superposition Technologies.

My life's journey is a testament to the transformative power of education and technology—my dad broke free from poverty through education, and I achieved millionaire status thanks to technology. My takeaway for you Wholeheartedly embrace technology; it's a game-changer for you and everyone in your orbit.

Note: The views expressed are solely those of the editor.

Thanks to Beehiiv for hosting this newsletter. Click here to create your own newsletter on Beehiiv.

We Want to Hear From You!

Subscribe and share your thoughts with us! Leave Feedback

Welcome, and Thanks for being part of our community.