Home Architecture News OpenAI’s GPT-4o reminds the AI assistant in Spike Jonze’s Her
Architecture News

OpenAI’s GPT-4o reminds the AI assistant in Spike Jonze’s Her

Share
OpenAI's GPT-4o reminds the AI assistant in Spike Jonze's Her
Share
GPT-4o AI model by OpenAI demonstrating real-time text and image interaction
© OpenAI

OpenAI has taken a big step forward in AI by introducing GPT-4o. The ‘o’ in the model name stands for ‘Omni.’ These developments remind many people of the AI assistant in Spike Jonze’s movie Her.

GPT-4o stands out with its ability to perform simultaneous translation. The new updated version can support 50 different languages. Additionally, GPT-4o can create instant interaction between text and image. The new model can serve as a voice assistant and a meeting and conversation tracker. Moreover, it can be used free without requiring a ChatGPT Plus subscription. Paid members can use it with more credits.

“It accepts as input any combination of text, audio, image, and video and generates any combination of text, audio, and image outputs. It can respond to audio inputs in as little as 232 milliseconds, with an average of 320 milliseconds, which is similar to human response time(opens in a new window) in a conversation. It matches GPT-4 Turbo performance on text in English and code, with significant improvement on text in non-English languages.” stated OpenAI.

To access GPT-4o, simply visit the website and log in with your free or paid membership. Paid members can select GPT-4o from the drop-down menu in the upper left corner.

Free members will have GPT-4o automatically assigned to their account with limited use. Your account will direct you to GPT-3.5 when you reach the usage limit. Additionally, free members with access to GPT-4o can now submit files for analysis. This includes images, videos, and PDFs, and you can ask questions about the content.

The new model can understand real-time spoken conversations and interpret and respond without delay. Like all previous GPT-4 models, the new model can handle common text LLM tasks, including text summarization. The model can understand sound, images, and text at the same speed. GPT-4o can remember previous interactions.

Share
Written by
Serra Utkum Ikiz

Serra is passionate about researching and discussing cities, with a particular love for writing on urbanism, politics, and emerging design trends.

Leave a comment

Leave a Reply

Related Articles
​Japan Constructs World's First 3D-Printed Railway Station in Just Six Hours​
Architecture News

​Japan Constructs World’s First 3D-Printed Railway Station in Just Six Hours​

In a quiet corner of Wakayama Prefecture, a revolution in infrastructure quietly...

Expo 2025 Opens in Japan with Over 160 Countries Participating
Architecture News

Expo 2025 Opens in Japan with Over 160 Countries Participating

​Expo 2025 has officially opened in Osaka, Japan, transforming Yumeshima Island into...

​Grand Egyptian Museum Opens for Trial Visits Ahead of Grand Opening​
Architecture News

​Grand Egyptian Museum Opens for Trial Visits Ahead of Grand Opening​

After years of anticipation, the Grand Egyptian Museum (GEM), located beside the...

Foster and Partners' Rise Tower in Saudi Arabia Set to be the World's Tallest Tower Under Way
Architecture News

Foster and Partners’ Rise Tower in Saudi Arabia Set to be the World’s Tallest Tower Under Way

Recently, Saudi Arabia’s Public Investment Fund (PIF) put out a call inviting...

Subscribe to all newsletters

Join our community to receive the latest insights and updates!

© 2025 ParametricArchitecture. All Rights Reserved. By utilizing this website, you are consenting to our User Agreement, Privacy Policy, and Cookie Statement. In compliance with the privacy laws of Turkey and the United States, we recognize and respect your rights. Please be aware that we may receive commissions for products bought through our affiliate links. Unauthorized reproduction, distribution, or transmission of any material from this site is strictly forbidden without prior written permission from ParametricArchitecture.

ad blocker mark

AdBlocker Detected!

Help Us Keep Our Content Free

Your support helps us continue delivering high-quality resources at no cost to you.

We’ve detected that you are using an AdBlocker. We completely understand the need for a clean browsing experience, but ads help us keep this platform running and continue providing you with high-quality content at no cost.

If you enjoy our content, please consider disabling your AdBlocker or adding our site to your whitelist. Your support allows us to create more valuable articles, tutorials, and resources for you.

Thank you for being a part of our community!