OpenAI’s GPT-4o recalls the AI assistant in Spike Jonze’s Her

GPT-4o AI model by OpenAI demonstrating real-time text and image interaction
© OpenAI

OpenAI has taken a big step forward in AI by introducing GPT-4o. The ‘o’ in the model name stands for ‘Omni.’ The model has reminded many people of the AI assistant in Spike Jonze’s film Her.

GPT-4o stands out for its ability to perform simultaneous translation. The updated model supports 50 languages. It can also work with text and images together in a single interaction, and it can serve as a voice assistant or as a tracker for meetings and conversations. Moreover, it can be used for free, without a ChatGPT Plus subscription. Paid members get higher usage limits.

“It accepts as input any combination of text, audio, image, and video and generates any combination of text, audio, and image outputs. It can respond to audio inputs in as little as 232 milliseconds, with an average of 320 milliseconds, which is similar to human response time in a conversation. It matches GPT-4 Turbo performance on text in English and code, with significant improvement on text in non-English languages,” stated OpenAI.

To access GPT-4o, visit the ChatGPT website and log in with your free or paid account. Paid members can select GPT-4o from the drop-down menu in the upper left corner.

Free members are assigned GPT-4o automatically, with limited use. Once they reach the usage limit, their account switches to GPT-3.5. Free members with access to GPT-4o can also submit files for analysis, including images, videos, and PDFs, and ask questions about the content.

The new model can follow spoken conversations in real time, interpreting and responding with minimal delay. Like all previous GPT-4 models, it can handle common text LLM tasks, including text summarization. It can understand sound, images, and text at the same speed, and it can remember previous interactions.

Written by
Serra Utkum Ikiz

Serra is passionate about researching and discussing cities, with a particular love for writing on urbanism, politics, and emerging design trends.


© 2025 ParametricArchitecture. All Rights Reserved. By utilizing this website, you are consenting to our User Agreement, Privacy Policy, and Cookie Statement. In compliance with the privacy laws of Turkey and the United States, we recognize and respect your rights. Please be aware that we may receive commissions for products bought through our affiliate links. Unauthorized reproduction, distribution, or transmission of any material from this site is strictly forbidden without prior written permission from ParametricArchitecture.
