OpenAI Enhances GPT-4 Turbo Model for API and ChatGPT, Including Visual Input
Majorly improved GPT-4 Turbo model available now in the API and rolling out in ChatGPT. https://t.co/HMihypFusV
— OpenAI (@OpenAI) April 9, 2024
OpenAI is releasing an improved version of its GPT-4 Turbo large language model employed by ChatGPT and its third-party API. The new model adds updates to its knowledge base and marks the first time OpenAI has incorporated generative AI vision features into the LLM, augmenting it to understand and respond to videos and images.
GPT-4 Turbo
The GPT-4 Turbo with Vision embedded in the new model offers a streamlined way for developers to build apps that can handle both text and images with one API call. The idea is to simplify developer workflows, further enabling the creation of more streamlined and efficient applications. This move mirrors efforts by other tech giants like Google with its Gemini Pro 1.5 model, although such advanced capabilities are currently reserved for developer use.
Previously, OpenAI’s focus within ChatGPT included text, images, and audio analysis. This update extends its capabilities to video analysis, anticipating a future where ChatGPT users could upload video clips for the AI to summarize or highlight key moments, further broadening the application’s utility and appeal. OpenAI pitched the improvement as a potential boon for fashion, video games, and web design. That’s apparent in the video below, where tldraw demonstrates how its new Make Real tool transforms drawings on a digital whiteboard into a website, complete with working code, thanks to GPT-4 Turbo with Vision.
Make Real, built by @tldraw, lets users draw UI on a whiteboard and uses GPT-4 Turbo with Vision to generate a working website powered by real code. pic.twitter.com/RYlbmfeNRZ
— OpenAI Developers (@OpenAIDevs) April 9, 2024
With the update, OpenAI’s model will also have somewhat more up-to-date knowledge to draw on. The information cutoff has advanced from April to December 2023. All of the upgrades available with the API will also become part of ChatGPT. Despite facing stiff competition from newer models like Claude 3 models and Google’s Gemini 1.5, which have shown superior performance in some benchmark tests, OpenAI’s latest update aims to bolster GPT-4 Turbo’s standing with new features that are especially appealing to enterprise customers. The model maintains its 128,000 token context window, allowing for comprehensive analysis equivalent to the content of a 300-page book, catering to a wide range of use cases.
Follow @voicebotaiFollow @erichschwartz
OpenAI Enhances Generative AI Model Fine-Tuning and Custom Model Services
OpenAI Showcases New Generative AI Models and Lower API Prices, Cures GPT-4 ‘Laziness’