Deep Research di OpenAI: l’IA che ridefinisce i limiti dell’AI

OpenAI ha recentemente annunciato che il suo agente AI, Deep Research, ha raggiunto un’accuratezza del 26,6% nel benchmark “Humanity’s Last Exam”, superando modelli precedenti come ChatGPT o3-mini e DeepSeek R1.

“Humanity’s Last Exam” è un benchmark progettato per valutare le capacità delle intelligenze artificiali attraverso una serie di domande complesse che spaziano dalla matematica alle scienze umane e naturali. Inizialmente, i modelli AI raggiungevano un’accuratezza intorno al 9%, ma Deep Research ha significativamente superato queste prestazioni.

Deep Research è un agente AI avanzato che combina capacità di ricerca sul web con potenti strumenti di analisi, permettendo una comprensione e una sintesi delle informazioni più approfondite rispetto ai modelli precedenti. Questa integrazione di ricerca e analisi consente a Deep Research di fornire risposte più accurate e dettagliate.

Nonostante questi progressi, è importante notare che Deep Research ha accesso a funzionalità di ricerca che altri modelli non possiedono, il che potrebbe influenzare i confronti diretti. Tuttavia, il rapido miglioramento delle prestazioni evidenzia l’accelerazione nello sviluppo delle capacità delle intelligenze artificiali.

Questo risultato solleva anche questioni riguardanti la progettazione di benchmark per valutare le AI. Se i nuovi test vengono superati troppo rapidamente, potrebbe essere necessario rivedere gli standard di valutazione per mantenere una misura accurata dei progressi nel campo dell’intelligenza artificiale.

In conclusione, l’avanzamento di Deep Research rappresenta un passo significativo nel campo dell’intelligenza artificiale, dimostrando la capacità dei modelli AI di affrontare compiti sempre più complessi e di migliorare rapidamente le proprie prestazioni.

Cookie	Durata	Descrizione
cookielawinfo-checbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
CookieLawInfoConsent	1 year	Records the default button state of the corresponding category & the status of CCPA. It works only in coordination with the primary cookie.
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Cookie	Durata	Descrizione
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_gat_gtag_UA_198202384_1	1 minute	Set by Google to distinguish users.
_gid	1 day	Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.
CONSENT	2 years	YouTube sets this cookie via embedded youtube-videos and registers anonymous statistical data.

Cookie	Durata	Descrizione
fr	3 months	Facebook sets this cookie to show relevant advertisements to users by tracking user behaviour across the web, on sites that have Facebook pixel or Facebook social plugin.
VISITOR_INFO1_LIVE	5 months 27 days	A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface.
YSC	session	YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-id	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt.innertube::nextId	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.

Deep Research di OpenAI: l’IA che ridefinisce i limiti dell’AI

NEWS AIopenmind su:

Iscrizione NEWSLETTER

Visita le sezioni del sito

Link utili

Media Partner