Kuwait Times

‘Strawberry’: OpenAI working on new reasoning technology


WASHINGTON: ChatGPT maker OpenAI is working on a novel approach to its artificial intelligence models in a project code-named “Strawberry,” according to a person familiar with the matter and internal documentation reviewed by Reuters. The project, details of which have not been previously reported, comes as the Microsoft-backed startup races to show that the types of models it offers are capable of delivering advanced reasoning capabilities.

Teams inside OpenAI are working on Strawberry, according to a copy of a recent internal OpenAI document seen by Reuters in May. Reuters could not ascertain the precise date of the document, which details a plan for how OpenAI intends to use Strawberry to perform research. The source described the plan to Reuters as a work in progress. The news agency could not establish how close Strawberry is to being publicly available. How Strawberry works is a tightly kept secret even within OpenAI, the person said. The document describes a project that uses Strawberry models with the aim of enabling the company’s AI to not just generate answers to queries but to plan ahead enough to navigate the Internet autonomously and reliably to perform what OpenAI terms “deep research,” according to the source.

This is something that has eluded AI models to date, according to interviews with more than a dozen AI researchers. Asked about Strawberry and the details reported in this story, an OpenAI company spokesperson said in a statement: “We want our AI models to see and understand the world more like we do. Continuous research into new AI capabilities is a common practice in the industry, with a shared belief that these systems will improve in reasoning over time.”

The spokesperson did not directly address questions about Strawberry. The Strawberry project was formerly known as Q*, which Reuters reported last year was already seen inside the company as a breakthrough. Two sources described viewing earlier this year what OpenAI staffers told them were Q* demos, capable of answering tricky science and math questions out of reach of today’s commercially available models.

A different source briefed on the matter said OpenAI has tested AI internally that scored over 90% on a MATH dataset, a benchmark of championship math problems. Reuters could not determine if this was the “Strawberry” project. On Tuesday at an internal all-hands meeting, OpenAI showed a demo of a research project that it claimed had new human-like reasoning skills, according to Bloomberg. An OpenAI spokesperson confirmed the meeting but declined to give details of the contents. Reuters could not determine if the project demonstrated was Strawberry.

OpenAI hopes the innovation will improve its AI models’ reasoning capabilities dramatically, the person familiar with it said, adding that Strawberry involves a specialized way of processing an AI model after it has been pre-trained on very large datasets.

Researchers Reuters interviewed say that reasoning is key to AI achieving human or super-human-level intelligence. While large language models can already summarize dense texts and compose elegant prose far more quickly than any human, the technology often falls short on common sense problems whose solutions seem intuitive to people, like recognizing logical fallacies and playing tic-tac-toe. When the model encounters these kinds of problems, it often “hallucinates” bogus information. AI researchers interviewed by Reuters generally agree that reasoning, in the context of AI, involves the formation of a model that enables AI to plan ahead, reflect how the physical world functions, and work through challenging multi-step problems reliably. Improving reasoning in AI models is seen as the key to unlocking the ability for the models to do everything from making major scientific discoveries to planning and building new software applications.

OpenAI CEO Sam Altman said earlier this year that in AI “the most important areas of progress will be around reasoning ability.” Other companies like Google, Meta and Microsoft are likewise experimenting with different techniques to improve reasoning in AI models, as are most academic labs that perform AI research. Researchers differ, however, on whether large language models (LLMs) are capable of incorporating ideas and long-term planning into how they do prediction. For instance, one of the pioneers of modern AI, Yann LeCun, who works at Meta, has frequently said that LLMs are not capable of human-like reasoning.

GENEVA: OpenAI CEO Sam Altman (right) appears on a giant screen, speaking remotely during a keynote with Nicholas Thompson, CEO of The Atlantic, at the ITU AI for Good Global Summit in Geneva, in this file photo. – AFP
