The final day of OpenAI's “12 Days of Shipmas” has arrived with the announcement of o3, a new chain-of-thought “reasoning” model that the company claims is its most advanced yet. The model is not yet available for general use, but safety researchers can register for a preview starting today.
OpenAI and others hope that reasoning models will help solve a persistent problem with chatbots: they often give wrong answers. Chatbots don't “think” the way humans do, so different techniques are needed to approximate a human-like thought process.
When asked a question, a reasoning model pauses to consider related prompts that might help it generate the correct answer. For example, if you ask o3, “Can habaneros be grown in the Pacific Northwest?” it might work through intermediate questions such as “Where do habaneros typically grow?”, “What are the ideal conditions for growing habaneros?”, and “What type of climate does the Pacific Northwest have?” before reaching a conclusion. Anyone who has used a chatbot knows that you sometimes need to keep prompting it with follow-ups until it finally arrives at the right answer. A reasoning model is supposed to do that extra work for you.
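To make the decomposition step above concrete, here is a toy sketch (not OpenAI's actual implementation): it breaks the habanero question into the sub-questions from the example, answers each from a small hard-coded knowledge base standing in for model calls, and then combines the results.

```python
# Toy illustration of question decomposition. The knowledge base and
# sub-questions are hypothetical stand-ins for what a reasoning model
# would generate and answer internally.

KNOWLEDGE = {
    "where do habaneros typically grow": "warm tropical regions",
    "ideal habanero conditions": "hot temperatures and a long growing season",
    "pacific northwest climate": "cool and wet, with a short warm season",
}

def answer_sub_question(question: str) -> str:
    """Answer one sub-question (stands in for a single model call)."""
    return KNOWLEDGE.get(question, "unknown")

def reason(question: str) -> str:
    """Decompose the question, answer each part, then combine."""
    sub_questions = [
        "where do habaneros typically grow",
        "ideal habanero conditions",
        "pacific northwest climate",
    ]
    facts = [answer_sub_question(q) for q in sub_questions]
    # A real reasoning model would synthesize a conclusion from these
    # intermediate answers; here we simply join them.
    return "; ".join(facts)

print(reason("Can habaneros be grown in the Pacific Northwest?"))
```

In a real reasoning model the sub-questions and their answers are produced by the model itself at inference time; the point of the sketch is only the shape of the process: decompose, answer, synthesize.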
o3 is the successor to o1, OpenAI's first chain-of-thought reasoning model. A representative said the company skipped the “o2” name “out of respect” for the British telecommunications company O2, though it also doesn't hurt that a bigger number suggests a more advanced product. OpenAI says the new model lets users adjust its reasoning time, choosing low, medium, or high compute. The more computation o3 performs, the better it should do. OpenAI says it will “red team” the new models with researchers to guard against potentially dangerous responses. (Again, the model is not human and does not know right from wrong.)
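The low/medium/high setting described above is exposed in OpenAI's API for o-series models as a request parameter. As a hedged sketch, the helper below builds the request body; the parameter name `reasoning_effort` and the model name `o3-mini` follow OpenAI's published API for earlier o-series models, but check the current API reference before relying on them.

```python
# Sketch only: constructs the request parameters for an o-series model
# with an adjustable reasoning-effort setting. The "reasoning_effort"
# field and "o3-mini" model name are assumptions based on OpenAI's
# documented o-series API, not confirmed details of the o3 release.

def build_request(prompt: str, effort: str = "medium") -> dict:
    """Build chat-completion parameters with a reasoning-effort level."""
    if effort not in ("low", "medium", "high"):
        raise ValueError("effort must be 'low', 'medium', or 'high'")
    return {
        "model": "o3-mini",
        "reasoning_effort": effort,
        "messages": [{"role": "user", "content": prompt}],
    }
```

The resulting dict would be passed to the SDK's chat-completions call; a higher effort setting trades latency and cost for more internal reasoning.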
Reasoning is the buzzword of the moment in generative AI, as industry insiders believe it is the next unlock needed to improve the performance of large language models. Simply adding more processing power no longer yields the same gains, so new techniques are needed. Google DeepMind recently released its own reasoning model, Gemini Deep Research, which can take 5 to 10 minutes to produce a report, analyzing multiple sources from across the web to reach its findings.
OpenAI is confident in o3 and cites impressive benchmarks. On Codeforces, which measures coding ability, o3 scored 2,727; for context, a score of 2,400 would put an engineer in the 99th percentile of programmers. On the 2024 American Invitational Mathematics Examination, it scored 96.7%, missing only one question. We'll have to see how the model holds up in real-world use, and it's generally unwise to rely too heavily on AI models for critical tasks that require accuracy. But optimists are confident the accuracy problem is being solved. Let's hope so, because Google's AI Overviews in Search remain a frequent target of ridicule on social media.
AI upstarts like OpenAI and Perplexity are racing to be the next Google, pooling the world's knowledge and helping users make sense of it. Both now have search products that access real-time web results, aiming to mirror Google more directly.
These players seem to be making leaps and bounds every day. The moment is reminiscent of the late 1990s, when there were many search engines to choose from, such as Yahoo, AltaVista, and Ask Jeeves, all gathering information from the internet and presenting it with their own distinct UX. Most of them disappeared once something far better than the rest came along: Google.
OpenAI is the clear leader right now, with hundreds of millions of monthly active users and a partnership with Apple, but Google has drawn plenty of praise recently for its advancements to the Gemini model. The Verge reports that the company will soon integrate Gemini more deeply into its search interface.