Versa AI hub

“Compute scaling during testing” is the path to better AI systems

December 18, 2024


Summary

Taking inspiration from OpenAI’s o1 model, Hugging Face researchers have demonstrated that intelligently scaling compute during inference can significantly improve the performance of open source language models. Their approach combines different search strategies and reward models.

Scaling compute during pre-training has been critical to the development of large language models (LLMs) in recent years, but the required resources have become increasingly expensive, and researchers are exploring alternative approaches. According to Hugging Face researchers, scaling compute during inference offers a promising solution: dynamic inference strategies let models spend more time on complex tasks.

Scaling compute at test time, that is, during inference, is nothing new and has been a key factor in the success of AI systems like AlphaZero. But OpenAI’s o1 was the first to clearly demonstrate how much a language model’s performance can improve when it is given more time to “think” about difficult tasks. There are several possible implementation approaches, however, and it remains unclear which one OpenAI uses.

From basic to complex search strategies

The researchers examined three main search-based approaches. The “Best-of-N” method generates multiple candidate solutions and selects the best one. Beam search uses a process reward model (PRM) to systematically explore the solution space. The newly developed “Diverse Verifier Tree Search” (DVTS) additionally optimizes for diversity among the solutions found.
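To make the difference between the first two strategies concrete, here is a minimal Python sketch on a toy problem. It is not the Hugging Face implementation: `propose_step` stands in for a language model, `score` stands in for a process reward model, and the constants `TARGET` and `LENGTH` are invented for illustration. DVTS is not sketched here.

```python
import random
from typing import List

TARGET = 20  # toy goal (assumption): the steps of a solution should sum to this
LENGTH = 4   # toy setting (assumption): number of steps per solution


def propose_step(rng: random.Random) -> int:
    """Hypothetical stand-in for a language model proposing one step."""
    return rng.randint(0, 9)


def score(partial: List[int]) -> float:
    """Hypothetical stand-in for a process reward model (PRM).

    Higher is better: partial solutions whose running sum is on track
    to reach TARGET after LENGTH steps receive a higher score.
    """
    on_track = TARGET * len(partial) / LENGTH
    return -abs(on_track - sum(partial))


def best_of_n(n: int, rng: random.Random) -> List[int]:
    """Sample n complete solutions independently and keep the best-scoring one."""
    candidates = [[propose_step(rng) for _ in range(LENGTH)] for _ in range(n)]
    return max(candidates, key=score)


def beam_search(beam_width: int, expansions: int, rng: random.Random) -> List[int]:
    """Grow solutions step by step, keeping only the top-scoring partial ones."""
    beams: List[List[int]] = [[]]
    for _ in range(LENGTH):
        # Extend every surviving partial solution with several proposed steps.
        expanded = [
            beam + [propose_step(rng)] for beam in beams for _ in range(expansions)
        ]
        # Score the partial solutions and prune to the beam width.
        expanded.sort(key=score, reverse=True)
        beams = expanded[:beam_width]
    return beams[0]


if __name__ == "__main__":
    print(best_of_n(16, random.Random(0)))
    print(beam_search(4, 4, random.Random(0)))
```

Best-of-N only scores finished solutions, while beam search consults the scorer after every step and spends its budget on the most promising partial solutions.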

The test results are impressive: a Llama model with just 1 billion parameters matched the performance of a model eight times larger. On math tasks, it reached almost 55% accuracy, which is close to the average performance of computer science doctoral students, according to Hugging Face.

Image: Hugging Face

The 3 billion parameter model outperformed the 70 billion parameter Llama 3.1, a model 22 times larger, thanks to the team’s proposed compute-optimal method, which selects the best search strategy for each compute budget.

Image: Hugging Face

In both cases, the team compared a small model that used the inference techniques with a larger model that did not.
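The idea of selecting the best search strategy for each compute budget can be sketched as a simple lookup over measured accuracies. The function name `pick_strategy_per_budget` and the numbers in `placeholder_accuracy` are invented placeholders for illustration only; they are not results from the Hugging Face experiments.

```python
from typing import Dict


def pick_strategy_per_budget(
    accuracy: Dict[str, Dict[int, float]]
) -> Dict[int, str]:
    """For each compute budget, return the strategy with the highest accuracy.

    `accuracy` maps strategy name -> {budget (samples per problem): accuracy}.
    """
    budgets = sorted({b for per_budget in accuracy.values() for b in per_budget})
    best: Dict[int, str] = {}
    for budget in budgets:
        # Only compare strategies that were actually measured at this budget.
        measured = {
            name: per_budget[budget]
            for name, per_budget in accuracy.items()
            if budget in per_budget
        }
        best[budget] = max(measured, key=measured.get)
    return best


# Placeholder numbers, NOT measurements from the article or the researchers.
placeholder_accuracy = {
    "best_of_n": {4: 0.30, 16: 0.35, 64: 0.38},
    "beam_search": {4: 0.33, 16: 0.40, 64: 0.41},
    "dvts": {4: 0.31, 16: 0.39, 64: 0.44},
}

chosen = pick_strategy_per_budget(placeholder_accuracy)
print(chosen)
```

With a table like this, the selection reduces to an argmax per budget; the hard part in practice is measuring the accuracies reliably.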

Verifiers play an important role

Verifiers, or reward models, play a central role in all these approaches. They evaluate the quality of the generated solutions and guide the search toward promising candidates. However, according to the team, benchmarks like ProcessBench show that current verifiers still have weaknesses, especially in robustness and generality.

Improving verification capabilities is therefore an important starting point for future research. The ultimate goal is a model that can autonomously verify its own output, something the team suggests OpenAI’s o1 may already do.
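As an illustration of how a verifier's scores can steer a search, the sketch below ranks candidate solutions from per-step scores. The aggregation options (`last`, `min`, `prod`) are common choices assumed here for illustration, and the step scores are made up; the article does not say which aggregation the researchers used.

```python
import math
from typing import Dict, List


def aggregate(step_scores: List[float], how: str = "last") -> float:
    """Collapse per-step verifier scores into one score for a whole solution."""
    if not step_scores:
        raise ValueError("a solution needs at least one scored step")
    if how == "last":
        return step_scores[-1]          # trust the score of the final step
    if how == "min":
        return min(step_scores)         # a solution is as good as its weakest step
    if how == "prod":
        return math.prod(step_scores)   # every weak step lowers the total
    raise ValueError(f"unknown aggregation: {how}")


def rank_candidates(candidates: Dict[str, List[float]], how: str = "last") -> List[str]:
    """Return candidate names ordered from best to worst aggregated score."""
    return sorted(
        candidates,
        key=lambda name: aggregate(candidates[name], how),
        reverse=True,
    )


# Made-up step scores for two hypothetical candidate solutions.
example = {"a": [0.9, 0.2, 0.8], "b": [0.7, 0.7, 0.6]}
print(rank_candidates(example, "last"))
print(rank_candidates(example, "min"))
```

The example shows why verifier robustness matters: the same candidates can be ranked differently depending on how step scores are combined, so a noisy score on a single step can change which solution is selected.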

More information and some of the tools used are available at Hugging Face.
