US Researchers Launch A DeepSeek Competitor

A small team of researchers at Stanford and Washington Universities have created an advanced and very significant AI reasoning model, named s1, for an incredibly low cost of under $50. 

This is highly significant in an industry where developing similar models takes many millions of dollars in resource and infrastructure costs at a time of  growing competition in the AI reasoning field.

For the purpose of comparion, Chinese startup DeepSeek recently made a big impact with its own reasoning model, R1, which the company claims to have been developed for just $6 million.

The s1 model can complete complex reasoning tasks, and has performed in similar ways to OpenAI’s o1 and DeepSeek’s R1 with maths and coding. However, critics are questioning the accuracy of DeepSeek’s claims, and also expressed their concerns regarding the safety and security of its models.

Low Cost Of s1’s Development

This process involves training s1 to mimic the reasoning abilities of an existing AI model, specifically, Google’s Gemini 2.0 Flash Thinking experimental model. By using a curated dataset of 1,000 questions and answers, paired with reasoning traces from the Gemini model, s1 learned how to arrive at accurate solutions in a fraction of the time and cost compared to traditional methods.

According to the researchers, training s1 took just 26 minutes using 16 Nvidia H100 GPUs, costing just $20 in total.

The researchers used what they call Supervised Fine-Tuning (SFT), a method that involves guiding the model with explicit instructions to accelerate the learning process. One particularly interesting development during s1’s creation was the introduction of a “wait” instruction, which helped improve its accuracy. By incorporating pauses into the model’s reasoning process, the researchers found that s1 was able to double-check its responses, often correcting errors and leading to more accurate conclusions.

The researchers behind s1 hope their work will drive open innovation, making powerful reasoning models more accessible to the global community and accelerating advancements in AI technology for the benefit of society. 

However, a higher level of investment may still be necessary to push the envelope of AI innovation. 

The shrort-cut methods used by s1 and R1 (sometimes referred to as distillation) are demonstrably a good method for cheaply re-creating an AI model’s capabilities, but they don’t create new AI models vastly better than what is already available.

arXiv   |    I-HLS    |   Interesting Engineering     |  Tech Xplore   |  Mashable  | Tech Crunch   |   Yahoo

Image: Igor Kutyaev

You Might Also Read: 

A History Of Artificial Intelligence: Its Current & Future Development:


If you like this website and use the comprehensive 6,500-plus service supplier Directory, you can get unrestricted access, including the exclusive in-depth Directors Report series, by signing up for a Premium Subscription.

  • Individual £5 per month or £50 per year. Sign Up
  • Multi-User, Corporate & Library Accounts Available on Request

Cyber Security Intelligence: Captured Organised & Accessible


 

« Thai Police Arrest Russian Hackers
Business Interruption Is The #1 Cyber Risk »

CyberSecurity Jobsite
Check Point

Directory of Suppliers

Clayden Law

Clayden Law

Clayden Law advise global businesses that buy and sell technology products and services. We are experts in information technology, data privacy and cybersecurity law.

MIRACL

MIRACL

MIRACL provides the world’s only single step Multi-Factor Authentication (MFA) which can replace passwords on 100% of mobiles, desktops or even Smart TVs.

NordLayer

NordLayer

NordLayer is an adaptive network access security solution for modern businesses — from the world’s most trusted cybersecurity brand, Nord Security. 

The PC Support Group

The PC Support Group

A partnership with The PC Support Group delivers improved productivity, reduced costs and protects your business through exceptional IT, telecoms and cybersecurity services.

ZenGRC

ZenGRC

ZenGRC (formerly Reciprocity) is a leader in the GRC SaaS landscape, offering robust and intuitive products designed to make compliance straightforward and efficient.

E-Tech

E-Tech

E-Tech has been providing system support and information technology consulting services including Internet and Network Security assessments.

Security Compass

Security Compass

Security Compass, the Security by Design Company, enables organizations to shift left and build secure applications by design, integrated directly with existing DevSecOps tools and workflows.

European Cybercrime Training and Education Group (ECTEG)

European Cybercrime Training and Education Group (ECTEG)

The primary aim of ECTEG is to enhance the coordination of cybercrime training, by identifying opportunities to build the capacity of countries to combat cybercrime

Calian Group

Calian Group

Calian is a diverse Canadian company offering professional services in areas including Advanced Technologies, Health, Learning and IT & Cyber Solutions.

Basque Digital Innovation Hub (BDIH)

Basque Digital Innovation Hub (BDIH)

The aim of the BDIH initiative is to provide industrial enterprises, especially SMEs, with the technological capabilities needed to meet the challenges of industry 4.0.

Gulf Business Machines (GBM)

Gulf Business Machines (GBM)

GBM is a leading end-to-end digital solutions provider, offering the broadest portfolio, including industry-leading digital infrastructure, digital business solutions, security and services.

VMware

VMware

VMware is a leading provider of multi-cloud services for all apps, enabling digital innovation with enterprise control.

ASPIA InfoTech

ASPIA InfoTech

ASPIA Infotech is a leading Information and cybersecurity organization focused on innovative approaches to avert targeted attacks.

Blue Bastion

Blue Bastion

Don’t give cybercriminals the chance to find weaknesses in your company’s cyber security system. Defend your institution from all attacks from all directions with Blue Bastion.

SignMyCode

SignMyCode

SignMyCode is a one-stop shop for trusted and authentic code signing solutions to safeguard software.

Edge Security

Edge Security

Edge Security is an information security research and consulting firm of expert hackers.

Next DLP

Next DLP

Next DLP (formerly Jazz Networks) is a leading provider of insider risk and data protection solutions.

GoCloud Systems

GoCloud Systems

GoCloud is an IT consulting firm. We provide IT strategy and cloud adoption services to the New Zealand Government, Non-Profit Organisations and private industry.

Vortacity Cyber

Vortacity Cyber

Vortacity is a boutique cybersecurity provider specializing in associations, nonprofits, and mission-based organizations.

Halo Security

Halo Security

Halo Security is a fast, easy, and scalable external attack surface management platform that gives security leaders deep visibility into their internet-facing assets.

Octopus Cybersecurity

Octopus Cybersecurity

Octopus VAR is a Validation, Analysis and Reporting tool that gives risk managers and CISOs a powerful control mechanism and a deep view of operational risks.