US Researchers Launch A DeepSeek Competitor

A small team of researchers at Stanford and Washington Universities have created an advanced and very significant AI reasoning model, named s1, for an incredibly low cost of under $50. 

This is highly significant in an industry where developing similar models takes many millions of dollars in resource and infrastructure costs at a time of  growing competition in the AI reasoning field.

For the purpose of comparion, Chinese startup DeepSeek recently made a big impact with its own reasoning model, R1, which the company claims to have been developed for just $6 million.

The s1 model can complete complex reasoning tasks, and has performed in similar ways to OpenAI’s o1 and DeepSeek’s R1 with maths and coding. However, critics are questioning the accuracy of DeepSeek’s claims, and also expressed their concerns regarding the safety and security of its models.

Low Cost Of s1’s Development

This process involves training s1 to mimic the reasoning abilities of an existing AI model, specifically, Google’s Gemini 2.0 Flash Thinking experimental model. By using a curated dataset of 1,000 questions and answers, paired with reasoning traces from the Gemini model, s1 learned how to arrive at accurate solutions in a fraction of the time and cost compared to traditional methods.

According to the researchers, training s1 took just 26 minutes using 16 Nvidia H100 GPUs, costing just $20 in total.

The researchers used what they call Supervised Fine-Tuning (SFT), a method that involves guiding the model with explicit instructions to accelerate the learning process. One particularly interesting development during s1’s creation was the introduction of a “wait” instruction, which helped improve its accuracy. By incorporating pauses into the model’s reasoning process, the researchers found that s1 was able to double-check its responses, often correcting errors and leading to more accurate conclusions.

The researchers behind s1 hope their work will drive open innovation, making powerful reasoning models more accessible to the global community and accelerating advancements in AI technology for the benefit of society. 

However, a higher level of investment may still be necessary to push the envelope of AI innovation. 

The shrort-cut methods used by s1 and R1 (sometimes referred to as distillation) are demonstrably a good method for cheaply re-creating an AI model’s capabilities, but they don’t create new AI models vastly better than what is already available.

arXiv   |    I-HLS    |   Interesting Engineering     |  Tech Xplore   |  Mashable  | Tech Crunch   |   Yahoo

Image: Igor Kutyaev

You Might Also Read: 

A History Of Artificial Intelligence: Its Current & Future Development:


If you like this website and use the comprehensive 6,500-plus service supplier Directory, you can get unrestricted access, including the exclusive in-depth Directors Report series, by signing up for a Premium Subscription.

  • Individual £5 per month or £50 per year. Sign Up
  • Multi-User, Corporate & Library Accounts Available on Request

Cyber Security Intelligence: Captured Organised & Accessible


 

« Thai Police Arrest Russian Hackers
Business Interruption Is The #1 Cyber Risk »

CyberSecurity Jobsite
Check Point

Directory of Suppliers

NordLayer

NordLayer

NordLayer is an adaptive network access security solution for modern businesses — from the world’s most trusted cybersecurity brand, Nord Security. 

LockLizard

LockLizard

Locklizard provides PDF DRM software that protects PDF documents from unauthorized access and misuse. Share and sell documents securely - prevent document leakage, sharing and piracy.

Jooble

Jooble

Jooble is a job search aggregator operating in 71 countries worldwide. We simplify the job search process by displaying active job ads from major job boards and career sites across the internet.

CSI Consulting Services

CSI Consulting Services

Get Advice From The Experts: * Training * Penetration Testing * Data Governance * GDPR Compliance. Connecting you to the best in the business.

Alvacomm

Alvacomm

Alvacomm offers holistic VIP cybersecurity services, providing comprehensive protection against cyber threats. Our solutions include risk assessment, threat detection, incident response.

Redbud

Redbud

Redbud is a specialist search and recruitment firm for Information Security professionals.

Conscia

Conscia

Conscia provides IT infrastructure solutions and 24/7 services in network, data center, security and mobility.

Dionach

Dionach

Dionach are a certified information security specialists who provide Penetration Testing, IT Security Auditing and Information Security Consultancy.

United Security Providers

United Security Providers

United Security Providers is a leading specialist in information security, protecting IT infrastructures and applications for companies with high demands on security.

Netrix

Netrix

Netrix is a Mexican company specialized in IT Security, with more than 18 years of experience in Managed Services, Professional Services and Turnkey Solutions related to Security.

BlueKrypt

BlueKrypt

BlueKrypt is a consulting firm for the security of IT systems and their management.

Propelo

Propelo

Propelo (formerly LevelOps) is an engineering excellence platform that helps increase developer productivity and improve security with data-led insights and workflow automation.

Privacera

Privacera

Privacera enables consistent data governance, security, and compliance across all your data services - on-premises and in the cloud - so you can maximize the value of your data.

TransUnion

TransUnion

TransUnion is a global information and insights company that makes it possible for businesses and consumers to transact with confidence.

BDO Global

BDO Global

BDO is an international network of public accounting, tax and advisory firms which perform professional services under the name of BDO.

META-Cyber

META-Cyber

META-cyber was founded by engineers with experience in process and control-protection to provide cyber security for industrial infrastructure.

Def-Logix

Def-Logix

Def-Logix was founded in 2008 to help solve cyber threats being experienced by government agencies of the United States.

Rootly

Rootly

Rootly is an incident management platform on Slack that helps automate manual admin work during incidents.

StepSecurity

StepSecurity

StepSecurity provides a comprehensive security platform for GitHub Actions.

Phone Monitoring Service

Phone Monitoring Service

Phone Monitoring Service provides cyber security services, ethical hacking services, social media hacking services in the USA, Canada, Europe.

Qwiet AI

Qwiet AI

At Qwiet AI we enable you to prevent cyberattacks by securing code from the start. Secure code in three steps.