chinaglobalpulse
We are on social networks
Subscribe to our public pages and channels on social media and stay up-to-date with the latest news.
Стоимость обучения модели DeepSeek: что стоит за $294 000

Cost of Training the DeepSeek Model: The Truth About $294,000

The cost of training the DeepSeek model is reported at $294,000. We analyze included costs, hidden expenses, and comparisons with other AI models.

Share your love

CNN reported that the cost of training the DeepSeek model was only $294,000, which caused a stir in the AI industry. For comparison, similar Western models can cost tens or even hundreds of millions of dollars. At first glance, this figure seems incredibly low, but it’s essential to understand what is included in this cost and what remains unaccounted for to grasp the real cost of training the DeepSeek model.


What We Know About the Reported Cost

According to the company:

  • Training the R1 model cost $294,000;
  • A cluster of 512 Nvidia H800 GPUs was used;
  • The final training stage lasted around 80 hours;
  • Other GPUs, including Nvidia A100, were used in preliminary stages before switching to H800.

These numbers reflect only the final stage of training and do not represent the full development costs.


What Is Not Included in the Official Figure

The reported $294,000 does not account for:

  1. Pre-training of the base model before R1 development.
  2. Data collection and cleaning, including annotation.
  3. Research work, such as architecture design, prototypes, and experiments.
  4. Infrastructure and operational costs, including electricity, cooling, and equipment rental.
  5. Salaries of engineers and researchers involved in the project.

Experts suggest that the real cost of training the DeepSeek model is significantly higher.


Table: Comparison of DeepSeek Costs with Competitors

Model / ProjectReported or Estimated CostNotes
DeepSeek R1 (official)~$294,000Final stage on H800, 80 hours
DeepSeek V3 (full estimate)~$5.5–6 millionIncludes all development stages and pre-training
GPT-4 (OpenAI)>$100 millionFull development cycle, infrastructure, and data costs
LLaMA (Meta)Tens of millionsFull pre-training and infrastructure included

Why This Figure Sparked Discussion

  • Contrast with Western costs: $294,000 is extremely low compared to hundreds of millions of dollars.
  • Different calculation methods: DeepSeek reported only the final training stage.
  • Political and technological implications: China demonstrates the ability to build competitive models even with limited access to top-tier GPUs.
  • Market impact: If confirmed, this approach could lower entry barriers for startups and academic institutions.

Conclusion

The cost of training the DeepSeek model at $294,000 highlights efficiency and optimization but represents only a limited stage of development. The true cost, including pre-training, infrastructure, and research, is likely in the millions. Still, even this relatively low figure demonstrates a trend toward cost optimization in the AI industry.


FAQ Frequently Asked Questions

1. Is the cost of training the DeepSeek model really $294,000?
Yes, this is the officially reported figure, but it only covers the final training stage.

2. Why is this cost so different from GPT-4 and other models?
Western companies include the full development cycle, whereas DeepSeek reported only a portion of the training costs.

3. What is the real cost of training the DeepSeek model?
Experts estimate the full cost at around $5–6 million.

4. What is included in the hidden expenses?
Pre-training, data preparation, architecture research, salaries, infrastructure, and electricity.

5. What does this mean for the future of AI?
If DeepSeek’s methods are validated, training language models could become cheaper and more accessible to more companies and startups.


Source: CNN

Share your love

Leave a Reply

Your email address will not be published. Required fields are marked *

Stay informed and not overwhelmed, subscribe now!