DeepSeek releases R1 model trained for $294,000 on 512 H800 GPUs
The Chinese company DeepSeek AI has released its large language model, R1, which was trained at a cost of only $294,000 on 512 Nvidia H800 GPUs.
In a paper published in the journal Nature, the company detai...