DeepSeek v3 benchmarks comparably to Claude 3.5 Sonnet, indicating that it's now possible to train a frontier-class model (at least for the 2024 version of the frontier) for less than $6 million! DeepSeek also announced their API pricing. From February 8th onwards: Input: $0.27/million tokens ($0.07/million tokens with cache hits) Output: $1.10/million tokens Claude 3.5 Sonnet is currently $3/million for input and $15/million for output, so if the models are indeed of equivalent quality this is a dramatic new twist in the ongoing LLM pricing wars.
https://cdn.masto.host/fedisimonwillisonnet/media_attachments/files/113/720/626/246/615/353/original/6e8805452ed02ee9.png