models DeepSeek-V4 Pro and Flash: The Open-Source Models Challenging GPT-5.4 and Claude
DeepSeek-V4 Pro with 1.6 trillion parameters and Flash with 284 billion — a detailed look at the hybrid attention architecture and benchmark performance against GPT-5.4 and Claude Opus 4.6