Deepseek Benchmark - Search News

News

16hon MSN

Alibaba's Upgraded Qwen3 Outperforms OpenAI, DeepSeek

Alibaba unveiled a powerful upgrade to its Qwen3 large language model family, according to Benzinga. The upgrade enhances ...

Alibaba's Latest AI Model Outperforms ChatGPT, DeepSeek

Alibaba's Qwen3 model outperforms rivals in AI benchmarks, with improved capabilities in math, coding, and reasoning. Nvidia ...

13d

DeepSeek And The Future Of Enterprise AI

The implications for enterprise AI are significant. Until recently, most leading systems were only available through closed ...

Alibaba upgrades flagship Qwen3 model to outperform OpenAI, DeepSeek in maths, coding

Alibaba Group Holding unveiled an upgraded version of its third-generation Qwen3 family of large language models (LLMs), ...

Hosted on MSN3mon

DeepSeek V3-0324 Sets New Benchmark in Open-Source AI Models

DeepSeek V3-0324 is no exception. As an open-source AI model, it is not only pushing the boundaries of what we imagine AI can achieve but also setting new standards for community-driven innovation ...

EurekAlert!2mon

Benchmark performance of DeepSeek-R1 (IMAGE)

DeepSeek-R1 demonstrates strong performance across multiple educational and reasoning benchmarks. It achieves 79.8% Pass@1 on AIME 2024, slightly surpassing OpenAI-o1-1217. On MATH-500, it reaches ...

World Socialist Web Site5mon

China’s DeepSeek model is a major advance in AI technology

DeepSeek’s performance meets or exceeds that of state-of-the-art AI models from American companies such as Meta and Open AI, surpassing all open-source models previously available and many ...

Tom's Guide3mon

I tested DeepSeek vs Gemini 2.5 with 9 prompts — here's the winner

Gemini 2.5 reportedly leads in math and science benchmarks, scoring 18.8% on Humanity’s Last Exam, a dataset designed to assess AI’s ability to handle complex knowledge-based questions.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results