News

Despite its smaller size, DeepSeek-R1-0528-Qwen3-8B outperforms Google’s Gemini 2.5 Flash on the challenging AIME 2025 math benchmark and ...
Developed with SGLang, Atlas Inference surpasses leading AI companies in throughput and cost, running DeepSeek V3 & R1 faster than DeepSeek themselves.
Additionally, the model’s hallucination rate has been reduced, contributing to more reliable and consistent output.
The integration of DeepSeek-R1-0528 improves performance on key benchmarks, including accuracy and coding, which can provide clients with more precise and efficient AI solutions for ...
Aurora Mobile Limited (NASDAQ: JG) ("Aurora Mobile" or the "Company"), a leading provider of customer engagement and marketing technology services in China, today announced the integration of newly ...
Atlas Inference also maximizes GPU efficiency by processing more tokens faster ...
Chinese firm DeepSeek released ... their AI models more efficient to deal with U.S. semiconductor export curbs. Jensen Huang, CEO of Nvidia, which designs the graphics processing units required ...
These enhancements empower GPTBots.ai users to tackle complex tasks in domains like math, science, business, and programming with greater precision and efficiency ... GPU memory, making it accessible ...
Atlas Inference dramatically reduces GPU and server requirements ...
A distilled variant, DeepSeek-R1-0528-Qwen3-8B, is optimized for smaller-scale applications. It achieves state-of-the-art performance among open-source models while requiring only 16 GB of GPU memory ...
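The quoted 16 GB figure is roughly what back-of-the-envelope arithmetic predicts for an 8-billion-parameter model stored in FP16 (the precision is an assumption here; the estimate also ignores activation and KV-cache overhead), as this minimal sketch shows:

```python
def estimate_weight_memory_gb(num_params: float, bytes_per_param: int) -> float:
    """Rough GPU memory needed just to hold the model weights, in GiB."""
    return num_params * bytes_per_param / 1024**3

# 8 billion parameters at 2 bytes each (FP16) -- assumed precision
fp16_gb = estimate_weight_memory_gb(8e9, 2)
print(f"FP16 weights: ~{fp16_gb:.1f} GB")  # ~14.9 GB, close to the quoted 16 GB
```

The small gap between ~14.9 GB of raw weights and the 16 GB figure is consistent with the extra memory inference runtimes need for activations and the KV cache.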