Alibaba Group Holding on Thursday unveiled an open-source artificial intelligence (AI) reasoning model that it said surpassed the performance of DeepSeek's R1, highlighting the Chinese technology giant's robust AI capabilities across models and data-centre infrastructure.
Following the launch of its QwQ-32B model, Alibaba's Hong Kong-listed shares surged 7.2 per cent to HK$139.30 in Thursday morning trading. Alibaba owns the South China Morning Post.
Despite its relatively modest 32 billion parameters, Alibaba's new model matched or outperformed DeepSeek's R1, which boasts 671 billion parameters, in areas such as mathematics, coding and general problem-solving, according to a blog post by the team responsible for Alibaba's Qwen family of AI models.
A smaller parameter count allows the model to run on far less computing power, making it easier to deploy widely, according to the team.
The lean design of Alibaba's model aligns with the views expressed by Alibaba chairman Joe Tsai in his recent column for the Post, where he emphasised that practical applications were key to maximising intelligence in AI model development.
The release of Alibaba's latest reasoning model (a type of AI system designed to think, reflect and self-critique to solve complex problems) comes less than two months after DeepSeek's R1 shook the global tech industry and stock markets in January.
It also coincides with a surge in AI adoption across China, with Alibaba announcing last month a plan to invest US$52 billion in cloud computing and AI infrastructure over the next three years, marking the largest-ever computing project financed by a single private business in the country.
Alibaba also said that QwQ-32B outperformed OpenAI's o1-mini, which was built with 100 billion parameters. QwQ-32B is available on Hugging Face, the world's largest open-source AI model community.
The Qwen team attributed the performance improvements of its new reasoning model to reinforcement learning techniques, similar to those used by DeepSeek in developing its R1 model.
These advancements "not only demonstrate the transformative potential of reinforcement learning but also pave the way for further innovations in the pursuit of artificial general intelligence", the team said.