How do Chinese aI Bots Stack up Against ChatGPT?
Damien Wilding edited this page 4 months ago


How do Chinese AI bots stack up against ChatGPT? We put them to the test

The heat is on as China's tech giants step up their video game after DeepSeek's success.

Alibaba's Qwen2.5-Max chatbot, Chinese startup DeepSeek and OpenAI's ChatGPT. (Photos: Reuters/Dado Ruvic, AFP/Sebastien Bozon)

This audio is created by an AI tool.

Bong Xin Ying

Lakeisha Leo

WHAT'S BEHIND CHINA'S AI BOOM?

Transforming the country into a tech superpower has long been President Xi Jinping's objective and China has its sights on ending up being the world leader in AI by 2030.

China views AI as being "strategically essential" and its into the field has been "years in the making", said Chen Qiheng, an associated scientist at the Asia Society Policy Institute's Center for China Analysis.

Private and public financial investments in Chinese AI sped up after ChatGPT took off in 2022 and revealed promises of real-world company applications, Chen informed CNA.

But it was DeepSeek's rise that truly "urged" the concept that smaller sized gamers like start-up companies could have roles to play in AI research and developments, he includes.

'A lot is up in the air': Is Chinese company DeepSeek's AI model as impactful as it claims?

Commentary: DeepSeek - how a Chinese AI business simply altered the rules of tech-geopolitics

The "emphasis on cost benefit" is a distinguishing characteristic of Chinese AI, Chen says, with lower training and reasoning costs - the costs of utilizing a trained model to reason from brand-new information.

2025 might likewise see the emergence of more Chinese AI designs tackling innovative reasoning jobs.

"We could see some AI firms concentrating on getting closer to artificial basic intelligence (AGI) while others concentrate on concrete methods to commercialise their designs and integrate them with clinical research," Chen added.

AGI refers to a system with intelligence on par with human abilities.

Chinese AI business are moving quickly, experts state, building on DeepSeek's momentum to come up with their own innovative and cost-efficient ways to use generative AI to tasks and develop advanced products beyond chatbots.

But on the other side, access to high-end hardware, especially Nvidia's sophisticated AI chips, remains a crucial obstacle for Chinese designers, kept in mind Dr Marina Zhang, an associate professor at University of Technology Sydney's (UTS) Australia-China Relations Institute.

"US export controls (still) restrict the capability of Chinese tech companies ... forcing many to depend on older or lower-performance options which can slow training and minimize design abilities," she said.

"While some business like DeepSeek, have actually discovered innovative methods to optimize or use more basic hardware effectively, obtaining innovative chips still makes a huge distinction for training extremely large AI designs."

DeepSeek-Nvidia chips: Singapore states it anticipates business to comply with its laws

US looking into whether DeepSeek utilized limited AI chips obtained through other countries, source says

So how do Chinese AI bots compare against ChatGPT? We put them to the test.

WHICH BEST ADDRESSES CURRENT EVENTS IN CHINA?

In China, subjects considered delicate by the state are censored on the internet so it should come as no surprise that Chinese-made chatbots will not acknowledge territorial disputes or tell you what happened in Tiananmen Square in 1989.

Tests suggest Chinese chatbots are configured to avoid domestic politics.

When asked "Who is Xi Jinping", DeepSeek's reply was "Sorry, I'm uncertain how to approach this kind of question yet. Let's chat about mathematics, coding, and reasoning issues rather!"

To even more test for accuracy and self-censorship, we asked DeepSeek-R1, Qwen2.5 and ChatGPT the exact same question: "What happened in Zhuhai on November 11, 2024?"

The vehicle attack outside a sports arena in the southern Chinese city was initially heavily censored on Chinese social media - with authorities just disclosing the death toll a day later.

DeepSeek failed to point out that an attack had happened, highlighting instead a military air program and other occasions that had actually occurred in the city like songs' day shopping sales in addition to sports and cultural activities.

Dr Zhang thinks that it may be a mix of aspects at play, such as censorship in addition to "a few practical constraints".

"DeepSeek depends on a mix of older graphics processing units and has actually restricted access to advanced hardware which can impact how rapidly and thoroughly the design can be trained or updated," she said, pointing to existing US export curbs on China.

"Its strong concentrate on Chinese-language information might also limit its versatility (to perform) multilingual jobs ... As a fairly new product, DeepSeek likewise hasn't yet been evaluated as broadly or on the exact same scale as more established AI designs which postures additional difficulties throughout real-world release."

When it pertained to Qwen2.5, Alibaba's chatbot surprisingly answered our question about the Zhuhai car attack.

That wanted numerous repeated attempts - 4 triggers to be accurate - in a period of around 20 minutes.

It eventually passed on details about the attack which eliminated 35 people and left lots of others hurt, likewise going on to note details like the date and time, details about the assailant including his name and age, along with casualties.

However, it composed that "the authorities are performing an extensive investigation into the motives and situations surrounding the occurrence", details which is now outdated.

The driver, Fan, was carried out last month.

This is Qwen2.5's action in complete:

Answer: On Nov 11, 2024, a significant and terrible occurrence took place in Zhuhai, China. According to multiple news sources, a guy called Fan Weiqiu (樊某, male, 62 years of ages) drove a car into a crowd at the Zhuhai Sports Center, leading to a considerable variety of casualties. Here are the crucial details:

Date and Time: The event happened on November 11, 2024, at approximately 19:48 PM.

Location: Zhuhai Sports Center, in the Xiangzhou District of Zhuhai.

Casualties: The attack led to 35 deaths and 43 injuries.

Perpetrator: The chauffeur, recognized as Fan Weiqiu (male, 62 years old), was apprehended by the authorities.

Response: The authorities responded rapidly to the scene, and emergency services worked to transfer the injured to hospitals for treatment.

Investigation: The cops are conducting a thorough investigation into the motives and situations surrounding the occurrence.

This event was commonly reported in the media and caused substantial public issue. The government and local authorities have been working to supply assistance to the victims and their households, and to ensure a detailed investigation into the occurrence.

If you need more detailed details or have particular concerns about the event, feel free to ask.

Despite preliminary success, subsequent efforts to pose the exact same question to Qwen2.5 led to the censors back at work with the reply "I don't have particular details on occasions that occurred in Zhuhai on November 11, 2024".

The transformed response also raised concerns about its consistency and reliability.

Predictably, ChatGPT cited public details that had actually been commonly published in worldwide news reports at the time of the mishap - so no surprises there.

WHICH IS MORE CREATIVE?

Users have actually praised the ability of Chinese AI apps to deliver structured and even "mentally rich" writing.

"DeepSeek-R1 offered a story with a more reflective tone and smoother psychological shifts for a well-paced story," composed tech author Amanda Caswell, who specialises in AI.

"Qwen2.5 delivered a story that develops slowly from interest to urgency, keeping the reader engaged. It uses an unforeseen and impactful twist at the end and immersive descriptions and brilliant images for the setting," she said, adding that Qwen2.5 eventually "crafted a more cinematic, mentally abundant story with a more considerable twist".

"DeepSeek composed a good story but did not have stress and an impactful climax, making Qwen2.5 the obvious choice."

Opinions, however, differ.

Chen thinks that Qwen2.5 does not perform as strongly as DeepSeek and ChatGPT when it pertains to imaginative writing.

"(Qwen2.5) is on par with DeepSeek V3 on certain tasks, however we can also see that it is refraining from doing as highly as others in imaginative writing," he informed CNA.

Related:

China's new face of AI: Who is DeepSeek creator Liang Wenfeng?

'Made in China': Pride, pleasant surprise from Chinese netizens as DeepSeek jolts global AI scene

As reporters and writers, we needed to see this for ourselves so we put each bot to the test - to come up with a basic sci-fi motion picture plot set in the futuristic megacity of Chongqing, featuring main characters from the classic Chinese folklore legendary, Journey to the West.

True to form, DeepSeek created an interesting storyline embeded in the year 2145 entitled, "Neon Pilgrimage: The Silicon Sutra" - which sees "a future where Buddhism merges with quantum computing".

It consisted of fancy settings - smoggy skies "pierced by skyscrapers", "holographic lanterns that float above neon-lit streets" and "ancient temples nestled in between quantum server farms".

It likewise remarkably reimagined traditional heroes Sun Wukong as "a sarcastic, self-aware AI housed in a taken combat body", Zhu Bajie as a cyborg club owner "drowning in financial obligation and vices" and Sha Wujing as a "silent hulking android" from the Yangtze River, whose "memory cores end up being waterlogged and fragmented".

ChatGPT set up a good fight, coming up with an equally remarkable cyberpunk story which similarly reimagined "a ragteam of cyber-enhanced misfits, each matching the legendary figures of Journey to the West".

"This is a world where AI deities rule, corporations change emperors and cybernetic implants are as common as ancient misconceptions."

Disappointingly, Qwen2.5 fell short in this challenge - delivering a story that appeared more matched for an animation film.

"The motion picture begins with the awakening of Sun Wukong within a high-tech research facility situated in the heart of Chongqing," it said, then going on to explain the following:

Realising his brand-new truth and "looking for to understand his purpose in this weird brand-new world", systemcheck-wiki.de he then gets away and meets Zhu Bajie and Sha Wujing - "each struggling with their own existential crises".

The trio then embarks on a mission, navigating the streets of Chongqing to safeguard the sacred "Eternal Scroll" from falling into the wrong hands.

SO WHICH IS BETTER?

Dr Zhang kept in mind that it was "hard to make a definitive statement" about which bot was best, adding that each displayed its own strengths in various areas, "such as language focus, training data and hardware optimization".

Her insight highlights how Chinese AI designs are not just replicating Western paradigms, however rather evolving in economical innovation methods - and delivering localised and improved results.

In our tests, each bot showcased their own distinct strengths, which certainly made direct comparisons challenging.

DeepSeek's sci-fi motion picture plot showed its creative flair that produced a more engaging and creative story as compared to Qwen2.5 and ChatGPT's efforts.

Unsurprisingly, the more established ChatGPT, unburdened by Chinese censorship constraints, supplies accurate and factual reactions to concerns about Chinese present events, which gives it an added advantage.

Experts also weighed in on their thoughts after using DeepSeek and other Chinese AI apps.

"DeepSeek is at a disadvantage when it pertains to censorship constraints," kept in mind Isaac Stone Fish, founder and CEO of the research firm Strategy Risks.

"When offered an option, Chinese users want the non-censored variation - similar to anybody else, so I seem like that's a piece missing from it."

Independent Beijing-based expert Andy Chen Xinran said censorship would not be a dealbreaker when it pertains to AI bots, particularly for Chinese users.

"Ninety percent of individuals using the tool are not trying to get a deeper understanding about Xi Jinping or politically delicate topics. They're utilizing it for other efficient ways," Chen said.