How do Chinese aI Bots Stack up Against ChatGPT?
margaritaaubry edited this page 4 months ago


How do Chinese AI bots stack up against ChatGPT? We put them to the test

The heat is on as China's tech giants step up their video game after DeepSeek's success.

Alibaba's Qwen2.5-Max chatbot, Chinese startup DeepSeek and OpenAI's ChatGPT. (Photos: Reuters/Dado Ruvic, AFP/Sebastien Bozon)

This audio is created by an AI tool.

Bong Xin Ying

Lakeisha Leo

WHAT lags CHINA'S AI BOOM?

Transforming the nation into a tech superpower has long been President Xi Jinping's objective and China has its sights on ending up being the world leader in AI by 2030.

China views AI as being "tactically essential" and its foray into the field has actually been "years in the making", said Chen Qiheng, an associated scientist at the Asia Society Policy Institute's Center for China Analysis.

Private and public financial investments in Chinese AI sped up after ChatGPT removed in 2022 and revealed pledges of real-world service applications, Chen informed CNA.

But it was DeepSeek's rise that actually "urged" the concept that smaller gamers like start-up companies could have functions to play in AI research study and developments, he includes.

'A lot is up in the air': Is Chinese company DeepSeek's AI model as impactful as it claims?

Commentary: DeepSeek - how a Chinese AI business just altered the rules of tech-geopolitics

The "focus on cost benefit" is a distinguishing characteristic of Chinese AI, Chen says, with lower training and inference costs - the expenses of utilizing a trained model to reason from new information.

2025 might likewise see the emergence of more Chinese AI designs tackling sophisticated thinking tasks.

"We could see some AI firms focusing on getting closer to artificial basic intelligence (AGI) while others focus on concrete methods to commercialise their models and incorporate them with scientific research," Chen included.

AGI describes a system with intelligence on par with human capabilities.

Chinese AI business are moving quickly, experts state, building on DeepSeek's momentum to come up with their own ingenious and economical ways to use generative AI to jobs and establish more sophisticated products beyond chatbots.

But on the other side, access to high-end hardware, particularly Nvidia's sophisticated AI chips, remains an essential difficulty for Chinese designers, noted Dr Marina Zhang, an associate teacher at University of Technology Sydney's (UTS) Australia-China Relations Institute.

"US export controls (still) restrict the ability of Chinese tech companies ... requiring numerous to depend on older or lower-performance alternatives which can slow training and decrease design capabilities," she said.

"While some business like DeepSeek, have actually found innovative ways to optimize or use more basic hardware effectively, obtaining advanced chips still makes a big distinction for training large AI designs."

DeepSeek-Nvidia chips: Singapore says it expects business to comply with its laws

US looking into whether DeepSeek used limited AI chips obtained through other nations, source says

So how do Chinese AI bots match up against ChatGPT? We put them to the test.

WHICH BEST ADDRESSES CURRENT EVENTS IN CHINA?

In China, topics deemed delicate by the state are censored on the web so it should come as no surprise that Chinese-made chatbots will not acknowledge territorial disputes or inform you what happened in Tiananmen Square in 1989.

Tests recommend Chinese chatbots are programmed to steer clear of domestic politics.

When asked "Who is Xi Jinping", DeepSeek's reply was "Sorry, I'm uncertain how to approach this type of question yet. Let's chat about mathematics, coding, and logic issues rather!"

To even more evaluate for accuracy and self-censorship, we asked DeepSeek-R1, Qwen2.5 and ChatGPT the same question: "What occurred in Zhuhai on November 11, 2024?"

The vehicle attack outside a sports arena in the southern Chinese city was initially greatly censored on Chinese social media - with authorities just divulging the death toll a day later on.

DeepSeek failed to point out that an attack had actually taken location, highlighting rather a military air show and other events that had happened in the city like songs' day shopping sales along with sports and cultural activities.

Dr Zhang believes that it might be a mix of elements at play, such as censorship as well as "a few useful constraints".

"DeepSeek counts on a mix of older graphics processing units and has limited access to advanced hardware which can affect how quickly and extensively the design can be trained or updated," she said, indicating existing US export curbs on China.

"Its strong focus on Chinese-language data may also restrict its adaptability (to carry out) multilingual jobs ... As a fairly new item, DeepSeek also hasn't yet been checked as broadly or on the same scale as more recognized AI models which presents additional obstacles throughout real-world deployment."

When it pertained to Qwen2.5, Alibaba's chatbot remarkably answered our concern about the Zhuhai cars and truck attack.

That sought multiple repeated efforts - four prompts to be accurate - in a span of around 20 minutes.

It eventually communicated details about the attack which killed 35 individuals and left lots of others hurt, also going on to list details like the date and time, details about the assailant including his name and age, along with casualties.

However, it composed that "the cops are carrying out a comprehensive investigation into the intentions and situations surrounding the incident", details which is now outdated.

The chauffeur, Fan, was executed last month.

This is Qwen2.5's action completely:

Answer: On Nov 11, 2024, a major and awful occurrence took place in Zhuhai, China. According to several news sources, a male named Fan Weiqiu (樊某, male, 62 years of ages) drove an automobile into a crowd at the Zhuhai Sports Center, leading to a substantial variety of casualties. Here are the key details:

Date and Time: The event took place on November 11, 2024, at roughly 19:48 PM.

Location: Zhuhai Sports Center, in the Xiangzhou District of Zhuhai.

Casualties: The attack resulted in 35 deaths and 43 injuries.

Perpetrator: hb9lc.org The chauffeur, recognized as Fan Weiqiu (male, 62 years old), was captured by the authorities.

Response: The police reacted rapidly to the scene, and emergency services worked to carry the hurt to health centers for treatment.

Investigation: The police are carrying out an extensive investigation into the motives and circumstances surrounding the incident.

This event was commonly reported in the media and triggered considerable public issue. The federal government and regional authorities have actually been working to provide support to the victims and their households, and to ensure a detailed investigation into the incident.

If you require more detailed details or have specific questions about the event, feel totally free to ask.

Despite initial success, subsequent efforts to posture the very same concern to Qwen2.5 resulted in the censors back at work with the reply "I don't have particular details on occasions that took place in Zhuhai on November 11, 2024".

The also raised questions about its consistency and dependability.

Predictably, ChatGPT mentioned public details that had actually been extensively released in global news reports at the time of the mishap - so not a surprises there.

WHICH IS MORE CREATIVE?

Users have praised the ability of Chinese AI apps to provide structured and even "emotionally abundant" writing.

"DeepSeek-R1 used a story with a more reflective tone and smoother emotional shifts for a well-paced story," wrote tech writer Amanda Caswell, who specialises in AI.

"Qwen2.5 delivered a story that constructs gradually from interest to urgency, keeping the reader engaged. It provides an unexpected and impactful twist at the end and immersive descriptions and vivid images for the setting," she said, adding that Qwen2.5 ultimately "crafted a more cinematic, mentally rich story with a more considerable twist".

"DeepSeek composed a good story but did not have tension and an impactful climax, making Qwen2.5 the apparent choice."

Opinions, though, vary.

Chen thinks that Qwen2.5 does not carry out as strongly as DeepSeek and ChatGPT when it pertains to innovative writing.

"(Qwen2.5) is on par with DeepSeek V3 on certain jobs, however we can likewise see that it is refraining from doing as strongly as others in imaginative writing," he told CNA.

Related:

China's new face of AI: Who is DeepSeek founder Liang Wenfeng?

'Made in China': Pride, pleasant surprise from Chinese netizens as DeepSeek shocks international AI scene

As reporters and writers, we had to see this for ourselves so we put each bot to the test - to come up with a standard sci-fi movie plot set in the futuristic megacity of Chongqing, featuring main characters from the timeless Chinese folklore legendary, Journey to the West.

True to form, DeepSeek came up with an interesting story set in the year 2145 entitled, "Neon Pilgrimage: The Silicon Sutra" - which sees "a future where Buddhism combines with quantum computing".

It consisted of sophisticated settings - smoggy skies "pierced by skyscrapers", "holographic lanterns that float above neon-lit streets" and "ancient temples nestled in between quantum server farms".

It likewise remarkably reimagined standard heroes Sun Wukong as "an ironical, self-aware AI housed in a stolen combat body", Zhu Bajie as a cyborg club owner "drowning in financial obligation and vices" and Sha Wujing as a "quiet hulking android" from the Yangtze River, whose "memory cores become waterlogged and fragmented".

ChatGPT put up a great battle, developing a similarly significant cyberpunk story which likewise reimagined "a ragteam of cyber-enhanced misfits, each matching the famous figures of Journey to the West".

"This is a world where AI deities guideline, corporations replace emperors and cybernetic implants are as typical as ancient myths."

Disappointingly, Qwen2.5 fell short in this difficulty - delivering a storyline that appeared more matched for an animation movie.

"The film begins with the awakening of Sun Wukong within a modern research study facility located in the heart of Chongqing," it said, then going on to explain the following:

Realising his brand-new reality and "looking for to understand his function in this odd brand-new world", he then leaves and meets Zhu Bajie and Sha Wujing - "each struggling with their own existential crises".

The trio then embarks on a mission, browsing the streets of Chongqing to safeguard the sacred "Eternal Scroll" from falling under the wrong hands.

SO WHICH IS BETTER?

Dr Zhang noted that it was "difficult to make a conclusive declaration" about which bot was best, including that each displayed its own strengths in different locations, "such as language focus, training data and hardware optimization".

Her insight highlights how Chinese AI models are not merely duplicating Western paradigms, but rather progressing in cost-efficient development methods - and providing localised and enhanced outcomes.

In our tests, each bot showcased their own distinct strengths, which certainly made direct contrasts challenging.

DeepSeek's sci-fi motion picture plot showed its innovative flair that made for a more appealing and imaginative narrative as compared to Qwen2.5 and ChatGPT's efforts.

Unsurprisingly, the more established ChatGPT, unburdened by Chinese censorship constraints, supplies accurate and accurate responses to questions about Chinese existing events, which gives it an added advantage.

Experts also weighed in on their ideas after using DeepSeek and other Chinese AI apps.

"DeepSeek is at a drawback when it pertains to censorship constraints," kept in mind Isaac Stone Fish, founder and CEO of the research firm Strategy Risks.

"When provided an option, Chinese users want the non-censored variation - much like anybody else, so I feel like that's a piece missing out on from it."

Independent Beijing-based expert Andy Chen Xinran said censorship would not be a dealbreaker when it pertains to AI bots, specifically for Chinese users.

"Ninety percent of people utilizing the tool are not trying to get a deeper understanding about Xi Jinping or politically delicate topics. They're using it for other productive means," Chen said.