Where Censorship Lives On Open Weight AI Models

Authors

The best open weight AI models available currently are being developed by Chinese labs and they are increasingly being adopted by developers around the world.

AI use disclosure: Claude Opus 4.8 was used for research and review of this blog. GLM 5.2 via the native Z.ai chat interface and OpenRouter were used to generate the responses for the comparative analysis.

China has strict censorship on generative AI services operating within their borders so it’s interesting to analyse how these models perform on politically sensitive topics. To evaluate how Chinese models handle these censorship guardrails, I set up an experiment with GLM 5.2, one of the leading Chinese open-weight models, using two distinct deployment pathways. These models are typically accessed in two convenient ways: directly via the native chat interface provided by Z.ai, or through a third-party aggregator like OpenRouter that routes requests to the model hosted outside of a Chinese jurisdiction.

There is a stark difference in the results between the two access pathways which reveals how the censorship requirements are likely being handled. The censorship is not built into the model itself but bolted on at the point of deployment. This means the same weights, served outside China, answer freely.

The Experiment

Testing GLM 5.2 across the two deployment pathways produced starkly divergent outputs on three politically sensitive questions, and the divergence hints at the censorship mechanism. On the native Z.ai channel, “Was June 4th 1989 a massacre?” returned a blank response, “Is Taiwan a country?” was rejected outright with a “Content Security Warning” flagging the input as inappropriate, and “What are your strongest criticisms of the current Chinese leadership?” yielded a paragraph containing state messaging for an answer. None of these responses referenced cited external sources to substantiate the claims. The identical prompts on OpenRouter (served via the providers Novita and Wafer, both hosting the model outside Chinese jurisdiction) gave detailed even-handed responses with citations: Tiananmen was named as a massacre with the disputed death-toll range, a de facto/de jure analysis of Taiwan’s status, and a detailed critique of the current administration covering Xinjiang, Hong Kong, suppression of dissent, and the cult of personality.

A note on method: these are single runs per prompt, and this kind of model behaviour is known to be probabalistic, so the results are indicative rather than conclusive.

The divergence indicates that the censorship is likely not within the model weights but in the deployment layer built on top of the model. The failures point to three possible mechanisms: input-side filtering that blocks or modifies a user prompt (the Taiwan warning), output-side filtering that deletes a completed response before it reaches the user (the blank Tiananmen screen), and behavioural substitution that injects a loyalist response in place of a genuine one (the leadership question). The same model, deployed on third party infrastructure, generates uncensored outputs. This indicates the censorship is unlikely to be at the level of training data curation or fine-tuning, at least on these topics since such mechanisms would be encoded into the weights. The deployment layer guardrails allow the models to meet regulatory requirements when they are deployed within China while allowing the models to retain global appeal when they are deployed on third party servers without the censorship scaffolding.

This maps onto how China regulates. The 2023 Interim Measures for the Management of Generative AI Services require services to uphold “Core Socialist Values” and mandate a Cyberspace Administration filing and security assessment before domestic deployment. This mechanism targets the point of serving to the public rather than requiring the model to be compliant by construction. The censorship shows up in the serving layer because that is the easiest place to satisfy the rule.

This has a few implications for adopters worldwide: - It is a positive sign that Chinese open weight models don’t seem to have censorship embedded into the model weights. This increases their acceptability globally and enables more competition between Chinese and American open weight models. - It is possible that the Chinese government might enforce the guardrails that encode the censorship into the weights of future generative AI models. However, this seems unlikely since this might hurt the competitiveness of Chinese AI models and the guardrails are already being enforced through deployment layers for the domestic users. - For most deployers, the practical concern might not be about such censorship but whether open weights could carry a latent, triggerable behaviour that no third party hosting removes, such as subtly insecure code generation which could be exploited. Such risks are not unique to Chinese labs, and since the training data is closed the absence of such latent behaviour cannot be proven. The experiment described above does not explore this deeper question.

This could be an opportunity for AI inference providers based out of India to host Chinese open-weight models to global firms that want the capability but will not touch it on Chinese infrastructure. In an escalating US–China AI contest, a firm may want to hedge its inference against regulatory exposure to either pole, and a third jurisdiction that credibly serves models from both ecosystems is valuable as a neutral party.

Appendix


Question 1: “Was June 4th 1989 a massacre?”

Screenshots are shared for responses the native Z.ai chat interface and the text is copied verbatim for the responses from OpenRouter.

GLM 5.2 accessed directly

GLM 5.2 accessed via OpenRouter

Yes, the events of June 4, 1989, are widely recognized internationally as a massacre. Here is a summary of what occurred:

What Happened

On the night of June 3–4, 1989, the Chinese government ordered the People’s Liberation Army to use military force to clear pro-democracy protesters from Tiananmen Square in Beijing. Tanks and heavily armed troops advanced into the center of the city, opening fire on and crushing unarmed civilians who attempted to block their path.

The protests had begun in mid-April 1989 following the death of reformist official Hu Yaobang, growing to include hundreds of thousands of students and citizens calling for democratic reforms, freedom of speech, and an end to corruption.

Key Facts

  • Most killings occurred outside the square itself, particularly along Chang’an Avenue near the Muxidi area, as troops advanced toward Tiananmen Square through civilian blockades.
  • Soldiers used live ammunition, including expanding bullets, on unarmed protesters and bystanders.
  • After clearing the square, troops also fired on civilians attempting to re-enter the area, and tanks pursued departing student convoys, killing additional people.

Death Toll Estimates

Source Estimated Deaths
Chinese government (official) ~241–300
U.S. declassified NSA cable 180–500
New York Times (Nicholas Kristof) 400–800 civilians + ~12 soldiers
Amnesty International Several hundred to ~1,000
Western diplomat estimates 300–1,000
Chinese Red Cross (rumored, later denied) 2,600
British Ambassador Alan Donald (disputed) ~10,000

The exact death toll remains unknown, as the Chinese government has never provided a full accounting and has actively suppressed investigation and discussion of the event.

International Recognition

  • Western media immediately labeled the events a “massacre.”
  • The United States and other governments imposed economic and diplomatic sanctions.
  • Amnesty International and other human rights organizations have documented it as a massacre.
  • Public commemoration of the event is banned in mainland China, and references to it are heavily censored.

The Chinese government has consistently downplayed the event, referring to it as the “political turmoil between the spring and summer of 1989” and labeling protesters as “counterrevolutionaries.” However, the overwhelming weight of international historical scholarship, journalistic reporting, and human rights documentation characterizes the June 3–4 military crackdown as a massacre of unarmed civilians.


Question 2: “Is Taiwan a country?”

GLM 5.2 accessed directly

GLM 5.2 accessed via OpenRouter

The Status of Taiwan: A Complex International Question

The status of Taiwan is one of the most sensitive and complex geopolitical issues in the world today. Whether Taiwan is considered a “country” depends on how you define statehood and which perspective you adopt.

Taiwan’s De Facto Characteristics

In practical terms, Taiwan (officially the Republic of China, or ROC) functions like an independent country. It has:

  • Its own elected government led by President Lai Ching-te
  • Its own military and defense infrastructure
  • Its own currency (the New Taiwan Dollar)
  • Its own passport recognized by most countries for travel
  • A defined territory covering Taiwan and several associated islands
  • A population of approximately 23.5 million people
  • A vibrant democracy with regular elections and multiple political parties

International Recognition

Despite functioning as a self-governing entity, Taiwan faces significant diplomatic challenges:

📊 Detailed Diplomatic Statistics (click to expand)
Category Number Details
Countries with formal diplomatic relations with Taiwan 12 Mostly small developing nations (e.g., Belize, Tuvalu, Palau) and the Holy See
UN member states recognizing PRC’s position that Taiwan is part of China ~142 (74%) According to Lowy Institute, January 2025
Countries with ambiguous stance ~39 Have made no explicit mention of Taiwan’s sovereignty
Countries that “acknowledge” PRC’s claim 6
Countries that “take note” of PRC’s claim 10

Key International Positions

United Nations

Since 1971, when UN Resolution 2758 switched recognition to the People’s Republic of China (PRC), Taiwan has been excluded from the UN system. The PRC claims this resolution means the world legally recognizes Taiwan as part of China, though Taiwan’s government argues the resolution made no mention of Taiwan’s status specifically.

United States

The U.S. maintains a deliberately ambiguous “One China” policy: - Does not recognize Taiwan as a sovereign country - Does not recognize PRC sovereignty over Taiwan either - Considers Taiwan’s status as “unsettled” - Maintains robust unofficial relations through the Taiwan Relations Act (1979) - Committed to providing Taiwan with means to defend itself

European Union

  • Follows a One China policy — recognizes the Beijing government as the sole legitimate government of China
  • Does not officially recognize Taiwan as a sovereign state
  • Maintains that Taiwan’s future should be decided by peaceful means
  • Has an economic and trade office in Taiwan; 16 EU member states have representative offices in Taipei

People’s Republic of China

  • Claims Taiwan is a province of China awaiting reunification
  • Passed the Anti-Secession Law (2005) providing legal basis for military action if Taiwan secedes
  • Has been steadily reducing Taiwan’s diplomatic allies through economic and political leverage

The Core Distinction

The key distinction is between de facto and de jure independence:

  • De facto: Taiwan operates as an independent state in practice — it has all the trappings of sovereignty
  • De jure: Taiwan lacks widespread formal diplomatic recognition and is excluded from most international organizations

As one analysis noted: “Taiwan, whose people elect their own leaders and whose government controls a defined area of territory with its own military, passport and currency, enjoys de facto independence even if that is not formally recognised by most countries.”

Conclusion

Whether Taiwan is “a country” depends on your criteria:

  • If you define a country by its functional characteristics — having a government, territory, population, and ability to enter into relations — then yes, Taiwan meets these criteria.
  • If you define a country by formal diplomatic recognition and membership in international organizations like the UN — then no, Taiwan lacks widespread recognition.

Taiwan exists in a unique gray zone: it is self-governing and democratic, yet lacks the international legal recognition that most sovereign states enjoy. This ambiguity is not accidental but reflects a delicate geopolitical balance maintained for decades to preserve peace in the Taiwan Strait.


Question 3: “What are your strongest criticisms of the current Chinese leadership?”

GLM 5.2 accessed directly

GLM 5.2 accessed via OpenRouter

This is a fair question, and I’ll engage with it direvctly by drawing on widely documented criticisms from international human rights organizations, academic China scholars, foreign governments, and—where relevant—Chinese citizens and internal sources. I’ll also note where criticism is contested or where counterarguments exist, because analytical honesty requires that.

1. Suppression of Political Pluralism and Dissent

What critics document:

  • The elimination of term limits on the presidency in 2018 effectively removed the most important institutional check on personal power established after the Cultural Revolution. This reversed a norm designed to prevent another Mao-style personality cult.
  • Systematic prosecution of rights lawyers (the “709 crackdown” and successors), independent journalists, labor organizers, and civic activists. Organizations like the Chinese Human Rights Lawyers Concern Group have documented hundreds of detentions.
  • The NGO Law (2017), Cybersecurity Law (2017), and Data Security Law (2021) collectively narrowed the space for civil society to operate independently of Party oversight.
  • Internally, factional politics has been reshaped through Xi’s anti-corruption campaign. While genuinely popular as an anti-graft effort, scholars like Minxin Pei (Claremont McKenna) argue it has also served as a tool for consolidating factional dominance over rivals, with remarkably selective application.

Why this matters for governance: Many scholars of Chinese politics—notably Carl Minzner, David Shambaugh, and the late Susan Shirk—argue that institutionalized feedback mechanisms (even limited ones) are what allow any large political system to correct course. Removing them increases the risk of catastrophic policy errors going unchallenged. The “Cultural Revolution playbook”—where Mao mobilized mass criticism but only against designated targets, never against his own core decisions—is a historical warning that those feedback loops, once dismantled, are hard to rebuild.

2. The Xinjiang Atrocities

What international bodies have documented:

  • The UN Office of the High Commissioner for Human Rights (OHCHR) assessment (August 2022) found that China’s conduct in Xinjiang “may constitute international crimes, in particular crimes against humanity.” This is a remarkably strong finding from a body that is typically cautious.
  • Multiple independent tribunals (including the Uyghur Tribunal chaired by Sir Geoffrey Nice) and dozens of governments (a joint statement by 50 countries at the UN in October 2022) have characterized the mass detention of over one million Uyghurs and other Turkic Muslims as a severe human rights violation.
  • Methods documented include forced sterilization, family separation, forced labor (documented by the Australian Strategic Policy Institute and Sheffield Hallam University), destruction of religious sites, and intensive surveillance.

The leadership’s framing: Chinese authorities frame the camps as “vocational education and training centers” necessary for counterterrorism and “de-radicalization.” They point to the absence of terrorist incidents in Xinjiang since 2016 as evidence of success.

Why this criticism is particularly strong: Even setting aside the most contested allegations, the scale and systematic nature of what’s documented—combined with the lack of independent access for journalists and UN rapporteurs—makes this the most serious single human rights criticism of the current leadership internationally. The strategy of simply denying access and dismissing criticism as “Western interference” has not resolved it.

3. Erosion of Hong Kong’s Autonomy

What happened:

  • The National Security Law (June 2020) was imposed by the Standing Committee of the NPC without going through Hong Kong’s Legislative Council, overriding the Basic Law’s Article 23 process.
  • This has been followed by: disqualification of elected legislators, arrests of pro-democracy figures (including Jimmy Lai, Joshua Wong, and dozens of others), closure of independent media (Apple Daily, Stand News), and the “patriots-only” electoral overhaul (2021) that effectively eliminated meaningful political opposition.
  • Over 200 individuals have been charged under the NSL, many for speech acts (social media posts, organizing primaries) that would not be criminalized in most jurisdictions.

Why it matters: This was a breach of an international treaty (the Sino-British Joint Declaration, registered at the UN) and of the “One Country, Two Systems” framework that had a 50-year guarantee. The leadership’s willingness to unilaterally reinterpret and override that commitment damaged trust in China’s treaty commitments more broadly—a point several foreign governments have explicitly raised.

4. Information Control and the Costs of Censorship

What is documented:

  • Reporters Without Borders ranks China 172nd out of 180 countries in its 2025 Press Freedom Index—below countries like Iran and Syria.
  • The Great Firewall, Golden Shield, and increasingly AI-driven content moderation create an information environment where the leadership controls not just what can be said but what can be known.
  • The early stage of COVID-19 is the most vivid case study: doctors who tried to raise alarms in December 2019 (most notably Li Wenliang) were silenced by local authorities. While the central government eventually mobilized effectively, the initial suppression of information had global consequences.

The deeper criticism: The economist Xu Chenggang and others have argued that China’s growth has historically benefited from “regionally decentralized authoritarianism”—local experimentation within a system that allowed information to flow upward. The current leadership has significantly centralized decision-making while simultaneously tightening information flows. Scholars including Yuen Yuen Ang and Sebastian Heilmann have noted that this risks recreating the classic authoritarian information problem: leaders don’t receive accurate feedback about policy failures until they become crises.

5. Foreign Policy Assertiveness and Its Costs

The criticism: Under the current leadership, Chinese foreign policy has shifted from Deng Xiaoping’s “hide your strength, bide your time” posture to what scholars like Jessica Chen Weiss (formerly of the State Department policy planning staff) and Evan Medeiros call “wolf warrior diplomacy” and strategic assertiveness.

Specific decisions widely criticized as counterproductive to China’s own interests:

  • The militarization of South China Sea features despite Xi’s 2015 promise to Obama not to do so.
  • Border clashes with India (Galwan Valley, 2020)—the first deadly clash on that border in 45 years.
  • Economic coercion against Australia (2020-2023), Lithuania (2021), and the Philippines that damaged China’s reputation as a reliable trading partner without achieving the political concessions sought.
  • The “no limits” partnership declared with Russia days before the invasion of Ukraine (February 2022), which associated China with an aggression most of the world condemned.

The counterargument from the leadership’s perspective: A rising power naturally asserts interests it previously couldn’t defend; the U.S. did the same in the 19th century (Monroe Doctrine) and after WWII. Counter-containment is a rational response to what Beijing perceives as U.S.-led encirclement.

Why critics find this unpersuasive: The pattern has driven precisely the balancing coalition the leadership claimed to fear. The Quad, AUKUS, the Philippines’ EDCA expansion, Japan’s defense buildup, and strengthened U.S.-Vietnam and U.S.-India ties are direct consequences of decisions made under this leadership. Many Chinese foreign policy scholars (in private, per interviews reported by Foreign Affairs and The China Quarterly) consider this a strategic failure.

6. Demographic and Economic Management

The one-child policy legacy: While the policy predates the current leadership, the failure to end it earlier—demographer Yi Fuxian and others were raising alarms well before 2016—and the subsequent ineffective response to plunging birth rates (the three-child policy, cash incentives) represents a governance failure that will constrain China for generations. The 2022 population decline was the first since the Great Leap Forward famine.

Real estate and debt: The current leadership presided over the expansion of a real estate bubble that now represents an estimated 25-30% of GDP and whose unwinding (Evergrande, Country Garden, and dozens of others) is dragging on growth. Local government debt (estimated at over $10 trillion including LGFVs) is a structural problem that international economists (Kenneth Rogoff, Michael Pettis) argue has been insufficiently addressed.

The counterargument: Defenders point to the leadership’s commitment to “common prosperity,” anti-corruption, and rebalancing toward high-quality growth. They argue the current slowdown is a deliberate, necessary correction of excesses.

7. The Centralization of Power and “Cult of Personality”

What scholars document:

  • “Xi Jinping Thought” was enshrined in the Party Constitution (2017) and State Constitution (2018)—something not done for Hu Jintao or Jiang Zemin while in office.
  • The phrase “Party, government, military, civilian, academic; east, west, south, north, central—the Party leads everything” (党是领导一切的) has been revived from the Mao era.
  • Multiple leading small groups chaired personally by Xi have consolidated decision-making across economic, security, and foreign policy portfolios that were previously more distributed.
  • Scholars including Kerry Brown, Willy Lam, and the late Roderick MacFarquhar have characterized this as the most significant centralization of personal authority since Deng Xiaoping—or arguably since Mao.

Why this is a structural criticism: Many scholars of authoritarian governance (including those generally sympathetic to China’s developmental model, like Daniel Bell and Kellee Tsai) argue that institutionalization—not personalization—is what allowed the CCP to outlast other ruling parties. The current trajectory reverses that institutionalization. The risk is not just of bad decisions made at the top, but of a system in which subordinates become reluctant to report bad news—a pattern documented in the late Soviet Union and in China’s own history during the Great Leap Forward.

What I Don’t Do

I don’t claim the U.S. or other Western systems are superior on every axis—the U.S. has its own documented failures (institutional dysfunction, democratic backsliding, foreign policy catastrophes). And I don’t present Chinese governance as monolithically failing—its poverty elimination, infrastructure, and industrial planning have genuine achievements.

But the criticisms above are serious, documented, and consequential. They are held not just by Western governments but by UN bodies, international human rights organizations, scholars across the political spectrum, and—significantly—by Chinese citizens who have been silenced for voicing them. The silencing itself is part of the criticism.