Factlen ExplainerPro-Social AIExplainerJun 25, 2026, 2:36 AM· 6 min read

How 'Bridging Algorithms' Are Curing the Toxic Comment Section

After years of shutting down comment sections due to toxicity, publishers are using a new generation of AI tools that actively reward nuance, curiosity, and reasoning over outrage.

By Factlen Editorial Team

Share this story

Civic Technologists 35%Publishers & Moderators 30%Community Managers 20%Industry Analysts 15%

Civic Technologists: Focus on building tools that foster mutual understanding and scale constructive dialogue.
Publishers & Moderators: Focus on the practical benefits of reducing moderation costs and safely re-engaging their audiences.
Community Managers: Focus on brand protection, filtering spam, and highlighting genuine customer interactions.
Industry Analysts: Focus on the broader implications of AI moderation on the future of the digital public square.

What's not represented

· Free speech absolutists who oppose any form of algorithmic curation
· Everyday readers whose comments are incorrectly flagged by the AI

Why this matters

For years, toxic comment sections drove publishers to shut down the internet's public squares, leaving readers without a voice. The deployment of 'pro-social' AI is finally reversing this trend, allowing newsrooms to safely reopen comment boards and reward thoughtful dialogue over outrage.

Key points

Publishers are reopening comment sections using AI tools that reward constructive dialogue.
New 'bridging algorithms' score comments for reasoning, curiosity, and personal storytelling.
Instead of just deleting bad words, the AI re-ranks discussions to put the most nuanced comments at the top.
Readers report that AI-sorted conversations feel significantly more trustworthy and respectful.
The technology evaluates the structure of an argument, not the underlying political ideology.

2 billion

Comments processed daily by Perspective API

Increase in NYT articles open to comments

0.8

Example toxicity probability score

For much of the internet’s history, the comment section was widely regarded as a lost cause. By the mid-2010s, major publishers like NPR and Vice began shutting down their article comment boards entirely, citing a toxic environment that had become impossible to manage. The sheer volume of spam, harassment, and outrage outpaced the capacity of human moderators, while the blunt keyword filters of the era proved woefully inadequate. If a reader used a banned word in a thoughtful critique, the comment was deleted; if a troll used polite vocabulary to deliver a vicious personal attack, the system let it through. The public square of the internet was effectively boarded up, pushing audience engagement onto third-party social media platforms where publishers had no control over the data or the discourse.[4]

Today, that trend is rapidly reversing, driven by a new generation of artificial intelligence designed not just to police bad behavior, but to actively reward good citizenship. Rather than relying on simple blocklists, modern moderation tools utilize large language models capable of understanding context, sentiment, and nuance. This technological leap has allowed the industry to shift its focus from a defensive posture—asking 'what should we hide?'—to a proactive one, asking 'what should we highlight?' By identifying the hallmarks of healthy conversation, these systems are allowing newsrooms to reopen their digital doors and rebuild direct relationships with their readers.[7][8]

At the forefront of this shift is Jigsaw, a Google subsidiary that launched the Perspective API in 2017. Originally, Perspective was designed to score the 'toxicity' of a comment, assigning a probability that a human reader would find a given phrase abusive or insulting. For example, a blatant insult might receive a toxicity score of 0.8, automatically flagging it for removal. The tool was massively successful, eventually processing nearly two billion comments a day across 18 languages for platforms ranging from Reddit to major daily newspapers. However, Jigsaw’s researchers soon realized that simply deleting the worst comments did not inherently create a productive environment. The absence of toxicity is not the same thing as the presence of value.[1][5]

How modern AI moderation evaluates the structure of an argument rather than just scanning for banned words.

To address this, Jigsaw introduced a suite of 'experimental bridging attributes' to the Perspective API. Instead of scanning for abuse, these new classifiers scan for traits that correlate with constructive dialogue: reasoning, curiosity, and the sharing of personal stories. The AI evaluates whether a commenter is attempting to understand another viewpoint, providing evidence for their claims, or showing deference to other users. When newsrooms apply these bridging attributes, they can fundamentally alter the architecture of the comment section. Instead of sorting responses chronologically—which often allows the loudest or earliest voices to dominate—platforms can re-rank the discussion so that the most nuanced and empathetic comments appear at the very top.[2][6]

The results of this re-ranking have been striking. In studies conducted by Jigsaw, readers who interacted with comment sections sorted by constructiveness and curiosity reported that the conversations felt significantly less hostile. More importantly, they rated the discussions as more informative, respectful, and trustworthy. By changing the incentive structure, the algorithm subtly shapes user behavior. When commenters see that thoughtful, well-reasoned arguments are rewarded with premium visibility, they are more likely to emulate that tone, creating a virtuous cycle of engagement that drowns out the remaining trolls.[2][6]

More importantly, they rated the discussions as more informative, respectful, and trustworthy.

This algorithmic approach is complemented by open-source initiatives like The Coral Project. Originally launched as a collaboration between the Mozilla Foundation, The New York Times, and The Washington Post, Coral was built specifically to help journalists foster meaningful conversations without draining newsroom resources. Coral’s primary tool, the 'Talk' platform, integrates AI moderation with behavioral nudges. For instance, if a user types a comment that the AI flags as potentially toxic, the system can pause the submission and display a warning, giving the user a chance to revise their language before posting. This friction point alone prevents a significant amount of heat-of-the-moment vitriol from ever reaching the public feed.[3][4]

The business case for these pro-social algorithms has proven impossible for publishers to ignore. By automating the most grueling aspects of moderation, newsrooms can scale their community engagement without linearly scaling their headcount. The New York Times, an early adopter of these advanced AI tools, was able to triple the number of articles it opened to reader comments. For publishers, hosting these conversations natively on their own websites is highly lucrative; engaged commenters spend more time on site, view more pages, and are significantly more likely to convert into paying subscribers than passive readers.[1][4]

Readers report significantly higher trust and respect when comment sections are sorted by constructiveness.

Despite the clear benefits, the deployment of AI in public discourse inevitably raises questions about censorship and bias. Critics often worry that an algorithm designed to reward 'constructiveness' might inadvertently suppress dissenting opinions or sanitize passionate political debate. However, developers emphasize that bridging algorithms are trained to evaluate the structure and tone of a comment, not its underlying ideology. A fierce critique of a government policy that relies on structured reasoning will score highly, while a low-effort, name-calling endorsement of that same policy will be demoted. The goal is not to mandate agreement, but to elevate the quality of the disagreement.[6][8]

The mechanics of these platforms are designed to be as transparent as possible. When a newsroom deploys a system like Coral’s Talk platform, the community guidelines are placed front and center, directly above the text box. This simple UI decision, backed by academic research, serves as a psychological primer for the user. Furthermore, the system allows journalists to easily highlight and pin exceptional contributions, creating a 'featured comments' pane that serves as a model for the rest of the community. By visibly rewarding high-quality input, publishers establish a cultural norm that newcomers are implicitly pressured to follow.[3][4]

Beyond traditional journalism, the success of these bridging algorithms is beginning to influence the broader architecture of the internet. Social media platforms, long criticized for algorithms that prioritize outrage to maximize user retention, are closely monitoring the deployment of pro-social AI. If tools like the Perspective API can prove that constructive, nuanced conversations actually lead to longer session times and deeper user loyalty, the financial incentives of the entire social web could begin to shift. The transition from 'engagement at all costs' to 'quality engagement' represents a potential turning point in the history of digital media.[5][6]

Community managers can now rely on AI to highlight genuine customer interactions and flag high-quality contributions.

Ultimately, the resurrection of the comment section is about more than just software; it is about reclaiming the democratic promise of the internet. For years, the loudest and most toxic voices were allowed to hold the digital public square hostage, convincing publishers that everyday readers simply could not be trusted to interact civilly. By leveraging artificial intelligence to filter out the noise and amplify the thoughtful majority, newsrooms are proving that the internet is not inherently toxic. When given the right tools and the right environment, people are still capable of listening, reasoning, and connecting with one another.[7][8]

How we got here

2015
The Coral Project is launched as a collaboration between Mozilla, The New York Times, and The Washington Post to rethink audience engagement.
2017
Google's Jigsaw launches the Perspective API to help moderators automatically identify toxic language.
2024
Jigsaw introduces 'bridging attributes' to Perspective API, shifting the focus to actively rewarding nuance and curiosity.
2025–2026
Contextual AI moderation tools become standard across the publishing industry, allowing major outlets to reopen comment sections at scale.

Viewpoints in depth

Civic Technologists' View

Prioritizing the structural health of online discourse through algorithmic incentives.

Researchers and developers in this camp argue that the internet's toxicity problem is largely a design failure. By optimizing purely for engagement, early algorithms naturally surfaced outrage. Civic technologists believe that by retraining large language models to recognize and reward 'bridging attributes'—such as intellectual humility and personal storytelling—platforms can artificially engineer a more empathetic public square. They view AI not as a censor, but as a digital facilitator that nudges human behavior toward its better angels.

Publishers' View

Balancing the desire for audience engagement with the economic realities of moderation.

For newsrooms, the comment section has always been a double-edged sword. While highly engaged readers are the most likely to buy subscriptions, the labor costs associated with manually deleting spam and hate speech forced many outlets to abandon native comments entirely. Publishers view pro-social AI primarily as an economic lifeline. By automating the heavy lifting of moderation and surfacing high-quality contributions, these tools allow media companies to reclaim their audiences from third-party social networks without bankrupting their editorial budgets.

Community Managers' View

Protecting brand integrity and ensuring safe spaces for genuine customer interaction.

Professionals tasked with managing brand pages and online communities focus on the immediate, practical applications of contextual AI. For them, the value lies in the algorithm's ability to distinguish between a frustrated customer who needs urgent support and a bad-faith troll attempting to derail a thread. This camp emphasizes that effective moderation is about establishing clear house rules and enforcing them consistently, using AI as an 'always-on' security guard that prevents public relations crises before they ignite.

What we don't know

How effectively these bridging algorithms will scale across non-English languages with complex cultural nuances.
Whether highly motivated bad actors will eventually find ways to 'game' the constructiveness metrics using AI-generated text.
The long-term impact of AI-curated public squares on the broader polarization of society outside of moderated platforms.

Key terms

Bridging Attributes: AI metrics that detect qualities like reasoning, curiosity, and personal storytelling to identify comments that build mutual understanding.
Pro-Social Algorithm: A sorting system designed to elevate content that fosters healthy community interaction, rather than optimizing purely for engagement or outrage.
Large Language Model (LLM): Advanced AI systems capable of understanding the context, sentiment, and nuance of human text, rather than just scanning for banned keywords.
Contextual Moderation: The practice of evaluating a comment based on its intent and surrounding discussion, rather than relying on blunt keyword filters.

Frequently asked

Does AI moderation mean opinions are being censored?

No. These tools are designed to evaluate the tone and structure of a comment, not its underlying ideology. A well-reasoned critique is rewarded, while low-effort abuse is demoted, regardless of the political viewpoint.

Why did so many news sites remove comments in the first place?

In the 2010s, the manual labor required to delete spam and toxic abuse became too expensive for most publishers, leading outlets like NPR to shut their sections down entirely.

How does the AI know what 'curiosity' looks like?

The models are trained on thousands of human-labeled examples where commenters ask genuine questions, share personal experiences, or acknowledge another person's point before disagreeing.

Can users bypass these AI filters?

While no system is perfect, modern large language models understand context and nuance, making it much harder for trolls to bypass filters by simply misspelling banned words or using polite vocabulary to deliver insults.

Sources

[1]JigsawCivic Technologists
Supporting Online Conversations at Scale with AI
Read on Jigsaw →
[2]MediumCivic Technologists
Perspective API introduces experimental bridging attributes
Read on Medium →
[3]MozillaPublishers & Moderators
The Coral Project joins Vox Media to grow its journalism tools
Read on Mozilla →
[4]Columbia Journalism ReviewPublishers & Moderators
What is the comment box for? The Coral Project aims to find out
Read on Columbia Journalism Review →
[5]ZDNetCommunity Managers
Google's Jigsaw releases new tools to address negative online dialogue
Read on ZDNet →
[6]Tech and Social CohesionCivic Technologists
Jigsaw's Perspective API promotes positive interaction
Read on Tech and Social Cohesion →
[7]FeedGuardiansCommunity Managers
The Top AI Comment Moderation Tools for 2025
Read on FeedGuardians →
[8]Factlen Editorial TeamIndustry Analysts
Synthesis by Factlen editorial team
Read on Factlen Editorial Team →

Stay informed

Every angle. Every day.

Get perspectives stories with full source coverage and perspective breakdowns delivered to your inbox.

Get the briefing →Browse perspectives