Harsh Singhal: The AI Engineer Who Taught Machines To Understand Hate Speech In 20 Languages

Published on: 12 June 2026 4:58 pm

Harsh Singhal built KooBERT, a groundbreaking multilingual transformer that detects hate speech and toxicity across 20+ languages, transforming content moderation, safety, and personalization on India’s Koo platform and beyond.

Nexa Desk

In June 2022, Global Witness and Foxglove submitted 20 test advertisements to Facebook ahead of Kenya's national elections. The ads contained explicit hate speech drawn from real-life examples, calling for ethnic violence, rape, and beheadings. Facebook approved them. All 20, in both Swahili and English, passed through the platform's automated moderation systems without being flagged.

It was the third time Global Witness and Foxglove had run similar tests on Facebook, following earlier investigations in Myanmar and Ethiopia that produced comparable results. The pattern across all three was consistent: a platform with billions of users and significant moderation resources continued to fail in non-English linguistic environments where the cultural context, dialectal variation, and script complexity of user content fell outside what its automated systems had been built to handle.

A 2026 Tech Policy Press analysis noted that despite years of industry attention to the problem, the multilingual AI gap had largely been rebranded rather than resolved, with expanded language coverage masking the fact that most AI systems still lacked genuine governance capability across the world's linguistic range. Harsh Singhal spent two years building a more serious answer to that problem, at scale, in India, on a platform where the linguistic complexity was among the highest any social network had ever tried to govern.

What Made India Different

Koo launched in 2020 as a multilingual social platform built to serve Indian users in their own languages, reaching approximately 60 million users by late 2022. That growth put immediate pressure on a content moderation infrastructure that, like most in the world, had been built for English. Research on Indian social media has consistently shown that code-mixing, the blending of multiple languages within a single post, is the dominant mode of online communication for hundreds of millions of users across the country. A Hindi speaker on Koo might write in Devanagari script, in romanized transliteration, in a blend of Hindi and English within the same sentence, or in any combination of these. Many Indian languages also follow a subject-object-verb word order, which inverts the grammatical patterns that English-trained models rely on to parse meaning and detect hostile intent.
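To make the script-variation problem concrete, here is a minimal Python sketch that labels each word of a post by its writing system. It is an illustration only, not a description of how Koo or KooBERT handled the problem; the function names (char_script, token_script, script_profile, is_script_mixed) and the sample posts are hypothetical, and only Devanagari and Latin are distinguished.

```python
import unicodedata
from collections import Counter
from typing import List, Tuple


def char_script(ch: str) -> str:
    """Coarse script label for one character, based on its Unicode name."""
    name = unicodedata.name(ch, "")
    if name.startswith("DEVANAGARI"):
        return "DEVANAGARI"
    if name.startswith("LATIN"):
        return "LATIN"
    return "OTHER"  # digits, punctuation, emoji, other scripts


def token_script(token: str) -> str:
    """Majority script of a whitespace-delimited token, ignoring OTHER."""
    counts = Counter(char_script(ch) for ch in token)
    counts.pop("OTHER", None)
    if not counts:
        return "OTHER"
    return counts.most_common(1)[0][0]


def script_profile(post: str) -> List[Tuple[str, str]]:
    """Pair every token in the post with its majority script."""
    return [(tok, token_script(tok)) for tok in post.split()]


def is_script_mixed(post: str) -> bool:
    """True when the post contains tokens from more than one script."""
    scripts = {script for _, script in script_profile(post)} - {"OTHER"}
    return len(scripts) > 1


# Hypothetical sample posts: the same sentence in mixed script and romanized.
mixed_post = "यह movie बहुत अच्छी thi"
romanized_post = "yeh movie bahut acchi thi"

print(script_profile(mixed_post))
print(is_script_mixed(mixed_post))
print(is_script_mixed(romanized_post))
```

The romanized post is still a blend of Hindi and English, yet at the script level it is indistinguishable from plain English. Script detection alone therefore cannot identify code-mixing, which is one reason the problem calls for models trained on this kind of text.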

Building KooBERT

Singhal joined Koo as Senior Director and Head of Machine Learning in 2021, taking over a team of three engineers, and scaled that team to twenty over his tenure, creating a multidisciplinary group spanning data science, machine learning engineering, and MLOps. His most consequential technical contribution was leading the development of KooBERT, an open-source multilingual transformer model built specifically for Indian-language content. General multilingual models existed at the time, but they had been trained on formal text and performed poorly on the code-mixed, transliterated, script-variable content that characterized Koo's users. KooBERT was engineered to handle those patterns directly, covering more than 20 languages and serving as the foundation for both moderation and content recommendation across the platform.

Mayank Bidawatka, Co-founder of Koo, described the significance of Singhal's contribution. "His technical vision and leadership not only advanced the state of multilingual AI and content safety but also left a lasting legacy in India's digital transformation," Bidawatka said, "demonstrating how responsible AI can empower local communities while setting new standards for scalable, ethical technology in social networking."

Alongside KooBERT, Singhal led the early adoption of Meta's LLaMA models, fine-tuned for multilingual toxicity detection, making Koo one of the first social platforms globally to deploy fine-tuned large language models for real-time safety applications. Deploying LLMs for real-time moderation at social media latency, across ten languages simultaneously, required infrastructure that did not exist off the shelf, and building it meant accepting operational overhead that most teams were unwilling to take on at that stage. "Fine-tuned LLMs for real-time content moderation was well ahead of where the industry consensus was at that point," Singhal said. "The inference latency requirements were tight, the operational overhead was significant, and a lot of smart people thought the complexity outweighed the benefit. We looked at what the alternatives could actually do in our language environment and concluded we needed something better."

Beyond Moderation

The AI systems Singhal's team built at Koo did more than remove harmful content. The same multilingual language understanding that powered moderation also powered discovery, and under his leadership the team built Semantic Search, Multilingual Topics, Feed Ranking, Content Recommendations, People You May Know, and Trending Tags across all supported languages, personalization capabilities that most platforms had never attempted at this scale in Indian languages. Press coverage at the time of the Topics launch across 10 Indian languages cited Singhal directly, and a subsequent Business World report covered the feature as evidence that AI-powered multilingual personalization could be built to work in production for vernacular audiences at real scale.

The platform's multilingual safety capabilities also attracted recognition beyond India. Content moderation work extended to Portuguese-language content in Brazil, where the platform had a growing user base, adding another layer of cross-linguistic complexity to systems already operating across ten Indian languages.

A Technical Legacy That Outlasts the Platform

Koo shut down in July 2024, following the resolution of the regulatory disputes that had originally accelerated its growth. KooBERT remains open-source. The methodologies Singhal's team developed for multilingual content understanding, combining transformer architectures with code-mixing awareness, cross-script normalization, and fine-tuned LLMs for real-time safety, advanced the technical state of the art in a domain where most of the industry had accepted English-centric tools as the default. In a country with over 750 million internet users communicating across dozens of languages, building AI systems capable of understanding what people are actually saying was among the most consequential engineering problems in the Indian technology sector, and Singhal's work at Koo stands as one of the most thorough attempts to solve it properly.

The above information does not belong to Outlook India, and Outlook India was not involved in the creation of this article.
