Building KooBERT
Singhal joined Koo as Senior Director and Head of Machine Learning in 2021, taking over a team of three engineers, and scaled that team to twenty over his tenure, creating a multidisciplinary group spanning data science, machine learning engineering, and MLOps. His most consequential technical contribution was leading the development of KooBERT, an open-source multilingual transformer model built specifically for Indian-language content. General multilingual models existed at the time, but they had been trained on formal text and performed poorly on the code-mixed, transliterated, script-variable content that characterized Koo's users. KooBERT was engineered to handle those patterns directly, covering more than 20 languages and serving as the foundation for both moderation and content recommendation across the platform.