Data Annotation for Indian Languages: Benchmarking, Standards, and Best Practices

27 Mar 2025
04:00 AM to 05:00 AM
In your local timezone

This event has expired, video available

To view the recording, you must be logged in with a GALA Member account or have purchased the webinar.

This webinar is organized in collaboration with the Confederation of Interpreting, Translation and Localisation Businesses (CITLoB).

 

Join us for a dynamic session on annotating Indian languages, with a spotlight on Hindi and Urdu!

  • We’ll explore how script differences, dialects, and code-mixing affect quality and consistency.
  • Learn about proven frameworks—like Universal Dependencies—and discover how clear guidelines plus robust QA boost accuracy.
  • See why standardized benchmarks are crucial to evaluating model performance and fueling innovation.
  • Dive into real-world examples, where successful annotation projects have transformed speech recognition, sentiment analysis, and more.
  • Gain insights into tackling common challenges: from selecting the right tools to mitigating bias in multilingual data.
  • Whether you’re a data scientist, project manager, or language expert, you’ll walk away with actionable strategies to enhance your annotation workflows.
  • Expect interactive elements, practical tips, and a forward-looking view on how annotated data can unlock AI’s full potential across India’s diverse linguistic landscape.

 

Host organization: Globalization and Localization Association

Event Speakers

Dr Sahil Chandolia
MoniSa Enterprise

Co-founder and CEO of MoniSa Enterprise Pvt Ltd. 

Monica Mohan
MoniSa Enterprise

Co-founder and COO of MoniSa Enterprise Pvt Ltd. 

Akshay Moolchandani
MoniSa Enterprise

Operations Head of MoniSa Enterprise Pvt Ltd.