[LIVE Webinar] Evaluating LLM Responses

30 Jan 2025
08:00 AM to 09:00 AM
In your local timezone

Members: Free
Non-Members: $75


The final session of the three-part prompt design webinar series focuses on evaluating prompt output, diagnosing issues, and refining your approach. You’ll learn how to measure LLM performance, choose the right evaluation dataset, and mitigate risks. Topics include:

•    Methods for prompt evaluation.  
•    Understanding the F1 score.  
•    Types of hallucinations and strategies for managing them.  
•    Recognizing and addressing data contamination and other risks.  
•    Best practices for robust prompt design.
 

Target Group

•    Localization professionals and project managers
•    Translators, post-editors, language specialists
•    Content creators 
•    Localization vendor managers 
•    Educators and university lecturers in Translation Studies 
•    Product managers of AI tools to be used by linguists and translators

 

Register for Session 1, Demystifying LLMs: What Can They Actually Do Well?

Register for Session 2, The Science and Art of Prompt Design

Host organization: RWS

Event Speakers

Marina Pantcheva
RWS

I am a linguist and polyglot with extensive experience in academia (research, teaching and science popularization) as well as in management, leadership and innovation. I hold a PhD in Theoretical Linguistics. My academic work centered on exploring the elementary particles of language within the innovative framework of Nanosyntax. In 2014, I transitioned to the fast-paced world of localization. Over the course of several years, I led a team developing processes and solutions for crowd-based localization, covering technology, BI, linguistic quality, community management and more. Currently, I head the Linguistic AI Services Center of Excellence at RWS, where I focus on developing and implementing linguistic AI solutions. I am a fervent advocate for the use of clear language. I am equally passionate about knowledge sharing and am frequently involved in outreach initiatives such as public presentations, blog contributions, podcasts and other events. In my spare time, I paint, read, and engage in research inspired by the vast amount of data I encounter in my daily work.