Talk by Pietro Lesci

20 November 2024, by DS Group

We are very proud to start our seminar series with a very exciting guest speaker. Pietro Lesci from the University of Cambridge will be giving a lecture at our group seminar.

Title: Natural Experiments in NLP and Where to Find Them

When: Wednesday, 20.11.24 at 4 pm

Abstract:
In training language models, training choices—such as the random seed for data ordering or the token vocabulary size—significantly influence model behaviour. Answering counterfactual questions like "How would the model perform if this instance were excluded from training?" is computationally expensive, as it requires re-training the model. Once these training configurations are set, they become fixed, creating a "natural experiment" where modifying the experimental conditions incurs high computational costs. Using econometric techniques to estimate causal effects from observational studies enables us to analyse the impact of these choices without requiring full experimental control or repeated model training. In this talk, I will present our paper, Causal Estimation of Memorisation Profiles (Best Paper Award at ACL 2024), which introduces a novel method based on the difference-in-differences technique from econometrics to estimate memorisation without requiring model re-training. I will also discuss preliminary results from ongoing work that applies the regression discontinuity design to estimate the causal effect of selecting a specific vocabulary size.

Latest articles

16.07.2025|Speakernews

Talk by Michael Hedderich

We are very excited that Michael will be presenting his work in our group.

When: 16.07.25 10-11.30am

Where to join:

The topic of his talk will be published soon :)
Please feel free to join Michaels talk if you are interested!

16.05.2025|Speakernews

Talk by Max Müller-Eberstein

We are more than happy to have Max Müller-Eberstein as a guest in our lab. He will be presenting several exciting projects.

Title: How Language Model Learning Dynamics Shape Social Inclusion

Abstract: It's an exciting time for studying the Machine Learning theory underlying Language Model training...