Talks

2025

November 2025

Towards Native Provenance at University of Washington

October 2025

BeyondWeb: Google Multilinguality Group

September 2025

Guest Lecture at CS294, UC Berkeley on Synthetic Data for Pretraining (Course WebPage)

July 2025

MAGIC: Diffusion Model Memorization Auditing via Generative Image Compression @ MemFM, ICML'25

July 2025

Unlocking Post-hoc Dataset Inference with Synthetic Data @ Dig-BUGS, ICML'25

July 2025

Panelist @ FPF Technologist Roundtable Series for Policymakers on "Machine Unlearning"

April 2025

Dataset Inference, Unlearning, and Memorization Report Cards @ OpenAI

March 2025

Safety Pretraining @ Schmidt Sciences Safety Convention

2024

October 2024

Mentorship Panel at COLM @ Penn-MLR

September 2024

Guest Lecture on Data Curation @ CMU-10605

August 2024

LLM Dataset Inference @ Private-NLP, ACL 2024

August 2024

Rethinking Memorization with Adversarial Compression @ CONDA Workshop, ACL 2024 (Best Paper Talk)

August 2024

LLM Dataset Inference @ Google Privacy Seminar (YouTube Link)

2023

2022

June 2022

Characterizing Datapoints via Second-split Forgetting @ SCIS ICML 2022