I'm an applied ML researcher focused on AI safety, evaluation, and building systems that work reliably at scale. Previously at Hinge (LLM safety evaluation), Apple Podcasts (search + recommendations), and Google (ML evaluation frameworks). Completed BlueDot Impact's Technical AI Safety program in December 2025.
Designed LLM safety evaluation pipelines for an AI-powered dating feature
Built podcast recommendation systems from startup to 500M+ users