This role sits at the intersection of research rigor and product impact. You'll diagnose failure patterns in AI-generated presentations, docs, and websites, then craft targeted improvements through iterative experimentation. You'll build the tools and workflows that enable rapid testing, validate changes against quality benchmarks, and ensure our AI gets smarter with every iteration. If you're obsessed with output quality and love the challenge of making AI systems work beautifully at scale, this is your role.