You're Not Testing Your AI Well Enough