We need data engineering benchmarks for LLMs