Tokasaurus: An LLM inference engine for high-throughput workloads