High-Throughput LLM Inference Engine