The serving technique that processes multiple requests simultaneously to maximize GPU utilization
4 views