2024 Lynx: Enabling Efficient MoE Inference through Dynamic Batch-Aware Expert Selection Vima Gupta, Kartik Sinha, Ada Gavrilovska, and 1 more author arXiv preprint, arXiv:2411.08982, 2024 HTML