Publications

2024

  1. lynx_preview.png
    Lynx: Enabling Efficient MoE Inference through Dynamic Batch-Aware Expert Selection
    Vima Gupta, Kartik Sinha, Ada Gavrilovska, and 1 more author
    arXiv preprint, arXiv:2411.08982, 2024