Publications

(2026). SonicMoE: Accelerating MoE with IO and Tile-aware Optimizations. ICLR'26.

(2026). Mitigating Non-IID Drift in Zeroth-Order Federated LLM Fine-Tuning with Transferable Sparsity. ICLR'26.

(2025). Zeroth-Order Fine-Tuning of LLMs with Transferable Static Sparsity. ICLR'25.

(2024). Ranking with Slot Constraints. KDD'24.

(2023). Coordinating Distributed Example Orders for Provably Accelerated Training. NeurIPS'23.

(2022). GraB: Finding Provably Better Data Permutations than Random Reshuffling. NeurIPS'22.

(2022). MCTensor: A High-Precision Deep Learning Library with Multi-Component Floating-Point. HAET workshop at ICML'22.

(2023). Assessing the Efficacy of Large Language Models in Generating Accurate Teacher Responses. BEA workshop at ACL'23.
