Paper-Conference

Characterization, modeling, and verbs-level emulation of long-haul RDMA with implications for federated learning.

Jan 1, 2026

Fault-tolerant LLM pretraining system for 100k+ GPU scale using stacked parallelism and adaptive reordering.

Jan 1, 2026

Scalable federated learning via memory-efficient and concurrent aggregation.

Jan 1, 2026

Characterization of R1-like large reasoning models on HPC-scale GPU clusters and interconnects.

Jan 1, 2025

Discrete-event based performance simulation for federated learning systems across heterogeneous compute/network settings.

Jan 1, 2025

Communication-centric study of long-haul RDMA for geo-distributed federated learning.

Jan 1, 2025

Conference paper on deep learning techniques for EEG-based brain-computer interfaces (BCI).

Oct 1, 2023