Experience

  1. Graduate Research Assistant (Federated Learning Systems)

    University of California, Merced
    • Designed a scalable and fault-tolerant modeling and simulation framework for complex federated learning (FL) workflows across cross-silo and cross-device scenarios.
    • Developed a discrete-event simulator on SimGrid to model FL training, communication, and aggregation with heterogeneous compute and network resources.
    • Implemented a communication-centric state machine supporting synchronous, asynchronous, and semi-asynchronous aggregation.
    • Built a user-space, MPI-level long-haul RDMA simulator to compare RDMA vs. TCP/IP for geo-distributed FL workloads, and verified on ESNet testbed.
  2. Graduate Research Assistant (LLM Reasoning Models on HPC)

    University of California, Merced
    • Characterized inference and distillation performance of large reasoning models on HPC-scale GPU clusters and interconnects.
    • Scaled model deployment up to 3,840 GPUs using data/tensor/pipeline/expert parallelism, prefill-decode disaggregation, and KV-cache transfer engines.
    • Analyzed pipeline imbalance, communication overhead, and KV-cache bottlenecks to identify efficient configurations for scalable inference.
    • Tools: Python, C/C++, CUDA, MPI/NCCL, DeepSpeed/Megatron, vLLM, VeRL, Ray; profiling with PyTorch Profiler and Nsight Systems.
  3. Software Engineer Intern (GUI Development)

    NXP Semiconductors
    GUI development internship.
  4. Software Engineer Intern (iOS Backend Development)

    Tianjin University Software Studio (TWT)
    iOS backend development internship.

Education

  1. Ph.D. in Electrical and Computer Engineering

    University of Florida
    Advisor: Prof. Xiaoyi Lu
  2. Ph.D. in Electrical Engineering and Computer Science

    University of California, Merced
    Advisor: Prof. Xiaoyi Lu
    Transferring to the University of Florida in January 2026.
  3. B.E. (with Honors), Computer Science and Technology

    Tianjin University
Skills
Programming
Python
C/C++
CUDA
Systems & Tools
MPI / NCCL
PyTorch
DeepSpeed / Megatron
vLLM / Ray
SimGrid
Profiling (PyTorch Profiler, Nsight Systems)
Awards
Student Travel Grant (Q-CORE @ QCE 2025)
QCE 2025 ∙ August 2025
B.E. with Honors
Tianjin University ∙ July 2024
Talent Student in Scientific and Technological Research
Tianjin University ∙ September 2023
The 7th Undergraduate Integrated Circuits Innovation and Entrepreneurship Competition
Competition ∙ June 2023
Member, Undergraduate Academic Committee
Tianjin University ∙ September 2020
September 2020 - July 2021