Kokkos GPU Implementation of CPU-Based BLAS/LAPACK Operations and RandBLAS Randomization (EECS-2025-58)
Rahul Shah and James Demmel

Sparsity-Aware Communication for Distributed Graph Neural Network Training (EECS-2023-253)
Ujjaini Mukhopadhyay

Parallelizing Irregular Applications for Distributed Memory Scalability: Case Studies from Genomics (EECS-2020-133)
Marquita Ellis