I work on high-performance computing, focusing primarily on accelerated computing and programming models for heterogeneous architectures. My research helps HPC developers write speed-of-light applications and kernels that take advantage of modern hardware capabilities without making them want to pull their hair out.
My interests mostly lie at the intersection of hardware and software for accelerated computing. To this end, I have worked at several hardware shops, including Arm, Cerebras Systems, and Nvidia, on hardware modeling, high-performance kernels, and library design. The good folks at Oak Ridge National Laboratory’s OLCF remain my longtime collaborators on application-level projects.
I am happy to announce that I will be joining Nvidia Research for an extended internship starting January 2022!
Supercomputing 2020. ACM Gordon Bell award finalist.