I’m a graduate student in the College of Computing at the Georgia Institute of Technology, advised by Dr. Rich Vuduc. I currently work at NVIDIA full time while finishing my PhD. At NVIDIA, I collaborate closely with NVIDIA Research, the compiler and GPU architecture teams, and our customers to lead the design of next-generation linear algebra libraries, namely CUTLASS 3.0, a project I have worked on since its inception.
Throughout my academic and professional career, my research has focused on the intersection of programming models for accelerated computing and performance engineering: in short, making GPUs go brrrr. My work helps HPC developers write speed-of-light applications and kernels that take advantage of modern hardware capabilities, without making them want to pull their hair out.
I was very lucky to get involved at the inception of CUTLASS 3.x, working closely with [Cris Cecka][ccekca] for two years, and I have championed its design and adoption ever since. These days I still work on the CUTLASS C++ project, but in a much broader capacity. Here are some of the things I get to do:
- CUTLASS 3.x and beyond
- Tensor core architecture and PTX / CUDA C++ programming model co-design
- MLIR dialects and compilers for GPU code generation
- Design of new hardware features for future generations of GPUs
- Direct customer assistance for using CUTLASS and targeting tensor cores
- Collaborations with internal and external research teams on publications (FlashAttention-3, fVDB, etc.)
- Staffing, project prioritization and planning, and recruiting
I have also worked at Arm and Cerebras Systems on hardware modeling, high-performance kernels, and library design. The good folks at Oak Ridge National Laboratory’s OLCF were my collaborators on application-level projects during my time at Georgia Tech, and I had the honor of sharing two Gordon Bell Award nominations with them.
News
After interning at NVIDIA for nearly a year and a half, I have decided to join full time as a compute architect in the DL architecture group, on the fast kernels team!
I will continue collaborating with Cris Cecka from NVR PSA to lead the design of the next-generation linear algebra library, CUTLASS 3.0, on which I will also base much of my PhD thesis.
I am happy to announce that I will be joining NVIDIA Research for an extended internship starting January 2022!
Publications
[DBLP]
Conference Papers
FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision.
fVDB: A Deep-Learning Framework for Sparse, Large Scale, and High Performance Spatial Intelligence.
Exaflops Biomedical Knowledge Graph Analytics.
Supercomputing 2022. ACM Gordon Bell award finalist.
Scalable all-pairs shortest paths for huge graphs on multi-GPU clusters.
Scalable knowledge graph analytics at 136 petaflop/s.
Supercomputing 2020. ACM Gordon Bell award finalist.
Conditioning deep generative raw audio models for structured automatic music.