Biography
I am a CPU architect working on high-performance RISC-V processors at Tenstorrent. I received my Ph.D. degree in Electrical and Computer Engineering from Cornell University, advised by Prof. Christopher Batten. My Ph.D. research focused on efficient parallel framework for heterogeneous multi-/many-core systems, area-/power-efficient support for next-generation vector architectures, and sparse matrix computation on matrix architectures.
Prior to joining Cornell, I received my Bachelors in Computer Science from University of Mississippi in 2016. I worked as a research co-op at AMD Research on modeling AMD’s next-generation GPU’s cache system and developing a cache coherence testing framework in gem5 simulator in 2017. During my PhD, I interned at Arm Research to explore wafer-scale many-core architecture and Arm’s Scalable Matrix Extension (SME).
Here is my CV (Updated on 09/08/2024).
Patent
- Joshua Randall, Jesse Garrett Beu, Krishnendra Nathella, Tuan Quang Ta. Vectorized Operations for Sparse Kernels, US 20230367843A1, Nov. 2023.
Selected Publications
Ting-Jung Chang, Ang Li, Fei Gao, Tuan Ta, Georgios Tziantzioulis, Yanghui Ou, Moyang Wang, Jinzheng Tu, Kaifeng Xu, Paul Jackson, August Ning, Grigory Chirkov, Marcelo Orenes-Vera, Shady Agwa, Xiaoyu Yan, Eric Tang, Jonathan Balkind, Christopher Batten, and David Wentzlaff. CIFER: An Open-Source, 12nm, 16mm2 SoC with Four 64-bit OS-Capable RISC-V Processors, 18 32-bit RISC-V Tiny Cores, and Coherently-Integrated eFPGA. IEEE Custom Integrated Circuits Conference (CICC), Apr. 2023.
Khalid Al-Hawaj, Tuan Ta, Nick Cebry, Shady Agwa, Olalekan Afuye, Eric Hall, Courtney Golden, Alyssa Apsel, Christopher Batten. EVE: Ephemeral Vector Engines IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2023.
Tuan Ta, Khalid Al-Hawaj, Nick Cebry, Yanghui Ou, Eric Hall, Courtney Golden, and Christopher Batten. big.VLITTLE: On-Demand Data-Parallel Acceleration for Mobile Systems on Chip. International Symposium on Microarchitecture (MICRO), 2022.
Lingjun Zhu, Tuan Ta, Rossana Liu, Rahul Mathur, Xiaoqing Xu, Shidhartha Das, Ankit Kaul, Alejandro Rico, Doug Joseph, Brian Cline, Sung Kyu Lim. Power Delivery and Thermal-Aware Arm-Based Multi-Tier 3D Architecture. IEEE/ACM International Symposium on Low Power Electronics and Design, July 2021.
Moyang Wang, Tuan Ta, Lin Cheng, and Christopher Batten. Efficiently Supporting Dynamic Task Parallelism on Heterogeneous Cache-Coherent Systems. 47th ACM/IEEE International Symposium on Computer Architecture (ISCA), June 2020.
Tuan Ta, Xianwei Zhang, Anthony Gutierrez, and Bradford M. Beckmann. Autonomous Data-Race-Free GPU Testing. IEEE International Symposium on Workload Characterization (IISWC 2019).
David Troendle, Tuan Ta, and Byunghyun Jang. A Specialized Concurrent Queue for Scheduling Irregular Workloads on GPUs. In the 48th International Conference on Parallel Processing (ICPP 2019). (Link)
Christopher Torng, Shunning Jiang, Khalid Al-Hawaj, Ivan Bukreyev, Berkin Ilbeyi, Tuan Ta, Lin Cheng, Julian Puscar, Ian Galton, and Christopher Batten. A New Era of Silicon Prototyping in Computer Architecture Research. RISC-V Day Workshop held in conjunction with MICRO-51, Oct. 2018. (Link)
Tuan Ta, Lin Cheng, and Christopher Batten. Simulating Multi-Core RISC-V Systems in gem5. In the 2nd Workshop on Computer Architecture Research with RISC-V (CARRV 2018). (Link)
Elliott Samuel, Raghu Raj Prasanna Kumar, Natasha Flyer, Tuan Ta, and Richard Loft. Implementation of a Scalable, Performance Portable Shallow Water Equation Solver Using Radial Basis Function-Generated Finite Difference Methods. In the International Journal of High Performance Computing Applications (IJHPCA 2018). (Link)
Tuan Ta, David Troendle, Xiaoqi Hu, Byunghyun Jang. Understanding the Impact of Fine-Grained Data Sharing and Thread Communication on Heterogeneous Workload Development. In the 16th IEEE International Symposium on Parallel & Distributed Computing (ISPDC 2017). (Link)
Tuan Ta, David Troendle, and Byunghyun Jang. Thread Communication and Synchronization on Massively Parallel GPUs. A book chapter in Advances in GPU Research and Practice edited by Hamid Sarbazi-Azad. (Link)
Tuan Ta, Kyoshin Choo, Eh Tan, Byunghyun Jang, Eunseo Choi. Accelerating DynEarthSol3D on Tightly Coupled CPU-GPU Heterogeneous Processors. In Computers & Geosciences Journal (2015). (Link)
Contributions to Open-Source Projects
- List of patches contributed to gem5 repository