Biography

I am a CPU architect working on high-performance RISC-V processors at Tenstorrent. I received my Ph.D. degree in Electrical and Computer Engineering from Cornell University, advised by Prof. Christopher Batten. My Ph.D. research focused on efficient parallel framework for heterogeneous multi-/many-core systems, area-/power-efficient support for next-generation vector architectures, and sparse matrix computation on matrix architectures.

Prior to joining Cornell, I received my Bachelors in Computer Science from University of Mississippi in 2016. I worked as a research co-op at AMD Research on modeling AMD’s next-generation GPU’s cache system and developing a cache coherence testing framework in gem5 simulator in 2017. During my PhD, I interned at Arm Research to explore wafer-scale many-core architecture and Arm’s Scalable Matrix Extension (SME).

Here is my CV (Updated on 09/08/2024).

Patent

  • Joshua Randall, Jesse Garrett Beu, Krishnendra Nathella, Tuan Quang Ta. Vectorized Operations for Sparse Kernels, US 20230367843A1, Nov. 2023.

Selected Publications

  • Ting-Jung Chang, Ang Li, Fei Gao, Tuan Ta, Georgios Tziantzioulis, Yanghui Ou, Moyang Wang, Jinzheng Tu, Kaifeng Xu, Paul Jackson, August Ning, Grigory Chirkov, Marcelo Orenes-Vera, Shady Agwa, Xiaoyu Yan, Eric Tang, Jonathan Balkind, Christopher Batten, and David Wentzlaff. CIFER: An Open-Source, 12nm, 16mm2 SoC with Four 64-bit OS-Capable RISC-V Processors, 18 32-bit RISC-V Tiny Cores, and Coherently-Integrated eFPGA. IEEE Custom Integrated Circuits Conference (CICC), Apr. 2023.

  • Khalid Al-Hawaj, Tuan Ta, Nick Cebry, Shady Agwa, Olalekan Afuye, Eric Hall, Courtney Golden, Alyssa Apsel, Christopher Batten. EVE: Ephemeral Vector Engines IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2023.

  • Tuan Ta, Khalid Al-Hawaj, Nick Cebry, Yanghui Ou, Eric Hall, Courtney Golden, and Christopher Batten. big.VLITTLE: On-Demand Data-Parallel Acceleration for Mobile Systems on Chip. International Symposium on Microarchitecture (MICRO), 2022.

  • Lingjun Zhu, Tuan Ta, Rossana Liu, Rahul Mathur, Xiaoqing Xu, Shidhartha Das, Ankit Kaul, Alejandro Rico, Doug Joseph, Brian Cline, Sung Kyu Lim. Power Delivery and Thermal-Aware Arm-Based Multi-Tier 3D Architecture. IEEE/ACM International Symposium on Low Power Electronics and Design, July 2021.

  • Moyang Wang, Tuan Ta, Lin Cheng, and Christopher Batten. Efficiently Supporting Dynamic Task Parallelism on Heterogeneous Cache-Coherent Systems. 47th ACM/IEEE International Symposium on Computer Architecture (ISCA), June 2020.

  • Tuan Ta, Xianwei Zhang, Anthony Gutierrez, and Bradford M. Beckmann. Autonomous Data-Race-Free GPU Testing. IEEE International Symposium on Workload Characterization (IISWC 2019).

  • David Troendle, Tuan Ta, and Byunghyun Jang. A Specialized Concurrent Queue for Scheduling Irregular Workloads on GPUs. In the 48th International Conference on Parallel Processing (ICPP 2019). (Link)

  • Christopher Torng, Shunning Jiang, Khalid Al-Hawaj, Ivan Bukreyev, Berkin Ilbeyi, Tuan Ta, Lin Cheng, Julian Puscar, Ian Galton, and Christopher Batten. A New Era of Silicon Prototyping in Computer Architecture Research. RISC-V Day Workshop held in conjunction with MICRO-51, Oct. 2018. (Link)

  • Tuan Ta, Lin Cheng, and Christopher Batten. Simulating Multi-Core RISC-V Systems in gem5. In the 2nd Workshop on Computer Architecture Research with RISC-V (CARRV 2018). (Link)

  • Elliott Samuel, Raghu Raj Prasanna Kumar, Natasha Flyer, Tuan Ta, and Richard Loft. Implementation of a Scalable, Performance Portable Shallow Water Equation Solver Using Radial Basis Function-Generated Finite Difference Methods. In the International Journal of High Performance Computing Applications (IJHPCA 2018). (Link)

  • Tuan Ta, David Troendle, Xiaoqi Hu, Byunghyun Jang. Understanding the Impact of Fine-Grained Data Sharing and Thread Communication on Heterogeneous Workload Development. In the 16th IEEE International Symposium on Parallel & Distributed Computing (ISPDC 2017). (Link)

  • Tuan Ta, David Troendle, and Byunghyun Jang. Thread Communication and Synchronization on Massively Parallel GPUs. A book chapter in Advances in GPU Research and Practice edited by Hamid Sarbazi-Azad. (Link)

  • Tuan Ta, Kyoshin Choo, Eh Tan, Byunghyun Jang, Eunseo Choi. Accelerating DynEarthSol3D on Tightly Coupled CPU-GPU Heterogeneous Processors. In Computers & Geosciences Journal (2015). (Link)

Contributions to Open-Source Projects

  • List of patches contributed to gem5 repository