Ruobing Han

hanruobing@google.com

I am a software engineer at Google Cloud. My research areas include compilers, architecture, and ML systems. I received my Ph.D. in Computer Science from Georgia Tech in 2025, advised by Prof. Hyesoon Kim. I also received my B.S. degree from the College of EECS at Peking University in 2018.


Education

Georgia Institute of Technology

Ph.D, Computer Science
May 2021 - May 2025

Peking University

Bachelor of Science, Computer Science and Technology
Sep 2014 - Aug 2018

Research Experience

HPArch, Georgia Institute of Technology

Graduate Research Assistant

Advisor: Prof. Hyesoon Kim

May 2021 - May 2025

HPC-AI, Nation University of Singapore

Research Assistant

Advisor: Prof. Yang You

May 2020 - Apr 2021

Work Experience

Google, Mountain View, USA

Software Engineer, TPU performance team
May 2025 - Present

Google, Sunnyvale, USA

Software Engineering Intern, TPU performance team and XLA compiler team
May 2024 - Jul 2024

Google, Sunnyvale, USA

Software Engineering Intern, Debug toolkit team
May 2023 - Jul 2023

Google, Sunnyvale, USA

Software Engineering Intern, LLVM core team
May 2022 - Aug 2022

Publication

  • SoftCUDA: Running CUDA on Softcore GPU
    Chihyo Ahn, Ruobing Han, Udit Subramanya, Jisheng Zhao, Blaise Tine, Hyesoon Kim IEEE Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM), 2025
    paper
  • Unleashing CPU Potential for Executing GPU Programs through Compiler/Runtime Optimizations
    Ruobing Han, Jisheng Zhao, Hyesoon Kim
    IEEE/ACM International Symposium on Microarchitecture (MICRO), 2024
    paper
  • CuPBoP: Making CUDA a Portable Language
    Ruobing Han, Jun Chen, Bhanu Garg, Xule Zhou, John Lu, Jeffrey Young, Jaewoong Sim, Hyesoon Kim
    ACM Transactions on Design Automation of Electronic Systems (TODAES), 2024
    paper
  • Exponentially Expanding the Phase-Ordering Search Space via Dormant Information
    Ruobing Han, Hyesoon Kim
    International Conference on Compiler Construction (CC), 2024
    paper
  • Enabling Fine-Grained Incremental Builds By Making Compiler Stateful
    Ruobing Han, Jisheng Zhao, Hyesoon Kim
    International Symposium on Code Generation and Optimization (CGO), 2024
    paper
  • COX: Exposing CUDA Warp-Level Functions to CPUs
    Ruobing Han, Jaewon Lee, Jaewoong Sim, Hyesoon Kim
    ACM Transactions on Architecture and Code Optimization (TACO), 2022
    paper
  • Supporting CUDA for an extended RISC-V GPU architecture
    Ruobing Han, Blaise Tine, Jaewon Lee, Jaewoong Sim, Hyesoon Kim
    the Fifth Workshop on RISC-V for ComputerArchitecture Research, 2021
    paper
  • Dynamic scaling for low-precision learning
    Ruobing Han, Min Si, James Demmel, Yang You
    the 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2021
    paper
  • Auto-Precision Scaling for Distributed Deep Learning
    Ruobing Han, James Demmel, Yang You
    International Conference on High Performance Computing, 2021
    paper
  • Optimizing Network Performance for Distributed DNN Training on GPU Clusters: ImageNet/AlexNet Training in 1.5 Minutes
    Peng Sun, Wansen Feng, Ruobing Han, Shengen Yan, Yonggang Wen
    IEEE Transactions on Big Data, 2020
    paper

Presentation

  • CuPBoP: CUDA for Parallelized and Broad-range Processors
    San Jose, California, USA
    The LLVM Developers' Meeting, 2022
    link