About Sangun Choi

Sangun_profile

I am a fifth-year Ph.D. student at Korea University. My research interests broadly encompass improving the energy efficiency of computing systems, including neural processing unit (NPU) architectures, graphics processing unit (GPU) architectures, and memory systems for future datacenter servers.

My current work focuses on developing energy-efficient on-chip memory systems for NPUs. NPUs implement their on-chip memory predominantly with SRAM, which occupies over 40% of the total chip area. Despite this substantial silicon investment, existing systems fall short of utilizing these memory resources in an energy-efficient manner. This challenge motivates the need for novel on-chip memory designs and management techniques that offer sustainable scalability and adaptability to the diverse characteristics of emerging workloads.

Please refer to our publications for more details.

Research Interests

  • Sustainable NPU Architecture
  • Future Memory Systems
  • Software Techniques for GPUs

Publications

  • [IEEE CAL] EONSim: An NPU Simulator for On-Chip Memory and Embedding Vector Operations (First Author)
  • [IEEE TC] Bauhaus: Restructuring Vector Database for LLM Retrieval on CXL-Based Tiered Memory (Co-Author)
  • [IISWC] HALO: Hybrid Systolic Arrays via Logical Partitioning for Acceleration of Complex-Valued Neural Networks (Co-Author)
  • [IEEE CAL] MOST: Memory Oversubscription-aware Scheduling for Tensor Migration on GPU Unified Storage (Co-Author)
  • [LCTES] Kubism: Disassembling and Reassembling K-Means Clustering for Mobile Heterogeneous Platforms (First Author)
  • [ACM TOS] TM-Training: An Energy-Efficient Tiered Memory System for Deep Learning Training in NPUs (Co-Author)
  • [IEEE ESL] TLP Balancer: Predictive Thread Allocation for Multi-Tenant Inference in Embedded GPUs (Co-Author)
  • [IEEE Access] SAVector: Vectored Systolic Arrays (First Author)