About Sangun Choi

Sangun_profile

I am a 5th year Ph.D student at Korea University. My research interest lies in broad topics for improving the energy efficiency of computing systems, including neural processing unit (NPU) architectures, graphics processing unit (GPU) architectures, and memory systems of future datacenter servers.

I'm focusing on developing an energy-efficient on-chip memory system for NPUs. NPUs implement their on-chip memory systems primarily with SRAM, consuming over 40% of the total chip area. Despite this substantial silicon investment, existing systems struggle to utilize these memory resources energy-efficiently. Therefore, NPUs necessitate a novel on-chip memory system design and management technique that offers a sustainable scalability and adaptability to diverse characteristics of emerging workloads.

Please check our publications for more details!

Research Interests

  • Sustainable NPU Architecture
  • Future Memory Systems
  • Software Techniques for GPUs

Publications

  • [IEEE CAL] EONSim: An NPU Simulator for On-Chip Memory and Embedding Vector Operations (First Author)
  • [IEEE TC] Bauhaus: Restructuring Vector Database for LLM Retrieval on CXL-Based Tiered Memory (Co-Author)
  • [IISWC] HALO: Hybrid Systolic Arrays via Logical Partitioning for Acceleration of Complex-Valued Neural Networks (Co-Author)
  • [IEEE CAL] MOST: Memory Oversubscription-aware Scheduling for Tensor Migration on GPU Unified Storage (Co-Author)
  • [LCTES] Kubism: Disassembling and Reassembling K-Means Clustering for Mobile Heterogeneous Platforms (First Author)
  • [ACM TOS] TM-Training: An Energy-Efficient Tiered Memory System for Deep Learning Training in NPUs (Co-Author)
  • [IEEE ESL] TLP Balancer: Predictive Thread Allocation for Multi-Tenant Inference in Embedded GPUs (Co-Author)
  • [IEEE Access] SAVector: Vectored Systolic Arrays (First Author)