About
Introduction
I am a PhD candidate in the School of Computer Science at Peking University, advised by Prof. Yun Liang. My research interests include programming language, compiler design, high-performance computing, and system-level optimization for machine learning.
Education
- 2021~ | PhD Candidate, School of Computer Science, Peking University, Beijing, China
- 2017~2021 | Bachelor of Science, School of Computer Science, Peking University, Beijing, China
Awards
- 2024 | ByteDance Scholarship, ByteDance, China
- 2022~2023 | Schlumberger Scholarship, School of Computer Science, Peking University, Beijing, China
- 2019~2020 | PKU Second Class Scholarship, Peking University, Beijing, China
- 2019~2020 | Merit Student, Peking University, Beijing, China
- 2017~2018 | Merit Student, Peking University, Beijing, China
Publications
(“*” means equal contribution)
Venue | Title & Links | Author List |
---|---|---|
| ||
NIPS’24 | ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction [site][code][paper][slides][poster] | Renze Chen, Zhuofeng Wang, Beiquan Cao, Tong Wu, Size Zheng, Xiuhong Li, Xuechao Wei, Shengen Yan, Meng Li, Yun Liang |
| ||
ICCAD’24 | MCUBERT: Memory-Efficient BERT Inference on Commodity Microcontrollers [site][paper] | Zebin Yang*, Renze Chen*, Taiqiang Wu, Ngai Wong, Yun Liang, Runsheng Wang, Ru Huang, Meng Li |
| ||
ICCAD’24 | FlexHE: a Flexible Kernel Generation Framework for Homomorphic Encryption-Based Private Inference [site][paper] | Jiangrui Yu, Wenxuan Zeng, Tianshi Xu, Renze Chen, Yun Liang, Runsheng Wang, Ru Huang, Meng Li |
| ||
DAC’24 | MoteNN: Memory Optimization via Fine-grained Scheduling for Deep Neural Networks on Tiny Devices [site][paper][slides][poster] | Renze Chen, Zijian Ding, Size Zheng, Meng Li, and Yun Liang |
| ||
MLSys’24 | vMCU: Coordinated Memory Management and Kernel Optimization for DNN Inference on MCUs [site][paper][slides] | Size Zheng*, Renze Chen*, Meng Li, Zihao Ye, Luis Ceze, and Yun Liang |
| ||
ASPLOS’24 | MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN [site][code][paper][slides][poster] | Renze Chen, Zijian Ding, Size Zheng, Chengrui Zhang, Jingwen Leng, Xuanzhe Liu, and Yun Liang |
| ||
HPCA’23 | Chimera: An Analytical Optimizing Framework for Effective Compute-intensive Operators Fusion [site][paper] | Size Zheng, Siyuan Chen, Peidi Song, Renze Chen, Xiuhong Li, Shengen Yan, Dahua Lin, Jingwen Leng, and Yun Liang |
| ||
ISCA’22 | AMOS: Enabling Automatic Mapping for Tensor Computations On Spatial Accelerators with Hardware Abstraction [site][code][paper] | Size Zheng, Renze Chen, Anjiang Wei, Yicheng Jin, Qin Han, Liqiang Lu, Bingyang Wu, Xiuhong Li, Shengen Yan, and Yun Liang |
| ||
TPDS’22 | NeoFlow: A Flexible Framework for Enabling Efficient Compilation for High Performance DNN Training [site][paper] | Size Zheng, Renze Chen, Yicheng Jin, Anjiang Wei, Bingyang Wu, Xiuhong Li, Shengen Yan, and Yun Liang |
| ||
ASPLOS’20 | FlexTensor: An Automatic Schedule Exploration and Optimization Framework for Tensor Computation on Heterogeneous System [site][code][paper] | Size Zheng, Yun Liang, Shuo Wang, Renze Chen, and Kaiwen Sheng |
Projects
Name | Description |
---|---|
| |
ArkVale | ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction (NIPS’24). |
| |
MAGIS | MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS’24). |
| |
AMOS | AMOS: Enabling Automatic Mapping for Tensor Computations On Spatial Accelerators with Hardware Abstraction (ISCA’23). |
| |
FlexTensor | FlexTensor: An Automatic Schedule Exploration and Optimization Framework for Tensor Computation on Heterogeneous System (ASPLOS’20). |
| |
CCTV | CCTV: C++ Compile-Time EValuator for Scheme LISP. An interpreter for a tiny dialect of Scheme LISP language implemented with C++ template meta-programming. |
| |
CppyList | A C++ library of python-like heterogeneous list. |
| |
CppFP | A C++ library for “curry”, “partial” and some other functional programming combinators. |
| |
ECAIA | Implementation of “Essentials of Compilation: An Incremental Approach” with Racket language. |
| |
AVL-Tree | An AVL-Tree implementation with linear-time merging operation. |
| |
CoCo | A simple symmetry coroutine library for POSIX C. |
Coursework
Name | Description |
---|---|
| |
Rust-Linear | Project of Design Principles of Programming Language (2022). Implementation of a flexible linear type system with Rust language. |
| |
Shift-Reset | Project of Design Principles of Programming Language (2020). Implementation of paper “Selective CPS Transformation for shift and reset” with OCaml language. |
| |
MIT-JOS | Lab. of Operating System (Honor Track). Implementation of a tiny micro-kernel OS. |
| |
MiniC | Lab. of Practice for Compiler Design. Implementation of a C compiler from scratch. |