Systems and Architecture for ML
Recent Publications
Systems and Hardware Support
[ISCA 2023] OliVe: Accelerating Large Language Models via Hardware-friendly Outlier-Victim Pair Quantization
[MICRO 2022] ANT: Exploiting Adaptive Numerical Data Type for Low-bit Deep Neural Network Quantization
[IISWC 2021] Characterizing and Demystifying the Implicit Convolution Algorithm on Commercial Matrix-Multiplication Accelerators
[ISPASS 2020] A Systematic Methodology for Characterizing Scalability of DNN Accelerators using SCALE-Sim