Song Han, Huizi Mao, William J. Dally

Abstract

Introduction

Network Pruning

Trained Quantization and Weight Sharing