오늘은 맑음

NPU(Neural Processing Unit), AI Acclerator 관련 논문 정리 [1] algorithm 본문

NPU

NPU(Neural Processing Unit), AI Acclerator 관련 논문 정리 [1] algorithm

자전거 타는 구구 2020. 7. 10. 17:00
반응형

[1] algorithm

Anwar, Sajid, Kyuyeon Hwang, and Wonyong Sung. "Fixed point optimization of deep convolutional neural networks for object recognition." 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2015.

Gupta, Suyog, et al. "Deep learning with limited numerical precision." International Conference on Machine Learning. 2015.

Lin, Darryl, Sachin Talathi, and Sreekanth Annapureddy. "Fixed point quantization of deep convolutional networks." International conference on machine learning. 2016.

Lin, Zhouhan, et al. "Neural networks with few multiplications." arXiv preprint arXiv:1510.03009 (2015).

Courbariaux, Matthieu, Yoshua Bengio, and Jean-Pierre David. "Binaryconnect: Training deep neural networks with binary weights during propagations." Advances in neural information processing systems. 2015.

Hubara, Itay, et al. "Binarized neural networks: Training neural networks with weights and activations constrained to+ 1 or-1." arXiv preprint arXiv:1602.02830 (2016).

Iandola, Forrest N., et al. "SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and< 0.5 MB model size." arXiv preprint arXiv:1602.07360 (2016).

Chen, Wenlin, et al. "Compressing convolutional neural networks in the frequency domain." Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2016.

Chen, Wenlin, et al. "Compressing neural networks with the hashing trick." International conference on machine learning. 2015.

Sindhwani, Vikas, Tara Sainath, and Sanjiv Kumar. "Structured transforms for small-footprint deep learning." Advances in Neural Information Processing Systems. 2015.

Cireşan, Dan C., et al. "High-performance neural networks for visual object classification." arXiv preprint arXiv:1102.0183 (2011).

Denil, Misha, et al. "Predicting parameters in deep learning." Advances in neural information processing systems. 2013.

Han, Song, Huizi Mao, and William J. Dally. "Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding." arXiv preprint arXiv:1510.00149 (2015).

Hinton, Geoffrey, Oriol Vinyals, and Jeff Dean. "Distilling the knowledge in a neural network." arXiv preprint arXiv:1503.02531 (2015).

반응형
Comments