Markus Nagel
Qualcomm AI Research
Verified email at qualcomm.com
Title
Cited by
Year
Data-Free Quantization through Weight Equalization and Bias Correction
M Nagel, M Baalen, T Blankevoort, M Welling
Proceedings of the IEEE International Conference on Computer Vision, 1325-1334, 2019
Cited by: 592 | Year: 2019
A White Paper on Neural Network Quantization
M Nagel, M Fournarakis, RA Amjad, Y Bondarenko, M van Baalen, ...
arXiv preprint arXiv:2106.08295, 2021
Cited by: 555 | Year: 2021
Up or Down? Adaptive Rounding for Post-Training Quantization
M Nagel, RA Amjad, M van Baalen, C Louizos, T Blankevoort
Proceedings of the 37th International Conference on Machine Learning, 2020
Cited by: 529 | Year: 2020
LSQ+: Improving low-bit quantization through learnable offsets and better initialization
Y Bhalgat, J Lee, M Nagel, T Blankevoort, N Kwak
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
Cited by: 247 | Year: 2020
Bayesian bits: Unifying quantization and pruning
M Van Baalen, C Louizos, M Nagel, RA Amjad, Y Wang, T Blankevoort, ...
Advances in neural information processing systems 33, 5741-5752, 2020
Cited by: 137 | Year: 2020
Understanding and Overcoming the Challenges of Efficient Transformer Quantization
Y Bondarenko, M Nagel, T Blankevoort
arXiv preprint arXiv:2109.12948, 2021
Cited by: 132 | Year: 2021
Overcoming Oscillations in Quantization-Aware Training
M Nagel, M Fournarakis, Y Bondarenko, T Blankevoort
International Conference on Machine Learning, 16318-16330, 2022
Cited by: 93 | Year: 2022
Fp8 quantization: The power of the exponent
A Kuzmin, M Van Baalen, Y Ren, M Nagel, J Peters, T Blankevoort
Advances in Neural Information Processing Systems 35, 14651-14662, 2022
Cited by: 67 | Year: 2022
Quantizable Transformers: Removing Outliers by Helping Attention Heads Do Nothing
Y Bondarenko, M Nagel, T Blankevoort
Advances in Neural Information Processing Systems 36, 2023
Cited by: 64 | Year: 2023
Implicit Neural Video Compression
Y Zhang, T van Rozendaal, J Brehmer, M Nagel, T Cohen
arXiv preprint arXiv:2112.11312, 2021
Cited by: 56 | Year: 2021
Beam Loss Monitoring for LHC Machine Protection
EB Holzer, B Dehning, E Effinger, J Emery, V Grishin, C Hajdu, S Jackson, ...
Physics Procedia 37, 2055-2062, 2012
Cited by: 41 | Year: 2012
Pruning vs Quantization: Which is Better?
A Kuzmin, M Nagel, M Van Baalen, A Behboodi, T Blankevoort
Advances in Neural Information Processing Systems 36, 2023
Cited by: 36 | Year: 2023
Neural Network Quantization with AI Model Efficiency Toolkit (AIMET)
S Siddegowda, M Fournarakis, M Nagel, T Blankevoort, C Patel, ...
arXiv preprint arXiv:2201.08442, 2022
Cited by: 34 | Year: 2022
Event Fisher Vectors: Robust Encoding Visual Diversity of Visual Streams.
M Nagel, T Mensink, CGM Snoek
BMVC 2, 6, 2015
Cited by: 31 | Year: 2015
FP8 versus INT8 for efficient deep learning inference
M van Baalen, A Kuzmin, SS Nair, Y Ren, E Mahurin, C Patel, ...
arXiv preprint arXiv:2303.17951, 2023
Cited by: 30 | Year: 2023
Taxonomy and Evaluation of Structured Compression of Convolutional Neural Networks
A Kuzmin, M Nagel, S Pitre, S Pendyam, T Blankevoort, M Welling
arXiv preprint arXiv:1912.09802, 2019
Cited by: 25 | Year: 2019
The LLM Surgeon
TFA van der Ouderaa, M Nagel, M van Baalen, YM Asano, T Blankevoort
The Twelfth International Conference on Learning Representations (ICLR), 2023
Cited by: 17 | Year: 2023
Cyclical Pruning for Sparse Neural Networks
S Srinivas, A Kuzmin, M Nagel, M van Baalen, A Skliar, T Blankevoort
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 17 | Year: 2022
Quantization Robust Federated Learning for Efficient Inference on Heterogeneous Devices
K Gupta, M Fournarakis, M Reisser, C Louizos, M Nagel
arXiv preprint arXiv:2206.10844, 2022
Cited by: 16 | Year: 2022
MobileNVC: Real-time 1080p Neural Video Compression on a Mobile Device
T van Rozendaal, T Singhal, H Le, G Sautiere, A Said, K Buska, A Raha, ...
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024
Cited by: 13 | Year: 2024