Title | Authors | Venue | Citations | Year |
Improved fusion of visual and language representations by dense symmetric co-attention for visual question answering | DK Nguyen, T Okatani | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2018 | 345 | 2018 |
Multi-task learning of hierarchical vision-language representation | DK Nguyen, T Okatani | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019 | 71 | 2019 |
Ur2kid: Unifying retrieval, keypoint detection, and keypoint description without local correspondence supervision | TY Yang*, DK Nguyen*, H Heijnen, V Balntas | arXiv preprint arXiv:2001.07252, 2020 | 41 | 2020 |
BoxeR: Box-Attention for 2D and 3D Transformers | DK Nguyen, J Ju, O Booij, MR Oswald, CGM Snoek | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 40 | 2022 |
MoVie: Revisiting Modulated Convolutions for Visual Counting and Beyond | DK Nguyen, V Goswami, X Chen | Proceedings of the International Conference on Learning Representations, 2021 | 33 | 2021 |
R-MAE: Regions Meet Masked Autoencoders | DK Nguyen, V Aggarwal, Y Li, MR Oswald, A Kirillov, CGM Snoek, ... | Proceedings of the International Conference on Learning Representations, 2024 | 6 | 2024 |
An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels | DK Nguyen, M Assran, U Jain, MR Oswald, CGM Snoek, X Chen | arXiv preprint arXiv:2406.09415, 2024 | 2 | 2024 |
SimPLR: A Simple and Plain Transformer for Object Detection and Segmentation | DK Nguyen, MR Oswald, CGM Snoek | arXiv preprint arXiv:2310.05920, 2023 | | 2023 |