Aligning visual regions and textual concepts for semantic-grounded image representations F Liu, Y Liu, X Ren, X He, X Sun Proceedings of NeurIPS 2019, 2019 | 134 | 2019 |
simnet: Stepwise image-topic merging network for generating detailed and comprehensive image captions F Liu, X Ren, Y Liu, H Wang, X Sun Proceedings of EMNLP 2018 (Oral), 2018 | 121 | 2018 |
Exploring and distilling cross-modal information for image captioning F Liu, X Ren, Y Liu, K Lei, X Sun Proceedings of IJCAI 2019 (Oral), 2020 | 77 | 2020 |
Fetv: A benchmark for fine-grained evaluation of open-domain text-to-video generation Y Liu, L Li, S Ren, R Gao, S Li, S Chen, X Sun, L Hou Advances in Neural Information Processing Systems 36, 2024 | 52 | 2024 |
Tempcompass: Do video llms really understand videos? Y Liu, S Li, Y Liu, Y Wang, S Ren, L Li, S Chen, X Sun, L Hou arXiv preprint arXiv:2403.00476, 2024 | 50 | 2024 |
Towards robust visual question answering: Making the most of biased samples via contrastive learning Q Si, Y Liu, F Meng, Z Lin, P Fu, Y Cao, W Wang, J Zhou arXiv preprint arXiv:2210.04563, 2022 | 31 | 2022 |
Language prior is not the only shortcut: A benchmark for shortcut learning in vqa Q Si, F Meng, M Zheng, Z Lin, Y Liu, P Fu, Y Cao, W Wang, J Zhou arXiv preprint arXiv:2210.04692, 2022 | 25 | 2022 |
Generating paraphrase with topic as prior knowledge Y Liu, Z Lin, F Liu, Q Dai, W Wang Proceedings of the 28th ACM International Conference on Information and …, 2019 | 22 | 2019 |
Self-adaptive scaling for learnable residual structure F Liu, M Gao, Y Liu, K Lei Proceedings of the 23rd Conference on Computational Natural Language …, 2019 | 21 | 2019 |
DeCo: Decoupling Token Compression from Semantic Abstraction in Multimodal Large Language Models L Yao, L Li, S Ren, L Wang, Y Liu, X Sun, L Hou arXiv preprint arXiv:2405.20985, 2024 | 19 | 2024 |
Rosita: Refined bert compression with integrated techniques Y Liu, Z Lin, F Yuan Proceedings of the AAAI Conference on Artificial Intelligence 35 (10), 8715-8722, 2021 | 19 | 2021 |
Vitatecs: A diagnostic dataset for temporal concept understanding of video-language models S Li, L Li, Y Liu, S Ren, Y Liu, R Gao, X Sun, L Hou European Conference on Computer Vision, 331-348, 2024 | 14 | 2024 |
Connecting targets via latent topics and contrastive learning: A unified framework for robust zero-shot and few-shot stance detection R Liu, Z Lin, P Fu, Y Liu, W Wang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 14 | 2022 |
Learning to win lottery tickets in BERT transfer via task-agnostic mask training Y Liu, F Meng, Z Lin, P Fu, Y Cao, W Wang, J Zhou arXiv preprint arXiv:2204.11218, 2022 | 11 | 2022 |
Ranking and sampling in open-domain question answering Y Xu, Z Lin, Y Liu, R Liu, W Wang, D Meng Proceedings of the 2019 Conference on Empirical Methods in Natural Language …, 2019 | 8 | 2019 |
Learning class-transductive intent representations for zero-shot intent detection Q Si, Y Liu, P Fu, Z Lin, J Li, W Wang arXiv preprint arXiv:2012.01721, 2020 | 7 | 2020 |
Unsupervised pre-training for natural language generation: a literature review Y Liu, Z Lin arXiv preprint arXiv:1911.06171, 2019 | 7 | 2019 |
A win-win deal: Towards sparse and robust pre-trained language models Y Liu, F Meng, Z Lin, J Li, P Fu, Y Cao, W Wang, J Zhou Advances in Neural Information Processing Systems 35, 19189-19202, 2022 | 4 | 2022 |
Marginal Utility Diminishes: Exploring the Minimum Knowledge for BERT Knowledge Distillation Y Liu, F Meng, Z Lin, W Wang, J Zhou ACL-IJCNLP 2021, 2021 | 4 | 2021 |
Aligning visual regions and textual concepts: Learning fine-grained image representations for image captioning F Liu, Y Liu, X Ren, K Lei, X Sun arXiv preprint arXiv:1905.06139, 2019 | 4 | 2019 |