Junting Pan (潘俊廷)
Junting Pan (潘俊廷)
Multimedia Lab (MMLab), The Chinese University of Hong Kong
Verified email at - Homepage
Cited by
Cited by
Shallow and deep convolutional networks for saliency prediction
J Pan*, K McGuinness*, N O'Connor, E E Sayrol, X Giro-i-Nieto
Computer Vision and Pattern Recognition (CVPR) 2016, 598-606, 2016
Salgan: Visual saliency prediction with generative adversarial networks
J Pan, CC Ferrer, K McGuinness, NE O'Connor, J Torres, E Sayrol, ...
CVPR Scene Understanding Workshop (SUNw) 2017, 2017
Internvideo: General video foundation models via generative and discriminative learning
Y Wang, K Li, Y Li, Y He, B Huang, Z Zhao, H Zhang, J Xu, Y Liu, Z Wang, ...
arXiv preprint arXiv:2212.03191, 2022
ST-Adapter: Parameter-Efficient Image-to-Video Transfer Learning
J Pan, Z Lin, X Zhu, J Shao, H Li
Advances in Neural Information Processing Systems (NeurIPS), 2022
Actor-context-actor relation network for spatio-temporal action localization
J Pan, S Chen, MZ Shou, Y Liu, J Shao, H Li
Conference on Computer Vision and Pattern Recognition (CVPR) 2021, 464-474, 2021
Edgevits: Competing light-weight cnns on mobile devices with vision transformers
J Pan, A Bulat, F Tan, X Zhu, L Dudziak, H Li, G Tzimiropoulos, B Martinez
European Conference on Computer Vision (ECCV) 2022, 2022
Online detection of action start in untrimmed, streaming videos
J Pan*, Z Shou*, J Chan, K Miyazawa, H Mansour, A Vetro, X Giro-i Nieto, ...
European Conference on Computer Vision (ECCV) 2018, 534-551, 2018
Video Generation from Single Semantic Label Map
J Pan, C Wang, X Jia, J Shao, L Sheng, J Yan, X Wang
Computer Vision and Pattern Recognition (CVPR) 2019, 3733-3742, 2019
Personalize segment anything model with one shot
R Zhang, Z Jiang, Z Guo, S Yan, J Pan, X Ma, H Dong, P Gao, H Li
arXiv preprint arXiv:2305.03048, 2023
Videollm: Modeling video sequence with large language models
G Chen, YD Zheng, J Wang, J Xu, Y Huang, J Pan, Y Wang, Y Wang, ...
arXiv preprint arXiv:2305.13292, 2023
Internvideo-ego4d: A pack of champion solutions to ego4d challenges
G Chen, S Xing, Z Chen, Y Wang, K Li, Y Li, Y Liu, J Wang, YD Zheng, ...
arXiv preprint arXiv:2211.09529, 2022
Journeydb: A benchmark for generative image understanding
K Sun, J Pan, Y Ge, H Li, H Duan, X Wu, R Zhang, A Zhou, Z Qin, Y Wang, ...
Advances in Neural Information Processing Systems 36, 2024
Retrieving-to-answer: Zero-shot video question answering with frozen large language models
J Pan, Z Lin, Y Ge, X Zhu, R Zhang, Y Wang, Y Qiao, H Li
Proceedings of the IEEE/CVF International Conference on Computer Vision, 272-283, 2023
High-Quality Video Generation from Static Structural Annotations
L Sheng*, J Pan*, J Guo, J Shao, CC Loy
International Journal of Computer Vision 128, 2552-2569, 2020, 2020
LEGO: Language Enhanced Multi-modal Grounding Model
Z Li, Q Xu, D Zhang, H Song, Y Cai, Q Qi, R Zhou, J Pan, Z Li, VT Vu, ...
arXiv preprint arXiv:2401.06071, 2024
MathGenie: Generating Synthetic Data with Question Back-translation for Enhancing Mathematical Reasoning of LLMs
Z Lu, A Zhou, H Ren, K Wang, W Shi, J Pan, M Zhan, H Li
arXiv preprint arXiv:2402.16352, 2024
Measuring Multimodal Mathematical Reasoning with MATH-Vision Dataset
K Wang, J Pan, W Shi, Z Lu, M Zhan, H Li
arXiv preprint arXiv:2402.14804, 2024
Sparsemae: Sparse training meets masked autoencoders
A Zhou, Y Li, Z Qin, J Liu, J Pan, R Zhang, R Zhao, P Gao, H Li
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
ReflectionCoder: Learning from Reflection Sequence for Enhanced One-off Code Generation
H Ren, M Zhan, Z Wu, A Zhou, J Pan, H Li
arXiv preprint arXiv:2405.17057, 2024
Online action detection
J Pan
Universitat Politčcnica de Catalunya, 2017
The system can't perform the operation now. Try again later.
Articles 1–20