Prototypical pseudo label denoising and target structure learning for domain adaptive semantic segmentation P Zhang, B Zhang, T Zhang, D Chen, Y Wang, F Wen Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021 | 465 | 2021 |
Cross-domain correspondence learning for exemplar-based image translation P Zhang, B Zhang, D Chen, L Yuan, F Wen Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020 | 397 | 2020 |
Cocosnet v2: Full-resolution correspondence learning for image translation X Zhou, B Zhang, T Zhang, P Zhang, J Bao, D Chen, Z Zhang, F Wen Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 265 | 2021 |
Bringing old photos back to life Z Wan, B Zhang, D Chen, P Zhang, D Chen, J Liao, F Wen proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020 | 209 | 2020 |
Internlm: A multilingual language model with progressively enhanced capabilities ILM Team 2023-01-06)[2023-09-27]. https://github. com/InternLM/InternLM, 2023 | 116 | 2023 |
Sharegpt4v: Improving large multi-modal models with better captions L Chen, J Li, X Dong, P Zhang, C He, J Wang, F Zhao, D Lin arXiv preprint arXiv:2311.12793, 2023 | 74 | 2023 |
Internlm-xcomposer: A vision-language large model for advanced text-image comprehension and composition P Zhang, XDB Wang, Y Cao, C Xu, L Ouyang, Z Zhao, S Ding, S Zhang, ... arXiv preprint arXiv:2309.15112, 2023 | 58 | 2023 |
Old photo restoration via deep latent space translation Z Wan, B Zhang, D Chen, P Zhang, F Wen, J Liao IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (2), 2071-2087, 2022 | 56 | 2022 |
V3det: Vast vocabulary visual detection dataset J Wang, P Zhang, T Chu, Y Cao, Y Zhou, T Wu, B Wang, C He, D Lin Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 26 | 2023 |
Vigc: Visual instruction generation and correction B Wang, F Wu, X Han, J Peng, H Zhong, P Zhang, X Dong, W Li, W Li, ... Proceedings of the AAAI Conference on Artificial Intelligence 38 (6), 5309-5317, 2024 | 24 | 2024 |
InternLM-XComposer2: Mastering free-form text-image composition and comprehension in vision-language large model X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, X Wei, S Zhang, ... arXiv preprint arXiv:2401.16420, 2024 | 24 | 2024 |
Metaportrait: Identity-preserving talking head generation with fast personalized adaptation B Zhang, C Qi, P Zhang, B Zhang, HT Wu, D Chen, Q Chen, Y Wang, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 22 | 2023 |
Robust mutual learning for semi-supervised semantic segmentation P Zhang, B Zhang, T Zhang, D Chen, F Wen arXiv preprint arXiv:2106.00609, 2021 | 17 | 2021 |
Opera: Alleviating hallucination in multi-modal large language models via over-trust penalty and retrospection-allocation Q Huang, X Dong, P Zhang, B Wang, C He, J Wang, D Lin, W Zhang, ... arXiv preprint arXiv:2311.17911, 2023 | 15 | 2023 |
Freedrag: Point tracking is not you need for interactive point-based image editing P Ling, L Chen, P Zhang, H Chen, Y Jin arXiv preprint arXiv:2307.04684, 2023 | 12 | 2023 |
Alpha-CLIP: A clip model focusing on wherever you want Z Sun, Y Fang, T Wu, P Zhang, Y Zang, S Kong, Y Xiong, D Lin, J Wang arXiv preprint arXiv:2312.03818, 2023 | 9 | 2023 |
Real-time neural character rendering with pose-guided multiplane images H Ouyang, B Zhang, P Zhang, H Yang, J Yang, D Chen, Q Chen, F Wen European Conference on Computer Vision, 192-209, 2022 | 9 | 2022 |
Internlm2 technical report Z Cai, M Cao, H Chen, K Chen, K Chen, X Chen, X Chen, Z Chen, Z Chen, ... arXiv preprint arXiv:2403.17297, 2024 | 7 | 2024 |
Hyperdreamer: Hyper-realistic 3d content generation and editing from a single image T Wu, Z Li, S Yang, P Zhang, X Pan, J Wang, D Lin, Z Liu SIGGRAPH Asia 2023 Conference Papers, 1-10, 2023 | 5 | 2023 |
Long-clip: Unlocking the long-text capability of clip B Zhang, P Zhang, X Dong, Y Zang, J Wang arXiv preprint arXiv:2403.15378, 2024 | 3 | 2024 |