Principle-driven self-alignment of language models from scratch with minimal human supervision Z Sun, Y Shen, Q Zhou, H Zhang, Z Chen, D Cox, Y Yang, C Gan
Thirty-seventh Conference on Neural Information Processing Systems, 2023
134 2023 Grounding physical object and event concepts through dynamic visual reasoning Z Chen, J Mao, J Wu, KYK Wong, JB Tenenbaum, C Gan
International Conference on Learning Representations, 2021
89 * 2021 Weakly-supervised spatio-temporally grounding natural sentence in video Z Chen, L Ma, W Luo, KYK Wong
Proceedings of the 57th Annual Meeting of the Association for Computational …, 2019
89 2019 Star: A benchmark for situated reasoning in real-world videos B Wu, S Yu, Z Chen, JB Tenenbaum, C Gan
Thirty-fifth conference on neural information processing systems datasets …, 2021
84 2021 3d-llm: Injecting the 3d world into large language models Y Hong, H Zhen, P Chen, S Zheng, Y Du, Z Chen, C Gan
Thirty-seventh Conference on Neural Information Processing Systems, 2023
80 * 2023 Look closer to ground better: Weakly-supervised temporal grounding of sentence in video Z Chen, L Ma, W Luo, P Tang, KYK Wong
arXiv preprint arXiv:2001.09308, 2020
66 2020 Dynamic visual reasoning by learning differentiable physics models from video and language M Ding, Z Chen, T Du, P Luo, J Tenenbaum, C Gan
Advances In Neural Information Processing Systems 34, 887-899, 2021
56 2021 Cops-ref: A new dataset and task on compositional referring expression comprehension Z Chen, P Wang, L Ma, KYK Wong, Q Wu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
54 2020 Ps-nerf: Neural inverse rendering for multi-view photometric stereo W Yang, G Chen, C Chen, Z Chen, KYK Wong
European Conference on Computer Vision, 266-284, 2022
53 2022 Planning with large language models for code generation S Zhang, Z Chen, Y Shen, M Ding, JB Tenenbaum, C Gan
International Conference on Learning Representations, 2023
52 2023 The blessings of unlabeled background in untrimmed videos Y Liu, J Chen, Z Chen, B Deng, J Huang, H Zhang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
40 2021 Comphy: Compositional physical reasoning of objects and events from videos Z Chen, K Yi, Y Li, M Ding, A Torralba, JB Tenenbaum, C Gan
International Conference on Learning Representations, 2022
37 2022 Mod-squad: Designing mixtures of experts as modular multi-task learners Z Chen, Y Shen, M Ding, Z Chen, H Zhao, EG Learned-Miller, C Gan
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
32 2023 S -NeRF: Neural Reflectance Field from Shading and Shadow under a Single Viewpoint W Yang, G Chen, C Chen, Z Chen, KYK Wong
Advances in Neural Information Processing Systems 35, 1568-1582, 2022
27 2022 3d concept learning and reasoning from multi-view images Y Hong, C Lin, Y Du, Z Chen, JB Tenenbaum, C Gan
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
26 2023 Salmon: Self-alignment with principle-following reward models Z Sun, Y Shen, H Zhang, Q Zhou, Z Chen, D Cox, Y Yang, C Gan
arXiv preprint arXiv:2310.05910, 2023
23 2023 Visual Chain-of-Thought Prompting for Knowledge-based Visual Reasoning Z Chen, Q Zhou, Y Shen, Y Hong, Z Sun, D Gutfreund, C Gan
AAAI Conference on Artificial Intelligence, 2024
21 * 2024 Embodied concept learner: Self-supervised learning of concepts and mapping through instruction following M Ding, Y Xu, Z Chen, DD Cox, P Luo, JB Tenenbaum, C Gan
Conference on Robot Learning, 1743-1754, 2023
15 2023 Deep face video inpainting via UV mapping W Yang, Z Chen, C Chen, G Chen, KYK Wong
IEEE Transactions on Image Processing 32, 1145-1157, 2023
10 2023 Moduleformer: Learning modular large language models from uncurated data Y Shen, Z Zhang, T Cao, S Tan, Z Chen, C Gan
arXiv preprint arXiv:2306.04640, 2023
8 2023