GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher | Y Yuan, W Jiao, W Wang, J Huang, P He, S Shi, Z Tu | The Twelfth International Conference on Learning Representations, 2023 | 40 | 2023 |
All languages matter: On the multilingual safety of large language models | W Wang, Z Tu, C Chen, Y Yuan, J Huang, W Jiao, MR Lyu | arXiv preprint arXiv:2310.00905, 2023 | 13 | 2023 |
On the Humanity of Conversational AI: Evaluating the Psychological Portrayal of LLMs | J Huang, W Wang, EJ Li, MH Lam, S Ren, Y Yuan, W Jiao, Z Tu, MR Lyu | The Twelfth International Conference on Learning Representations, 2023 | 12* | 2023 |
How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments | J Huang, EJ Li, MH Lam, T Liang, W Wang, Y Yuan, W Jiao, X Wang, Z Tu, ... | arXiv preprint arXiv:2403.11807, 2024 | 2 | 2024 |
A & B == B & A: Triggering logical reasoning failures in large language models | Y Wan, W Wang, Y Yang, Y Yuan, J Huang, P He, W Jiao, MR Lyu | arXiv preprint arXiv:2401.00757, 2024 | 2 | 2024 |
The Earth is Flat? Unveiling Factual Errors in Large Language Models | W Wang, J Shi, Z Tu, Y Yuan, J Huang, W Jiao, MR Lyu | arXiv preprint arXiv:2401.00761, 2024 | 1 | 2024 |