Follow
Xinyue Shen
Xinyue Shen
CISPA Helmholtz Center for Information Security
Verified email at cispa.de - Homepage
Title
Cited by
Cited by
Year
"Do Anything Now": Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language Models
X Shen, Z Chen, M Backes, Y Shen, Y Zhang
arXiv preprint arXiv:2308.03825, 2023
1212023
In ChatGPT We Trust? Measuring and Characterizing the Reliability of ChatGPT
X Shen, Z Chen, M Backes, Y Zhang
arXiv preprint arXiv:2304.08979, 2023
60*2023
Evil Under the Sun: Understanding and Discovering Attacks on Ethereum Decentralized Applications
L Su, X Shen, X Du, X Liao, XF Wang, L Xing, B Liu
502021
MGTBench: Benchmarking Machine-Generated Text Detection
X He, X Shen, Z Chen, M Backes, Y Zhang
arXiv preprint arXiv:2303.14822, 2023
492023
Unsafe diffusion: On the generation of unsafe images and hateful memes from text-to-image models
Y Qu, X Shen, X He, M Backes, S Zannettou, Y Zhang
Proceedings of the 2023 ACM SIGSAC Conference on Computer and Communications …, 2023
332023
On Xing Tian and the Perseverance of Anti-China Sentiment Online
X Shen, X He, M Backes, J Blackburn, S Zannettou, Y Zhang
Proceedings of the International AAAI Conference on Web and Social Media 16 …, 2022
142022
Prompt Stealing Attacks Against Text-to-Image Generation Models
X Shen, Y Qu, M Backes, Y Zhang
arXiv preprint arXiv:2302.09923, 2023
112023
Backdoor Attacks in the Supply Chain of Masked Image Modeling
X Shen, X He, Z Li, Y Shen, M Backes, Y Zhang
arXiv preprint arXiv:2210.01632, 2022
72022
Comprehensive Assessment of Jailbreak Attacks Against LLMs
J Chu, Y Liu, Z Yang, X Shen, M Backes, Y Zhang
arXiv preprint arXiv:2402.05668, 2024
12024
Comprehensive Assessment of Toxicity in ChatGPT
B Zhang, X Shen, WM Si, Z Sha, Z Chen, A Salem, Y Shen, M Backes, ...
arXiv preprint arXiv:2311.14685, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–10