Follow
Yangsibo Huang
Yangsibo Huang
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
Evaluating Gradient Inversion Attacks and Defenses in Federated Learning
Y Huang, S Gupta, Z Song, K Li, S Arora
NeurIPS 2021, 2021
2492021
Deep Q learning driven CT pancreas segmentation with geometry-aware U-Net
Y Man*, Y Huang*, J Feng, X Li, F Wu
IEEE Transactions on Medical Imaging, 2019
1632019
Catastrophic Jailbreak of Open-Source LLMs via Exploiting Generation
Y Huang, S Gupta, M Xia, K Li, D Chen
ICLR 2024, 2024
1562024
Instahide: Instance-hiding schemes for private distributed learning
Y Huang, Z Song, K Li, S Arora
ICML 2020, 2020
1542020
Detecting pretraining data from large language models
W Shi, A Ajith, M Xia, Y Huang, D Liu, T Blevins, D Chen, L Zettlemoyer
ICLR 2024, 2024
1402024
Recovering Private Text in Federated Learning of Language Models
S Gupta*, Y Huang*, Z Zhong, T Gao, K Li, D Chen
NeurIPS 2022, 2022
682022
TextHide: Tackling Data Privacy in Language Understanding Tasks
Y Huang, Z Song, D Chen, K Li, S Arora
EMNLP 2020, 2020
572020
Advancing differential privacy: Where we are now and future directions for real-world deployment
R Cummings, D Desfontaines, D Evans, R Geambasu, Y Huang, ...
Harvard Data Science Review, 2024
43*2024
DeepMC: a deep learning method for efficient Monte Carlo beamlet dose calculation by predictive denoising in magnetic resonance-guided radiotherapy
R Neph, Q Lyu, Y Huang, YM Yang, K Sheng
Physics in Medicine & Biology 66 (3), 035022, 2021
40*2021
Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications
B Wei*, K Huang*, Y Huang*, T Xie, X Qi, M Xia, P Mittal, M Wang, ...
ICML 2024, 2024
342024
A Dataset Auditing Method for Collaboratively Trained Machine Learning Models
Y Huang, CY Huang, X Li, K Li
IEEE Transactions on Medical Imaging, 2022
25*2022
Privacy Implications of Retrieval-Based Language Models
Y Huang, S Gupta, Z Zhong, K Li, D Chen
EMNLP 2023, 2023
242023
Privacy-Preserving Learning via Deep Net Pruning
Y Huang, Y Su, S Ravi, Z Song, S Arora, K Li
arXiv preprint arXiv:2003.01876, 2020
21*2020
NN-Adapter: Efficient Domain Adaptation for Black-Box Language Models
Y Huang, D Liu, Z Zhong, W Shi, YT Lee
arXiv preprint arXiv:2302.10879, 2023
132023
A Safe Harbor for AI Evaluation and Red Teaming
S Longpre, S Kapoor, K Klyman, A Ramaswami, R Bommasani, ...
ICML 2024, 2024
122024
MUSE: Machine Unlearning Six-way Evaluation for Language Models
W Shi, J Lee, Y Huang, S Malladi, J Zhao, A Holtzman, D Liu, ...
arXiv preprint arXiv:2407.06460, 2024
102024
SORRY-bench: Systematically evaluating large language model safety refusal behaviors
T Xie, X Qi, Y Zeng, Y Huang, UM Sehwag, K Huang, L He, B Wei, D Li, ...
arXiv preprint arXiv:2406.14598, 2024
82024
IFGAN: Missing Value Imputation using Feature-specific Generative Adversarial Networks
W Qiu, Y Huang, Q Li
2020 IEEE International Conference on Big Data (Big Data), 2020
62020
Evaluating Copyright Takedown Methods for Language Models
B Wei, W Shi, Y Huang, NA Smith, C Zhang, L Zettlemoyer, K Li, ...
arXiv preprint arXiv:2406.18664, 2024
52024
AI Risk Management Should Incorporate Both Safety and Security
X Qi, Y Huang, Y Zeng, E Debenedetti, J Geiping, L He, K Huang, ...
arXiv preprint arXiv:2405.19524, 2024
42024
The system can't perform the operation now. Try again later.
Articles 1–20