Seguir
Zidi Xiong
Zidi Xiong
Dirección de correo verificada de illinois.edu - Página principal
Título
Citado por
Citado por
Año
Decodingtrust: A comprehensive assessment of trustworthiness in gpt models
B Wang, W Chen, H Pei, C Xie, M Kang, C Zhang, C Xu, Z Xiong, R Dutta, ...
arXiv preprint arXiv:2306.11698, 2023
1222023
Umd: Unsupervised model detection for x2x backdoor attacks
Z Xiang, Z Xiong, B Li
International Conference on Machine Learning, 38013-38038, 2023
102023
Badchain: Backdoor chain-of-thought prompting for large language models
Z Xiang, F Jiang, Z Xiong, B Ramasubramanian, R Poovendran, B Li
arXiv preprint arXiv:2401.12242, 2024
92024
Label-smoothed backdoor attack
M Peng, Z Xiong, M Sun, P Li
arXiv preprint arXiv:2202.11203, 2022
82022
RigorLLM: Resilient Guardrails for Large Language Models against Undesired Content
Z Yuan, Z Xiong, Y Zeng, N Yu, R Jia, D Song, B Li
arXiv preprint arXiv:2403.13031, 2024
12024
CBD: A certified backdoor detector based on local dominant probability
Z Xiang, Z Xiong, B Li
Advances in Neural Information Processing Systems 36, 2024
12024
Rethinking the Necessity of Labels in Backdoor Removal
Z Xiong, D Wu, Y Wang, Y Wang
ICLR 2023 Workshop on Backdoor Attacks and Defenses in Machine Learning, 2023
2023
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–7