Jailbreak attacks and defenses against large language models: A survey S Yi, Y Liu, Z Sun, T Cong, X He, J Song, K Xu, Q Li arXiv preprint arXiv:2407.04295, 2024 | 20 | 2024 |
Quantized Delta Weight Is Safety Keeper Y Liu, Z Sun, X He, X Huang arXiv preprint arXiv:2411.19530, 2024 | | 2024 |
PEFTGuard: Detecting Backdoor Attacks Against Parameter-Efficient Fine-Tuning Z Sun, T Cong, Y Liu, C Lin, X He, R Chen, X Han, X Huang arXiv preprint arXiv:2411.17453, 2024 | | 2024 |
Revealing the Difficulty in Jailbreak Defense on Language Models for Metaverse Z Kang, Y Liu, J Zheng, Z Sun Proceedings of the Third International Workshop on Social and Metaverse …, 2024 | | 2024 |
AdSpectorX: A Multimodal Expert Spector for Covert Advertising Detection on Chinese Social Media Z Zhang, Y Han, Z Zhang, Y Liu, J Zheng, Z Sun Proceedings of the Third International Workshop on Social and Metaverse …, 2024 | | 2024 |
GENNDTI: Drug-target interaction prediction using graph neural network enhanced by router nodes B Yang, Y Liu, J Wu, F Bai, M Zheng, J Zheng IEEE Journal of Biomedical and Health Informatics, 2024 | | 2024 |