Publications
2025
-
The Rising Threat to Emerging AI-Powered Search Engines2025 -
Are We in the AI-Generated Text World Already? Quantifying and Monitoring AIGT on Social MediaarXiv preprint arXiv:2412.18148, 2025 -
SoK: Benchmarking Poisoning Attacks and Defenses in Federated Learning2025
2024
-
Quantized Delta Weight Is Safety KeeperarXiv preprint arXiv:2411.19530, 2024 -
On the Generalization Ability of Machine-Generated Text DetectorsarXiv preprint arXiv:2412.17242, 2024 -
Jailbreak attacks and defenses against large language models: A surveyarXiv preprint arXiv:2407.04295, 2024 -
PEFTGuard: Detecting Backdoor Attacks Against Parameter-Efficient Fine-TuningIEEE Symposium on Security and Privacy (On coming), 2024 -
GENNDTI: Drug-target interaction prediction using graph neural network enhanced by router nodesIEEE Journal of Biomedical and Health Informatics (Highlights), 2024 - AdSpectorX: A Multimodal Expert Spector for Covert Advertising Detection on Chinese Social MediaIn Proceedings of the Third International Workshop on Social and Metaverse Computing, Sensing and Networking , 2024
- Revealing the Difficulty in Jailbreak Defense on Language Models for MetaverseIn Proceedings of the Third International Workshop on Social and Metaverse Computing, Sensing and Networking , 2024