I am a PhD student in DSA Thrust of the Information Hub at Hong Kong University of Science and Technology (Guangzhou) advised by Prof. Xinlei He.
My research interest includes AI Security and Privacy. ().
๐ฅ News
- 2025.5:ย ๐๐(AIGT detection) One paper got accepted in ACL 2025; Two paper got accepted in KDD D&B track 2025
- 2025.3:ย ๐๐ PEFTGuard got accepted in IEEE S$\&$P 2025
- 2024.11: ย ๐๐ Our paper won the Best Paper Award of SENSYS-SocialMetaโ24.
- 2024.06: ย ๐๐ I receive my firm PhD offer from HKUST(GZ).
๐ Publications
$^\star$: Equal contribution; $^\dagger$: Corresponding author
Conference
-
[ACL Mainโ25] Are We in the AI-Generated Text World Already? Quantifying and Monitoring AIGT on Social Media
Zhen Sun$^\star$, Zongmin Zhang$^\star$, Xinyue Shen, Ziyi Zhang, Yule Liu, Michael Backes, Yang Zhang, Xinlei He
CCF-A
[arxiv] -
[IEEE S$\&$Pโ25] PEFTGuard: Detecting Backdoor Attacks Against Parameter-Efficient Fine-Tuning
Zhen Sun, Tianshuo Cong, Yule Liu, Chenhao Lin, Xinlei He, Rongmao Chen, Xingshuo Han, and Xinyi Huang.
CCF-A
[arxiv] (AR: 257/1740=14.8%, Cycle 2 AR: 151/1001=15.1%) -
[KDD D&Bโ25]ย On the Generalization Ability of Machine-Generated Text Detectors
Yule Liu$^\star$, Zhiyuan Zhong$^\star$, Yifan Liao, Zhen Sun, Jingyi Zheng, Jiaheng Wei, Qingyuan Gong, Fenghua Tong, Yang Chen, Yang Zhang, Xinlei He
CCF-A
[arxiv] -
[KDD D&Bโ25] TH-Bench: Evaluating Evading Attacks via Humanizing AI Text on Machine-Generated Text Detectors
Jingyi Zheng$^\star$, Junfeng Wang$^\star$, Zhen Sun, Wenhan Dong, Yule Liu, Xinlei He
CCF-A
[arxiv] -
[SENSYS-SocialMetaโ24] AdSpectorX: A Multimodal Expert Spector for Covert Advertising Detection on Chinese Social Media
Zongmin Zhang, Yujie Han, Zhou Zhang, Yule Liu, Jingyi Zheng, and Zhen Sun$^\dagger$.
In Proceedings of the Third International Workshop on Social and Metaverse Computing, Sensing and Networking, pp. 50-56. 2024.
CCF-B
[code] ๐ Best Paper Award
Under Review $\&$ Manuscript
-
Jailbreak Attacks and Defenses Against Large Language Models: A Survey [arxiv]
Sibo Yi$^\star$, Yule Liu$^\star$, Zhen Sun$^\star$, Tianshuo Cong, Xinlei He, Jiaxing Song, Ke Xu, and Qi Li.
-
Quantized Delta Weight Is Safety Keeper [arxiv]
Yule Liu, Zhen Sun, Xinlei He, and Xinyi Huang
-
FC-Attack: Jailbreaking Large Vision-Language Models via Auto-Generated Flowcharts[arxiv]
Ziyi Zhang$^\star$, Zhen Sun$^\star$, Zongmin Zhang, Jihui Guo, Xinlei He
-
The Rising Threat to Emerging AI-Powered Search Engines[arxiv]
Zeren Luo, Zifan Peng, Yule Liu, Zhen Sun, Mingchen Li, Jingyi Zheng, Xinlei He
๐จโ๐Services
Reviewer of Conference
- ICML
- CVPR
- SaTML
- EuroS$\&$P
- AsiaCCS
- AAAI
- MM
Reviewer of Journals
- IEEE Transactions on Dependable and Secure Computing (TDSC)
- ACM Transactions on Privacy and Security (TOPS)
๐ฅ Honors and Awards
- ๐ฅKaggle Competitions Expert (Vincent Sirius)
- 2020.04, MCM/ICM Meritorious Winner
- 2019 / 2020 / 2021, Third-class Scholarship of BUPT
- 2019 / 2020 / 2021, Excellent Student Leader of BUPT
๐ Educations
- 2024.08-now, PhD in Data Science Analysis, Hong Kong University of Science and Technology (Guangzhou)
- 2022.08-2023.10, MSc in Computer Science, City University of Hong Kong
- 2018.09-2022.07, BSc in Computer Science and Technology, Beijing University of Posts and Telecommunications
๐ป Experiences
-
[Research Assistant] 2023.06 - 2024.05, Centre for Artificial Intelligence and Robotics (CAIR) Hong Kong Institute of Science $\&$ Innovation, Chinese Academy of Sciences (HKISI-CAS) - Surgical LLM and Image Segmentation, Supervisor: Dr. Jinlin Wu and Dr. Zhen Chen
-
[Project Participant] 2022.09-2023.08, City University of Hong Kong - Financial Machine Translation, Supervisor: Prof. Linqi Song