Zhen Sun

Trustworthy AI | AI Safety, Security, and Privacy

Research Theme

Trustworthy AI

PhD Student in Data Science and Analytics at The Hong Kong University of Science and Technology (Guangzhou).
Advised by Prof. Xinlei He (Wuhan University), with primary supervision from Prof. Jiaheng Wei and co-supervision from Prof. Yutao Yue at HKUST(GZ).

Google Scholar | Email | GitHub

Research Interests

Trustworthy AI | AI Security & Privacy | AI-Generated Content Detection | AI for Safety

News

Paper Updates

🎉 2026.01: JALMBench was accepted at ICLR 2026.
🎉 2025.12: 6DAttack was accepted at AAAI 2026 as an Oral presentation.
🎉 2025.09: FC-Attack was accepted at EMNLP 2025 Findings.
🎉 2025.09: CHASM was accepted at NeurIPS 2025.
🎉 2025.06: Unsafe LLM-Based Search was accepted at USENIX Security 2025.
🎉 2025.05: AIGT on Social Media was accepted at ACL 2025, and MGT Generalization and TH-Bench were accepted at KDD 2025.
🎉 2025.03: PEFTGuard was accepted at IEEE S&P 2025.

Awards and Milestones

🏆 2025.12: I received the 2025 DSA Excellent Research Award.
🏆 2024.11: AdSpectorX received the Best Paper Award at SENSYS-SocialMeta 2024.
🏆 2024.06: I received my firm PhD offer from HKUST(GZ).

Selected Papers

^* denotes equal contribution. ^† denotes corresponding author.

IEEE S&P 2025 · CCF-A
PEFTGuard: Detecting Backdoor Attacks Against Parameter-Efficient Fine-Tuning
Zhen Sun, Tianshuo Cong, Yule Liu, Chenhao Lin, Xinlei He, Rongmao Chen, Xingshuo Han, Xinyi Huang.

ACL 2025 · CCF-A
Are We in the AI-Generated Text World Already? Quantifying and Monitoring AIGT on Social Media
Zhen Sun^*, Zongmin Zhang^*, Xinyue Shen, Ziyi Zhang, Yule Liu, Michael Backes, Yang Zhang, Xinlei He.

EMNLP 2025 Findings · CCF-B
FC-Attack: Jailbreaking Large Vision-Language Models via Auto-Generated Flowcharts
Ziyi Zhang^*, Zhen Sun^*, Zongmin Zhang, Jihui Guo, Xinlei He.

KDD 2025 · CCF-A
TH-Bench: Evaluating Evading Attacks via Humanizing AI Text on Machine-Generated Text Detectors
Jingyi Zheng, Junfeng Wang, Zhen Sun, Wenhan Dong, Yule Liu, Xinlei He.

AAAI 2026 · CCF-A · Oral
6DAttack: Backdoor Attacks in the 6DoF Pose Estimation
Jihui Guo, Zongmin Zhang, Zhen Sun, Yuhao Yang, Jinlin Wu, Fu Zhang, Xinlei He.

ICLR 2026 · CCF-A
JALMBench: Benchmarking Jailbreak Vulnerabilities in Audio Language Models
Zifan Peng, Yule Liu, Zhen Sun, Mingchen Li, Zeren Luo, Jingyi Zheng, Wenhan Dong, Xinlei He, Xuechao Wang, Yingjie Xue, Shengmin Xu, Xinyi Huang.

KDD 2025 · CCF-A
On the Generalization and Adaptation Ability of Machine-Generated Text Detectors in Academic Writing
Yule Liu, Zhiyuan Zhong, Yifan Liao, Zhen Sun, Jingyi Zheng, Jiaheng Wei, Qingyuan Gong, Fenghua Tong, Yang Chen, Yang Zhang, Xinlei He.

USENIX Security 2025 · CCF-A
Unsafe LLM-Based Search: Quantitative Analysis and Mitigation of Safety Risks in AI Web Search
Zeren Luo, Zifan Peng, Yule Liu, Zhen Sun, Mingchen Li, Jingyi Zheng, Xinlei He.

NeurIPS 2025 · CCF-A
CHASM: Unveiling Covert Advertisements on Chinese Social Media
Jingyi Zheng, Tianyi Hu, Yule Liu, Zhen Sun, Zongmin Zhang, Zifan Peng, Wenhan Dong, Xinlei He.

SENSYS-SocialMeta 2024 · Best Paper
AdSpectorX: A Multimodal Expert Spector for Covert Advertising Detection on Chinese Social Media
Zongmin Zhang, Yujie Han, Zhou Zhang, Yule Liu, Jingyi Zheng, Zhen Sun^†.

Services

Conference PC / Reviewer

The Web Conference 2025 Web4Good Track
AAAI
ACM MM
ICML
CVPR
ACL
EMNLP
SaTML
EuroS&P
AsiaCCS

Journal Reviewer

IEEE Transactions on Dependable and Secure Computing (TDSC)
IEEE Transactions on Information Forensics and Security (TIFS)
ACM Transactions on Privacy and Security (TOPS)
International Journal of Human-Computer Interaction (IJHCI)

Honors and Awards

2025, DSA Excellent Research Award
Kaggle Competitions Expert (Vincent Sirius)
2020.04, MCM/ICM Meritorious Winner
2019 / 2020 / 2021, Third-class Scholarship of BUPT
2019 / 2020 / 2021, Excellent Student Leader of BUPT

Education

2024.08-present, PhD in Data Science and Analytics, The Hong Kong University of Science and Technology (Guangzhou)
2022.08-2023.10, MSc in Computer Science, City University of Hong Kong
2018.09-2022.07, BSc in Computer Science and Technology, Beijing University of Posts and Telecommunications

Experience

Research Assistant, 2023.06-2024.05, Centre for Artificial Intelligence and Robotics (CAIR), Hong Kong Institute of Science & Innovation, Chinese Academy of Sciences (HKISI-CAS). Worked on surgical LLMs and image segmentation. Supervisor: Dr. Jinlin Wu.
Project Participant, 2022.09-2023.08, City University of Hong Kong. Worked on financial machine translation. Supervisor: Prof. Linqi Song.