Rui Zhang
Hi, I am Rui Zhang. I am a Ph.D. student at the University of Electronic Science and Technology of China (UESTC), supervised by Prof. Hongwei Li.
I was a visiting Ph.D. student at Nanyang Technological University (NTU, 2025-2026), working with Prof. Yang Liu. I feel fortunate to be mentored by Yang Zhang.
My research interests lie in Trustworthy Machine Learning, including the Security and Safety of Large Language Models (LLMs).
Recent Updates
(2025.04) Our paper titled “Hidden Tail: Adversarial Attack for Stealthy Resource Consumption against Vision-Language Models” was accepted by IEEE TDSC.
(2026.04) Our paper titled “The Art of (Mis)alignment: How Fine-Tuning Methods Effectively Misalign and Realign LLMs in Post-Training” was accepted by ACL Findings 2026.
(2026.01) Our paper titled “Backdoor Complications: A Comprehensive Analysis and Mitigation of the Unforeseen Consequences of Backdoor Attacks” was accepted by IEEE TDSC.
(2025.11) Our paper titled “MPMA: Preference Manipulation Attack Against Model Context Protocol” was accepted by AAAI 2026 (Oral).
(2025.11) Our paper titled “Confguard: A Simple and Effective Backdoor Detection for Large Language Models” was accepted by AAAI 2026.
