GUARD-SLM: Token Activation-Based Defense Against Jailbreak Attacks for Small Language Models
Preprint
Mia, Md Jueal, Molto, Joaquin, Wu, Yanzhao et al. (2026). GUARD-SLM: Token Activation-Based Defense Against Jailbreak Attacks for Small Language Models
. 10.48550/arxiv.2603.28817
Mia, Md Jueal, Molto, Joaquin, Wu, Yanzhao et al. (2026). GUARD-SLM: Token Activation-Based Defense Against Jailbreak Attacks for Small Language Models
. 10.48550/arxiv.2603.28817