Zihao Zhu's picture

1 4 14

Zihao Zhu

ZihaoZhu

·

https://zihao-ai.github.io/

zihao-ai

AI & ML interests

LLM safety

Organizations

None yet

authored a paper 3 months ago

AdvChain: Adversarial Chain-of-Thought Tuning for Robust Safety Alignment of Large Reasoning Models

Paper • 2509.24269 • Published Sep 29, 2025 • 3

authored a paper 4 months ago

Loong: Synthesize Long Chain-of-Thoughts at Scale through Verifiers

Paper • 2509.03059 • Published Sep 3, 2025 • 24

authored 3 papers 10 months ago

VDC: Versatile Data Cleanser for Detecting Dirty Samples via Visual-Linguistic Inconsistency

Paper • 2309.16211 • Published Sep 28, 2023

BoT: Breaking Long Thought Processes of o1-like Large Language Models through Backdoor Attack

Paper • 2502.12202 • Published Feb 16, 2025

BackdoorBench: A Comprehensive Benchmark of Backdoor Learning

Paper • 2206.12654 • Published Jun 25, 2022 • 1