Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Zihao Zhu's picture
1 4 14

Zihao Zhu

ZihaoZhu
wrt's profile picture yangjunxiao2021's profile picture
·
https://zihao-ai.github.io/
  • zihao-ai

AI & ML interests

LLM safety

Organizations

None yet

authored a paper 3 months ago

AdvChain: Adversarial Chain-of-Thought Tuning for Robust Safety Alignment of Large Reasoning Models

Paper • 2509.24269 • Published Sep 29, 2025 • 3
authored a paper 4 months ago

Loong: Synthesize Long Chain-of-Thoughts at Scale through Verifiers

Paper • 2509.03059 • Published Sep 3, 2025 • 24
authored 3 papers 10 months ago

VDC: Versatile Data Cleanser for Detecting Dirty Samples via Visual-Linguistic Inconsistency

Paper • 2309.16211 • Published Sep 28, 2023

BoT: Breaking Long Thought Processes of o1-like Large Language Models through Backdoor Attack

Paper • 2502.12202 • Published Feb 16, 2025

BackdoorBench: A Comprehensive Benchmark of Backdoor Learning

Paper • 2206.12654 • Published Jun 25, 2022 • 1
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs