
Yi Zeng

yizeng

AI & ML interests

None yet

Organizations

LLM-Tuning-Safety, Responsible Data Science Lab, SORRY-Bench

upvoted a paper 11 months ago

DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails

Paper • 2502.05163 • Published Feb 7, 2025 • 22
upvoted a collection over 1 year ago

BEEAR

Collection
These models are used for re-implementation of our paper: "BEEAR: Embedding-based Adversarial Removal of Safety Backdoors in Instruction" • 8 items • Updated Jun 28, 2024 • 2