Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
haoheliu's picture
96 8 90

haoheliu

haoheliu
gkiwi's profile picture pbotsaris's profile picture ddbbaammoorriinn's profile picture
·
  • haoheliu

AI & ML interests

None yet

Organizations

Centre for Vision, Speech and Signal Processing - University of Surrey's profile picture Audio-AGI's profile picture USCD REACH's profile picture

upvoted a paper 6 months ago

MOSPA: Human Motion Generation Driven by Spatial Audio

Paper • 2507.11949 • Published Jul 16, 2025 • 24
upvoted a paper 11 months ago

Audio-FLAN: A Preliminary Release

Paper • 2502.16584 • Published Feb 23, 2025 • 36
upvoted a paper about 1 year ago

Efficient Audio Captioning with Encoder-Level Knowledge Distillation

Paper • 2407.14329 • Published Jul 19, 2024 • 5
upvoted 2 papers over 1 year ago

SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound

Paper • 2405.00233 • Published Apr 30, 2024 • 17

FlashSpeech: Efficient Zero-Shot Speech Synthesis

Paper • 2404.14700 • Published Apr 23, 2024 • 32
upvoted 3 papers over 2 years ago

AudioSR: Versatile Audio Super-resolution at Scale

Paper • 2309.07314 • Published Sep 13, 2023 • 28

AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining

Paper • 2308.05734 • Published Aug 10, 2023 • 37

WavJourney: Compositional Audio Creation with Large Language Models

Paper • 2307.14335 • Published Jul 26, 2023 • 44
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs