- InfLLM-V2: Dense-Sparse Switchable Attention for Seamless Short-to-Long Adaptation. Paper 2509.24663, published Sep 29, 2025.
- MiniCPM4 Collection: MiniCPM4: Ultra-Efficient LLMs on End Devices. 29 items, updated Sep 8, 2025.
- APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs. Paper 2502.12085, published Feb 17, 2025.
- FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling. Paper 2502.14856, published Feb 20, 2025.