Streaming Sequence-to-Sequence Learning with Delayed Streams Modeling Paper • 2509.08753 • Published Sep 10 • 2
Olmo 3.1 Collection The latest members of the Olmo 3 family: another 3 weeks of RL for 32B Think, the 32B Instruct model, large post-training research datasets... • 9 items • Updated 3 days ago • 36
Uni-MoE 2.0 Collection The second version of omimodal large model Uni-MoE • 5 items • Updated Nov 20 • 1
Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data Paper • 2511.12609 • Published Nov 16 • 103
ZipVoice-Dialog: Non-Autoregressive Spoken Dialogue Generation with Flow Matching Paper • 2507.09318 • Published Jul 12 • 2
📙 LLM Engineer's Handbook Collection Models and datasets from my book. All the code is freely available at https://github.com/PacktPublishing/LLM-Engineers-Handbook • 6 items • Updated Apr 7 • 14