taghizadeh 's Collections llms
updated
LLaMA Beyond English: An Empirical Study on Language Capability Transfer
Paper
• 2401.01055
• Published
• 55
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language
Models
Paper
• 2401.01335
• Published
• 68
DocLLM: A layout-aware generative language model for multimodal document
understanding
Paper
• 2401.00908
• Published
• 189
Multilingual Instruction Tuning With Just a Pinch of Multilinguality
Paper
• 2401.01854
• Published
• 11
Understanding LLMs: A Comprehensive Overview from Training to Inference
Paper
• 2401.02038
• Published
• 65
TinyLlama: An Open-Source Small Language Model
Paper
• 2401.02385
• Published
• 95
LLaMA Pro: Progressive LLaMA with Block Expansion
Paper
• 2401.02415
• Published
• 54
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence
Lengths in Large Language Models
Paper
• 2401.04658
• Published
• 27
The Impact of Reasoning Step Length on Large Language Models
Paper
• 2401.04925
• Published
• 18
MaLA-500: Massive Language Adaptation of Large Language Models
Paper
• 2401.13303
• Published
• 12
mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document
Understanding
Paper
• 2403.12895
• Published
• 32
Localizing Paragraph Memorization in Language Models
Paper
• 2403.19851
• Published
• 15
Transformer-Lite: High-efficiency Deployment of Large Language Models on
Mobile Phone GPUs
Paper
• 2403.20041
• Published
• 34
LLoCO: Learning Long Contexts Offline
Paper
• 2404.07979
• Published
• 22
Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language
Models
Paper
• 2404.12387
• Published
• 39
OpenELM: An Efficient Language Model Family with Open-source Training
and Inference Framework
Paper
• 2404.14619
• Published
• 126