Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models Paper • 2511.08577 • Published Nov 11, 2025 • 106
Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B Paper • 2511.06221 • Published Nov 9, 2025 • 132
Pre-training Dataset Samples Collection A collection of pre-training dataset samples of 10M, 100M, and 1B tokens. Ideal for quick experimentation and ablations. • 19 items • Updated 25 days ago • 18
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs Paper • 2510.11696 • Published Oct 13, 2025 • 178
Scaling Latent Reasoning via Looped Language Models Paper • 2510.25741 • Published Oct 29, 2025 • 221
Kimi Linear: An Expressive, Efficient Attention Architecture Paper • 2510.26692 • Published Oct 30, 2025 • 120
The End of Manual Decoding: Towards Truly End-to-End Language Models Paper • 2510.26697 • Published Oct 30, 2025 • 116
Model Merging in Pre-training of Large Language Models Paper • 2505.12082 • Published May 17, 2025 • 40
Energy-Based Transformers are Scalable Learners and Thinkers Paper • 2507.02092 • Published Jul 2, 2025 • 69
Transition Models: Rethinking the Generative Learning Objective Paper • 2509.04394 • Published Sep 4, 2025 • 28
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs Paper • 2508.16153 • Published Aug 22, 2025 • 160
CRISP: Persistent Concept Unlearning via Sparse Autoencoders Paper • 2508.13650 • Published Aug 19, 2025 • 16
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification Paper • 2508.05629 • Published Aug 7, 2025 • 181
Article Tiny Agents in Python: a MCP-powered agent in ~70 lines of code • Published May 23, 2025 • 170