Flax Community

non-profit

https://github.com/huggingface/transformers/tree/master/examples/research_projects/jax-projects

AI & ML interests

JAX, Flax, TPU, 🤗

Recent Activity

vinid submitted a paper 23 days ago

Learning to Discover at Test Time

christopher authored a paper 2 months ago

Economies of Open Intelligence: Tracing Power & Participation in the Model Ecosystem

christopher authored a paper 4 months ago

The German Commons - 154 Billion Tokens of Openly Licensed Text for German Language Models

View all activity

amphora

submitted a paper to Daily Papers 6 days ago

Judging What We Cannot Solve: A Consequence-Based Approach for Oracle-Free Evaluation of Research-Level Math

Paper • 2602.06291 • Published 9 days ago • 22

stefan-it

submitted a paper to Daily Papers 16 days ago

FineInstructions: Scaling Synthetic Instructions to Pre-Training Scale

Paper • 2601.22146 • Published 16 days ago • 9

vinid

submitted a paper to Daily Papers 23 days ago

Learning to Discover at Test Time

Paper • 2601.16175 • Published 23 days ago • 41

christopher

authored a paper 2 months ago

Economies of Open Intelligence: Tracing Power & Participation in the Model Ecosystem

Paper • 2512.03073 • Published Nov 27, 2025 • 6

4rtemi5

authored 2 papers 3 months ago

On Space Folds of ReLU Neural Networks

Paper • 2502.09954 • Published Feb 14, 2025

The Space Between: On Folding, Symmetries and Sampling

Paper • 2503.08502 • Published Mar 11, 2025

stefan-it

authored a paper 4 months ago

SindBERT, the Sailor: Charting the Seas of Turkish NLP

Paper • 2510.21364 • Published Oct 24, 2025 • 1

christopher

authored a paper 4 months ago

The German Commons - 154 Billion Tokens of Openly Licensed Text for German Language Models

Paper • 2510.13996 • Published Oct 15, 2025 • 9

stefan-it

authored a paper 4 months ago

The German Commons - 154 Billion Tokens of Openly Licensed Text for German Language Models

Paper • 2510.13996 • Published Oct 15, 2025 • 9

vumichien

authored 2 papers 4 months ago

MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources

Paper • 2509.25531 • Published Sep 29, 2025 • 9

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

Paper • 2510.08697 • Published Oct 9, 2025 • 39

christopher

posted an update 4 months ago

Post

630

Something very cool is cooking at

1 reply

·

stefan-it

authored a paper 5 months ago

Llama-GENBA-10B: A Trilingual Large Language Model for German, English and Bavarian

Paper • 2509.05668 • Published Sep 6, 2025 • 6

nipunsadvilkar

in flax-community/roberta-base-mr 7 months ago

Adding `safetensors` variant of this model

#1 opened 12 months ago by

Mrinal

authored a paper 7 months ago

Multilingual State Space Models for Structured Question Answering in Indic Languages

Paper • 2502.01673 • Published Feb 1, 2025 • 2

fgaim

authored a paper 8 months ago

A Multi-Task Benchmark for Abusive Language Detection in Low-Resource Settings

Paper • 2505.12116 • Published May 17, 2025

amphora

authored a paper 9 months ago

When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research

Paper • 2505.11855 • Published May 17, 2025 • 10

Muennighoff

authored a paper 9 months ago

Crosslingual Reasoning through Test-Time Scaling

Paper • 2505.05408 • Published May 8, 2025 • 8

andreidima

authored a paper 10 months ago

RoQLlama: A Lightweight Romanian Adapted Language Model

Paper • 2410.04269 • Published Oct 5, 2024

Muennighoff

authored a paper 10 months ago

ReasonIR: Training Retrievers for Reasoning Tasks

Paper • 2504.20595 • Published Apr 29, 2025 • 54