Efficient Vision Encoding for Vision Language Models
AI & ML interests
None defined yet.
Recent Activity
Papers
Sharp Monocular View Synthesis in Less Than a Second
Learning Unmasking Policies for Diffusion Language Models
Team members
973
private
Organization Card
Welcome to the official Hugging Face organization for Apple!
Apple Core ML – Build intelligence into your apps
Core ML is optimized for on-device performance of a broad variety of model types by leveraging Apple Silicon and minimizing memory footprint and power consumption.
- Models
- FastVLM Core ML: On-device Vision-Language Model.
- Depth Anything V2 Core ML: State-of-the-art depth estimation
- DETR Resnet50 Core ML: Semantic Segmentation
- FastViT Core ML: Image Classification
- Stable Diffusion Core ML
- Additional Core ML Model Gallery Models
Apple Machine Learning Research
Open research to enable the community to deliver amazing experiences that improve the lives of millions of people every day.
Models
- MobileCLIP 2: Mobile-friendly SOTA image-text models.
- FastVLM: Efficient Vision Language Models.
- DepthPro: State-of-the-art monocular depth estimation.
- Sharp: 3D gaussians estimation from single images
- OpenELM Base | Instruct: open, Transformer-based language model.
- MobileCLIP: Mobile-friendly image-text models.
- DCLM: State-of-the-art open data language models via dataset curation.
- DFN: State-of-the-art open data CLIP models via dataset curation.
Datasets
- FLAIR: A large image dataset for federated learning.
- DataCompDR: Improved datasets for training image-text models.
Benchmarks
- TiC-CLIP: Benchmark for the design of efficient continual learning of image-text models over years
Select Highlights and Other Resources
- Hugging Face CoreML Examples – Run Core ML models with two lines of code!
- Apple Model Gallery
- New features in Core ML Tools
- Apple Core ML Stable Diffusion – Library to run Stable Diffusion on Apple Silicon with Core ML.
- Hugging Face Blog Posts
models
138
apple/Sharp
Image-to-3D
•
Updated
•
104
apple/CLaRa-7B-Instruct
Updated
•
161
apple/DiffuCoder-7B-Base
8B
•
Updated
•
310
•
26
apple/DiffuCoder-7B-Instruct
8B
•
Updated
•
1.16k
•
57
apple/DiffuCoder-7B-cpGRPO
8B
•
Updated
•
757
•
316
apple/CLaRa-7B-Base
Updated
•
13
apple/CLaRa-7B-E2E
Updated
•
17
apple/starflow
Updated
•
265
apple/CLaRa-7B-Base-16
Updated
•
2
apple/mobileclip2_coca_dfn2b_s13b_context77
Updated
•
10
datasets
10
apple/CLaRa_multi_stage
Viewer
•
Updated
•
1.03M
•
398
•
5
apple/DataCompDR-12M-bf16
Updated
•
5.47k
•
4
apple/DataCompDR-12M
Viewer
•
Updated
•
12.8M
•
4.33k
•
31
apple/DataCompDR-1B
Viewer
•
Updated
•
1.28B
•
16.5k
•
27
apple/DataComp-12M
Viewer
•
Updated
•
12.8M
•
146
•
3
apple/GSM-Symbolic
Viewer
•
Updated
•
12.5k
•
1.12k
•
20
apple/mmau
Preview
•
Updated
•
284
•
4
apple/TiC-DataComp
Preview
•
Updated
•
2.15k
•
3
apple/flair
Viewer
•
Updated
•
429k
•
260
•
16
apple/mkqa
Updated
•
577
•
39