MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models Paper • 2508.17467 • Published Aug 24, 2025
PagedEviction: Structured Block-wise KV Cache Pruning for Efficient Large Language Model Inference Paper • 2509.04377 • Published Sep 4, 2025
LExI: Layer-Adaptive Active Experts for Efficient MoE Model Inference Paper • 2509.02753 • Published Sep 2, 2025
ImageNet-Think-250K: A Large-Scale Synthetic Dataset for Multimodal Reasoning for Vision Language Models Paper • 2510.01582 • Published Oct 2, 2025
Swift: An Autoregressive Consistency Model for Efficient Weather Forecasting Paper • 2509.25631 • Published Sep 30, 2025 • 1
AERIS: Argonne Earth Systems Model for Reliable and Skillful Predictions Paper • 2509.13523 • Published Sep 16, 2025 • 7
AERIS: Argonne Earth Systems Model for Reliable and Skillful Predictions Paper • 2509.13523 • Published Sep 16, 2025 • 7
Reflections from the 2024 Large Language Model (LLM) Hackathon for Applications in Materials Science and Chemistry Paper • 2411.15221 • Published Nov 20, 2024 • 30
A Survey of Techniques for Optimizing Transformer Inference Paper • 2307.07982 • Published Jul 16, 2023
Scalable Reinforcement-Learning-Based Neural Architecture Search for Cancer Deep Learning Research Paper • 1909.00311 • Published Sep 1, 2019
Applications of Machine Learning to Lattice Quantum Field Theory Paper • 2202.05838 • Published Feb 10, 2022 • 1
DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies Paper • 2310.04610 • Published Oct 6, 2023 • 1
view post Post 👋 I'm SamI use ML and HPC to accelerate scientific discovery @ Argonne National Laboratory*https://samforeman.mehttps://twitter.com/saforem2https://github.com/saforem2* https://alcf.anl.gov/about/people/sam-foreman 🤗 5 5 + Reply
LeapfrogLayers: A Trainable Framework for Effective Topological Sampling Paper • 2112.01582 • Published Dec 2, 2021
A Comprehensive Performance Study of Large Language Models on Novel AI Accelerators Paper • 2310.04607 • Published Oct 6, 2023
A Comprehensive Performance Study of Large Language Models on Novel AI Accelerators Paper • 2310.04607 • Published Oct 6, 2023
MLMC: Machine Learning Monte Carlo for Lattice Gauge Theory Paper • 2312.08936 • Published Dec 14, 2023