Joon Kim

Joonkkk

upvoted 4 papers 3 months ago

Fine-tuning Done Right in Model Editing

Paper • 2509.22072 • Published Sep 26, 2025 • 28

VLA-RFT: Vision-Language-Action Reinforcement Fine-tuning with Verified Rewards in World Simulators

Paper • 2510.00406 • Published Oct 1, 2025 • 65

Advancing Speech Understanding in Speech-Aware Language Models with GRPO

Paper • 2509.16990 • Published Sep 21, 2025 • 18

ExGRPO: Learning to Reason from Experience

Paper • 2510.02245 • Published Oct 2, 2025 • 80

upvoted 6 papers 4 months ago

AbGen: Evaluating Large Language Models in Ablation Study Design and Evaluation for Scientific Research

Paper • 2507.13300 • Published Jul 17, 2025 • 19

NoHumansRequired: Autonomous High-Quality Image Editing Triplet Mining

Paper • 2507.14119 • Published Jul 18, 2025 • 58

ATLAS: Decoupling Skeletal and Shape Parameters for Expressive Parametric Human Modeling

Paper • 2508.15767 • Published Aug 21, 2025 • 17

From reactive to cognitive: brain-inspired spatial intelligence for embodied agents

Paper • 2508.17198 • Published Aug 24, 2025 • 9

FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehensive Benchmark

Paper • 2509.09680 • Published Sep 11, 2025 • 43

Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Deliberation

Paper • 2509.14760 • Published Sep 18, 2025 • 53

upvoted 8 papers 8 months ago

The Aloe Family Recipe for Open and Specialized Healthcare LLMs

Paper • 2505.04388 • Published May 7, 2025 • 26

RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference

Paper • 2505.02922 • Published May 5, 2025 • 28

Visual Agentic Reinforcement Fine-Tuning

Paper • 2505.14246 • Published May 20, 2025 • 32

Softpick: No Attention Sink, No Massive Activations with Rectified Softmax

Paper • 2504.20966 • Published Apr 29, 2025 • 31

Efficient Agent Training for Computer Use

Paper • 2505.13909 • Published May 20, 2025 • 44

UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning

Paper • 2505.14231 • Published May 20, 2025 • 52

Delta Attention: Fast and Accurate Sparse Attention Inference by Delta Correction

Paper • 2505.11254 • Published May 16, 2025 • 48

Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models

Paper • 2505.14810 • Published May 20, 2025 • 62