Kevin King PRO
NeoCodes-dev
·
AI & ML interests
Deep RL, RL for LLMs
Recent Activity
updated
a collection
1 day ago
ActionLanguageModels
updated
a collection
1 day ago
VLMs - Robotics
updated
a collection
1 day ago
VLMs - Robotics
Organizations
Benchmarks
Datasets - Agents
Datasets - Coding
ARC-AGI2
VLMs - Robotics
Embedding Models
ICON - Help Agent
-
Console-AI/IT-helpdesk-synthetic-tickets
Viewer • Updated • 500 • 132 • 4 -
aakash0017/it-support-llm
Viewer • Updated • 1.92k • 111 • 3 -
elsonj/IT-Support-Finetuned-DeepSeek-BitWitDataset
Viewer • Updated • 521 • 47 • 1 -
Sleeping13
CrewAI Gradio Support Agent
👁13Build support agent with CrewAI multi-agents and Gradio
Datasets - CryptoSage
VLMs
Agents
Classifier Models
LLMs
Datasets - Pretraining
OCR/Document Processing
ActionLanguageModels
Datasets - MultiModal
Agent-Specific/Function-Calling Models
Datasets - Robotics
-
nvidia/PhysicalAI-Robotics-Manipulation-Kitchen
Viewer • Updated • 405k • 1.39k • 10 -
nvidia/PhysicalAI-Robotics-Manipulation-SingleArm
Updated • 14.3k • 13 -
nvidia/PhysicalAI-SimReady-Warehouse-01
Viewer • Updated • 753 • 7.72k • 29 -
manycore-research/SpatialLM-Testset
Viewer • Updated • 107 • 1.71k • 60
MMMs
Models - CryptoSage
Datasets - Reasoning
Spaces
Research Papers
-
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper • 2408.03314 • Published • 63 -
TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning
Paper • 2502.15425 • Published • 9 -
EgoLife: Towards Egocentric Life Assistant
Paper • 2503.03803 • Published • 46 -
Visual-RFT: Visual Reinforcement Fine-Tuning
Paper • 2503.01785 • Published • 85
DataSets
Pokemon_Red_Experiments
Datasets - Pretraining
Benchmarks
OCR/Document Processing
Datasets - Agents
ActionLanguageModels
Datasets - Coding
Datasets - MultiModal
ARC-AGI2
Agent-Specific/Function-Calling Models
VLMs - Robotics
Datasets - Robotics
-
nvidia/PhysicalAI-Robotics-Manipulation-Kitchen
Viewer • Updated • 405k • 1.39k • 10 -
nvidia/PhysicalAI-Robotics-Manipulation-SingleArm
Updated • 14.3k • 13 -
nvidia/PhysicalAI-SimReady-Warehouse-01
Viewer • Updated • 753 • 7.72k • 29 -
manycore-research/SpatialLM-Testset
Viewer • Updated • 107 • 1.71k • 60
Embedding Models
MMMs
ICON - Help Agent
-
Console-AI/IT-helpdesk-synthetic-tickets
Viewer • Updated • 500 • 132 • 4 -
aakash0017/it-support-llm
Viewer • Updated • 1.92k • 111 • 3 -
elsonj/IT-Support-Finetuned-DeepSeek-BitWitDataset
Viewer • Updated • 521 • 47 • 1 -
Sleeping13
CrewAI Gradio Support Agent
👁13Build support agent with CrewAI multi-agents and Gradio
Models - CryptoSage
Datasets - CryptoSage
Datasets - Reasoning
VLMs
Spaces
Agents
Research Papers
-
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper • 2408.03314 • Published • 63 -
TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning
Paper • 2502.15425 • Published • 9 -
EgoLife: Towards Egocentric Life Assistant
Paper • 2503.03803 • Published • 46 -
Visual-RFT: Visual Reinforcement Fine-Tuning
Paper • 2503.01785 • Published • 85
Classifier Models
DataSets
LLMs