Kevin King PRO
NeoCodes-dev
AI & ML interests
Deep RL, RL for LLMs
Recent Activity
updated
a collection
1 day ago
Research Papers
upvoted
an
article
10 days ago
Building the Open Agent Ecosystem Together: Introducing OpenEnv
updated
a collection
27 days ago
Research Papers
Organizations
ActionLanguageModels
Datasets - MultiModal
Agent-Specific/Function-Calling Models
Datasets - Robotics
-
nvidia/PhysicalAI-Robotics-Manipulation-Kitchen
Viewer • Updated • 405k • 1.67k • 10 -
nvidia/PhysicalAI-Robotics-Manipulation-SingleArm
Updated • 6.18k • 13 -
nvidia/PhysicalAI-SimReady-Warehouse-01
Viewer • Updated • 753 • 2.21k • 27 -
manycore-research/SpatialLM-Testset
Viewer • Updated • 107 • 1.55k • 60
MMMs
Models - CryptoSage
Datasets - Reasoning
Spaces
Research Papers
-
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper • 2408.03314 • Published • 63 -
TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning
Paper • 2502.15425 • Published • 9 -
EgoLife: Towards Egocentric Life Assistant
Paper • 2503.03803 • Published • 46 -
Visual-RFT: Visual Reinforcement Fine-Tuning
Paper • 2503.01785 • Published • 84
DataSets
Datasets - Agents
Datasets - Coding
ARC-AGI2
VLMs - Robotics
Embedding Models
ICON - Help Agent
-
Console-AI/IT-helpdesk-synthetic-tickets
Viewer • Updated • 500 • 109 • 2 -
aakash0017/it-support-llm
Viewer • Updated • 1.92k • 11 • 3 -
elsonj/IT-Support-Finetuned-DeepSeek-BitWitDataset
Viewer • Updated • 521 • 16 • 1 -
Sleeping1313
CrewAI Gradio Support Agent
👁Build support agent with CrewAI multi-agents and Gradio
Datasets - CryptoSage
VLMs
Agents
Classifier Models
LLMs
OCR/Document Processing
Datasets - Agents
ActionLanguageModels
Datasets - Coding
Datasets - MultiModal
ARC-AGI2
Agent-Specific/Function-Calling Models
VLMs - Robotics
Datasets - Robotics
-
nvidia/PhysicalAI-Robotics-Manipulation-Kitchen
Viewer • Updated • 405k • 1.67k • 10 -
nvidia/PhysicalAI-Robotics-Manipulation-SingleArm
Updated • 6.18k • 13 -
nvidia/PhysicalAI-SimReady-Warehouse-01
Viewer • Updated • 753 • 2.21k • 27 -
manycore-research/SpatialLM-Testset
Viewer • Updated • 107 • 1.55k • 60
Embedding Models
MMMs
ICON - Help Agent
-
Console-AI/IT-helpdesk-synthetic-tickets
Viewer • Updated • 500 • 109 • 2 -
aakash0017/it-support-llm
Viewer • Updated • 1.92k • 11 • 3 -
elsonj/IT-Support-Finetuned-DeepSeek-BitWitDataset
Viewer • Updated • 521 • 16 • 1 -
Sleeping1313
CrewAI Gradio Support Agent
👁Build support agent with CrewAI multi-agents and Gradio
Models - CryptoSage
Datasets - CryptoSage
Datasets - Reasoning
VLMs
Spaces
Agents
Research Papers
-
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper • 2408.03314 • Published • 63 -
TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning
Paper • 2502.15425 • Published • 9 -
EgoLife: Towards Egocentric Life Assistant
Paper • 2503.03803 • Published • 46 -
Visual-RFT: Visual Reinforcement Fine-Tuning
Paper • 2503.01785 • Published • 84
Classifier Models
DataSets
LLMs