inclusionAI
Team
community
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
FunReason-MT Technical Report: Overcoming the Complexity Barrier in Multi-Turn Function Calling
ARGenSeg: Image Segmentation with Autoregressive Image Generation Model
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI, derived from Ling.
The Agent Runtime for Self-Improvement
-
UI-Venus Technical Report: Building High-performance UI Agents with RFT
Paper • 2508.10833 • Published • 43 -
inclusionAI/UI-Venus-Ground-7B
Image-Text-to-Text • 8B • Updated • 1.51k • 19 -
inclusionAI/UI-Venus-Ground-72B
Image-Text-to-Text • 73B • Updated • 280 • 11 -
inclusionAI/UI-Venus-Navi-7B
Image-Text-to-Text • 8B • Updated • 522 • 10
-
Ming-Omni: A Unified Multimodal Model for Perception and Generation
Paper • 2506.09344 • Published • 28 -
inclusionAI/Ming-Lite-Omni
Any-to-Any • 19B • Updated • 65 • 192 -
inclusionAI/Ming-Lite-Omni-1.5
Any-to-Any • 19B • Updated • 769 • 81 -
inclusionAI/Ming-UniAudio-16B-A3B
Any-to-Any • 18B • Updated • 3.74k • 53
-
inclusionAI/Ming-flash-omni-Preview
Any-to-Any • 104B • Updated • 5.34k • 48 -
Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation
Paper • 2510.24821 • Published • 27 -
inclusionAI/MingTok-Vision
Image Feature Extraction • 0.7B • Updated • 971 • 31 -
inclusionAI/Ming-UniVision-16B-A3B
Any-to-Any • 19B • Updated • 340 • 58
GroveMoE is an open-source family of large language models developed by the AGI Center, Ant Research Institute.
AReaL-boba-2
-
inclusionAI/Ming-flash-omni-Preview
Any-to-Any • 104B • Updated • 5.34k • 48 -
Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation
Paper • 2510.24821 • Published • 27 -
inclusionAI/MingTok-Vision
Image Feature Extraction • 0.7B • Updated • 971 • 31 -
inclusionAI/Ming-UniVision-16B-A3B
Any-to-Any • 19B • Updated • 340 • 58
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI, derived from Ling.
The Agent Runtime for Self-Improvement
GroveMoE is an open-source family of large language models developed by the AGI Center, Ant Research Institute.
-
UI-Venus Technical Report: Building High-performance UI Agents with RFT
Paper • 2508.10833 • Published • 43 -
inclusionAI/UI-Venus-Ground-7B
Image-Text-to-Text • 8B • Updated • 1.51k • 19 -
inclusionAI/UI-Venus-Ground-72B
Image-Text-to-Text • 73B • Updated • 280 • 11 -
inclusionAI/UI-Venus-Navi-7B
Image-Text-to-Text • 8B • Updated • 522 • 10
AReaL-boba-2
-
Ming-Omni: A Unified Multimodal Model for Perception and Generation
Paper • 2506.09344 • Published • 28 -
inclusionAI/Ming-Lite-Omni
Any-to-Any • 19B • Updated • 65 • 192 -
inclusionAI/Ming-Lite-Omni-1.5
Any-to-Any • 19B • Updated • 769 • 81 -
inclusionAI/Ming-UniAudio-16B-A3B
Any-to-Any • 18B • Updated • 3.74k • 53