deepseek-ai/DeepSeek-V3.1-Terminus Text Generation • 685B • Updated about 1 month ago • 16.7k • • 332
NextCoder Collection NextCoder family of code-editing LMs developed with Selective Knowledge Transfer and its training data. • 6 items • Updated Jul 9 • 71
Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks Paper • 2505.16901 • Published May 22 • 47
MiniMax-M1 Collection MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model. • 6 items • Updated 8 days ago • 111
view article Article Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub Jun 12 • 147