MobileLLM-R1 Collection • MobileLLM-R1, a series of sub-billion-parameter reasoning models • 7 items • Updated 15 days ago • 19
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency Paper • 2508.18265 • Published Aug 25 • 201
NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale Paper • 2508.10711 • Published Aug 14 • 142
Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving Paper • 2507.23726 • Published Jul 31 • 112
Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance Paper • 2507.22448 • Published Jul 30 • 65
MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning Paper • 2507.16812 • Published Jul 22 • 63
Kimi-K2 Collection • Moonshot's MoE LLMs with 1 trillion parameters and exceptional agentic intelligence • 3 items • Updated 2 days ago • 128
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention Paper • 2506.13585 • Published Jun 16 • 267
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents Paper • 2506.11763 • Published Jun 13 • 70
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14 • 117
Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model Paper • 2502.10248 • Published Feb 14 • 55
HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips Paper • 1906.03327 • Published Jun 7, 2019 • 1