2 37 14

haoxintong

AI & ML interests

None yet

Recent Activity

upvoted a paper 13 days ago

DeepAnalyze: Agentic Large Language Models for Autonomous Data Science

upvoted a paper 17 days ago

RePro: Training Language Models to Faithfully Recycle the Web for Pretraining

upvoted a paper about 1 month ago

Seedream 4.0: Toward Next-generation Multimodal Image Generation

View all activity

Organizations

upvoted a paper 13 days ago

DeepAnalyze: Agentic Large Language Models for Autonomous Data Science

Paper • 2510.16872 • Published 15 days ago • 90

upvoted a paper 17 days ago

RePro: Training Language Models to Faithfully Recycle the Web for Pretraining

Paper • 2510.10681 • Published 22 days ago • 5

upvoted 2 papers about 1 month ago

Seedream 4.0: Toward Next-generation Multimodal Image Generation

Paper • 2509.20427 • Published Sep 24 • 76

Synthetic bootstrapped pretraining

Paper • 2509.15248 • Published Sep 17 • 8

upvoted a paper 2 months ago

TiKMiX: Take Data Influence into Dynamic Mixture for Language Model Pre-training

Paper • 2508.17677 • Published Aug 25 • 14

liked 2 models 2 months ago

ByteDance-Seed/Seed-OSS-36B-Base-woSyn

Text Generation • 36B • Updated Aug 26 • 353 • 51

ByteDance-Seed/Seed-OSS-36B-Base

Text Generation • 36B • Updated Aug 26 • 2.95k • 53

upvoted a collection 2 months ago

Seed-OSS

Collection

Seed-OSS Open-Source Models • 3 items • Updated Aug 20 • 58

liked a dataset 3 months ago

miromind-ai/MiroVerse-v0.1

Viewer • Updated Sep 18 • 228k • 408 • 75

upvoted 3 papers 3 months ago

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Paper • 2508.02317 • Published Aug 4 • 19

Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving

Paper • 2507.23726 • Published Jul 31 • 113

GR-3 Technical Report

Paper • 2507.15493 • Published Jul 21 • 47

upvoted a paper 4 months ago

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Paper • 2507.10532 • Published Jul 14 • 88

upvoted an article 4 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8

• 708

upvoted 2 papers 4 months ago

Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test

Paper • 2506.21551 • Published Jun 26 • 28

LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning

Paper • 2506.18841 • Published Jun 23 • 56

upvoted 4 papers 5 months ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 268

Seedance 1.0: Exploring the Boundaries of Video Generation Models

Paper • 2506.09113 • Published Jun 10 • 102

Cartridges: Lightweight and general-purpose long context representations via self-study

Paper • 2506.06266 • Published Jun 6 • 6

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 262

haoxintong

AI & ML interests

Recent Activity

Organizations

haoxintong's activity

SmolLM3: smol, multilingual, long-context reasoner