Chuanming Liu

Chuanming

AI & ML interests

Artificial Intelligence, AGI, NLP, LLMs, Multimodality, MLSys. Python/Golang/C/C++/Shell/awk&sed

Recent Activity

liked a model 4 days ago

mlx-community/whisper-large-v3-mlx

upvoted an article 4 days ago

Fine-Tune Whisper with 🤗 Transformers

liked a model 4 days ago

Banafo/Kroko-ASR

Organizations

upvoted 2 articles 4 days ago

Article

Fine-Tune Whisper with 🤗 Transformers

Nov 3, 2022

• 317

Article

Supercharge your OCR Pipelines with Open Models

8 days ago

• 210

upvoted 2 papers 17 days ago

MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder

Paper • 2505.07916 • Published May 12 • 132

Finite Scalar Quantization Enables Redundant and Transmission-Robust Neural Audio Compression at Low Bit-rates

Paper • 2509.09550 • Published Sep 11 • 2

upvoted 2 collections about 1 month ago

Qwen3Guard

7 items • Updated 29 days ago • 53

Qwen3-Omni

6 items • Updated 20 days ago • 162

upvoted an article about 1 month ago

Article

Understanding Vector Quantization in VQ-VAE

Aug 28, 2024

• 48

upvoted a paper about 2 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 306

upvoted an article about 2 months ago

Article

From GRPO to DAPO and GSPO: What, Why, and How

Aug 9

• 47

upvoted 2 collections about 2 months ago

PP-StructureV3

PP-StructureV3 is a SOTA document parsing solution on OmniDocBench, supporting the conversion of PDFs and document images to Markdown and JSON. • 17 items • Updated Sep 15 • 9

PP-OCRv5

PP-OCRv5 is the latest text recognition solution, supporting Simplified Chinese, Chinese Pinyin, Traditional Chinese, English, and Japanese • 13 items • Updated Sep 15 • 48

upvoted a paper 2 months ago

Step-Audio 2 Technical Report

Paper • 2507.16632 • Published Jul 22 • 72

upvoted 3 collections 2 months ago

Marvis-TTS-250m-v0.1

5 items • Updated Aug 26 • 26

AFM-Datasets

Training datasets of the paper: Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL • 6 items • Updated Aug 6 • 5

AFM-Models

The models and training dataset of the paper: Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL • 12 items • Updated Aug 6 • 16

upvoted a paper 2 months ago

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6 • 127

upvoted a collection 2 months ago

Seed-OSS

Seed-OSS Open-Source Models • 3 items • Updated Aug 20 • 58

upvoted an article 3 months ago

Article

Introducing ColQwen-Omni: Retrieve in every modality

By

and 4 others •

Jul 17

• 75

upvoted a collection 4 months ago

MiniMax-M1

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model. • 6 items • Updated 8 days ago • 111

upvoted an article 5 months ago

Article

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

Jun 3

• 93