KABI

dongguanting

https://dongguanting.github.io/

AI & ML interests

Reasoning and Alignment for Large Language Models

Recent Activity

upvoted 3 papers 3 days ago

Tongyi DeepResearch Technical Report

Paper • 2510.24701 • Published 5 days ago • 82

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published 4 days ago • 41

ReForm: Reflective Autoformalization with Prospective Bounded Sequence Optimization

Paper • 2510.24592 • Published 5 days ago • 49

authored a paper 6 days ago

DeepAgent: A General Reasoning Agent with Scalable Toolsets

Paper • 2510.21618 • Published 9 days ago • 91

upvoted 3 papers 6 days ago

ImagerySearch: Adaptive Test-Time Search for Video Generation Beyond Semantic Dependency Constraints

Paper • 2510.14847 • Published 17 days ago • 55

DeepAnalyze: Agentic Large Language Models for Autonomous Data Science

Paper • 2510.16872 • Published 14 days ago • 90

A Definition of AGI

Paper • 2510.18212 • Published 12 days ago • 33

New activity in dongguanting/Qwen2.5-7B-AEPO 6 days ago

Update pipeline tag

#2 opened 12 days ago by nielsr

New activity in dongguanting/Qwen3-8B-AEPO-DeepSearch 6 days ago

Update pipeline tag

#2 opened 12 days ago by nielsr

upvoted a paper 6 days ago

DeepAgent: A General Reasoning Agent with Scalable Toolsets

Paper • 2510.21618 • Published 9 days ago • 91

upvoted a paper 10 days ago

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published 12 days ago • 82

upvoted 2 papers 11 days ago

WithAnyone: Towards Controllable and ID Consistent Image Generation

Paper • 2510.14975 • Published 17 days ago • 79

Chem-R: Learning to Reason as a Chemist

Paper • 2510.16880 • Published 14 days ago • 52

authored a paper 12 days ago

Towards Mixed-Modal Retrieval for Universal Retrieval-Augmented Generation

Paper • 2510.17354 • Published 13 days ago • 32

upvoted 3 papers 12 days ago

MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning

Paper • 2510.14958 • Published 17 days ago • 22

Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents

Paper • 2510.14967 • Published 17 days ago • 32

BitNet Distillation

Paper • 2510.13998 • Published 18 days ago • 52

updated a collection 12 days ago

AEPO

Collection

The official datasets and model checkpoints of AEPO • 4 items • Updated 12 days ago • 3

New activity in dongguanting/Qwen3-14B-AEPO-DeepSearch 12 days ago

Improve model card for Qwen3-14B-AEPO-DeepSearch with pipeline tag, Transformers library, and GitHub link

#1 opened 15 days ago by nielsr
