Ziqi wang

wzq016

https://wzq016.github.io

AI & ML interests

NLP

upvoted 2 papers 4 months ago

MIRIX: Multi-Agent Memory System for LLM-Based Agents

Paper • 2507.07957 • Published Jul 10 • 74

Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers

Paper • 2506.23918 • Published Jun 30 • 88

upvoted 2 papers 5 months ago

Saffron-1: Towards an Inference Scaling Paradigm for LLM Safety Assurance

Paper • 2506.06444 • Published Jun 6 • 73

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30 • 138

upvoted a paper 6 months ago

RM-R1: Reward Modeling as Reasoning

Paper • 2505.02387 • Published May 5 • 78

upvoted a paper 8 months ago

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published Feb 26 • 83

upvoted a paper about 1 year ago

Law of the Weakest Link: Cross Capabilities of Large Language Models

Paper • 2409.19951 • Published Sep 30, 2024 • 54

upvoted a paper over 1 year ago

Eliminating Position Bias of Language Models: A Mechanistic Approach

Paper • 2407.01100 • Published Jul 1, 2024 • 9

upvoted a collection over 1 year ago

Model Extrapolation Expedites Alignment

Better aligned models obtained by model extrapolation (ExPO) • 25 items • Updated May 27 • 17

upvoted a paper over 1 year ago

Weak-to-Strong Extrapolation Expedites Alignment

Paper • 2404.16792 • Published Apr 25, 2024 • 11