arxiv:2509.22611
Kexin Huang
737443h
AI & ML interests
None yet
Recent Activity
authored a paper about 1 month ago
RePO: ReLU-based Preference Optimization
authored a paper about 1 month ago
SPRec: Self-Play to Debias LLM-based Recommendation
authored a paper about 1 month ago
Quantile Advantage Estimation for Entropy-Safe Reasoning
Organizations
None yet