linlei
thinkaboutzero
AI & ML interests
natural language processing
Recent Activity
upvoted a paper 29 days ago
ASPO: Asymmetric Importance Sampling Policy Optimization
upvoted a paper about 1 month ago
Attention as a Compass: Efficient Exploration for Process-Supervised RL in Reasoning Models
liked a model 2 months ago
Kwai-Klear/Klear-46B-A2.5B-Instruct
Organizations
None yet