8 2

COS

Linn3a

AI & ML interests

None yet

Recent Activity

upvoted a paper 16 days ago

Conditional Advantage Estimation for Reinforcement Learning in Large Reasoning Models

upvoted a paper 16 days ago

Rethinking Entropy Regularization in Large Reasoning Models

upvoted a paper 19 days ago

LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions

Organizations

None yet

models 0

None public yet

datasets 0

None public yet