Junkang Wu

junkang0909

https://junkangwu.github.io/

AI & ML interests

LLM alignment

Recent Activity

upvoted a paper about 1 month ago

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

authored a paper about 1 month ago

Aligning Multimodal LLM with Human Preference: A Survey

authored a paper about 1 month ago

Robust Preference Optimization via Dynamic Target Margins

Organizations

None yet

Papers 8

arxiv:2509.22611

arxiv:2506.03690

arxiv:2503.14504

arxiv:2503.07426

models 0

None public yet

datasets 0

None public yet