Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
4
11
Jian Guan
Jiann
Follow
buaa42wxy's profile picture
yangjunxiao2021's profile picture
2 followers
·
3 following
https://jianguanthu.github.io/
JianGuanTHU
AI & ML interests
Natural language generation;storytelling
Recent Activity
updated
a dataset
2 days ago
Jiann/GS-Reasoner-Data
published
a dataset
2 days ago
Jiann/GS-Reasoner-Data
upvoted
a
paper
8 days ago
BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping
View all activity
Organizations
Papers
15
arxiv:
2506.09965
arxiv:
2504.02438
arxiv:
2503.17003
arxiv:
2503.02324
Expand 15 papers
models
3
Sort: Recently updated
Jiann/TestModel
8B
•
Updated
Jul 21
•
3
Jiann/AMOR-warmup
Updated
Nov 5, 2024
•
1
Jiann/AMOR-adaptation
10B
•
Updated
Nov 5, 2024
•
1
datasets
9
Sort: Recently updated
Jiann/GS-Reasoner-Data
Updated
2 days ago
•
4
Jiann/STORAL
Viewer
•
Updated
Nov 7, 2024
•
21.8k
•
219
•
2
Jiann/UNION_DATA
Viewer
•
Updated
Nov 6, 2024
•
600
•
75
Jiann/OpenMEVA
Updated
Nov 6, 2024
•
30
Jiann/AMOR_warmup_data
Preview
•
Updated
Nov 5, 2024
•
14
Jiann/LOT
Preview
•
Updated
Nov 5, 2024
•
37
Jiann/QA
Updated
Aug 5, 2024
•
51
•
1
Jiann/UnifiedPreferenceDataset2
Viewer
•
Updated
Jul 31, 2024
•
1M
•
147
Jiann/UnifiedPreferenceDataset
Viewer
•
Updated
Jul 31, 2024
•
2.34M
•
267
•
1