Yifan Wang
AmberYifan
AI & ML interests
None yet
Recent Activity
published a model 7 days ago: AmberYifan/Qwen3-4B-MATH-GRPO-len-control-tuned-test
published a model 7 days ago: AmberYifan/Qwen3-4B-MATH-GRPO-len-control-tuned
published a model 8 days ago: AmberYifan/Qwen3-4B-OpenR1Math-MARL-structure-v2
Organizations
LLMs Can Get "Brain Rot"!
- AmberYifan/qwen2.5-7b-instruct-full-pretrain-control-tweet-1m-en-sft
  Text Generation • 8B • Updated • 11
- AmberYifan/qwen2.5-7b-instruct-full-pretrain-junk-tweet-1m-en-sft
  Text Generation • 8B • Updated • 76 • 1
- AmberYifan/qwen2.5-7b-instruct-full-pretrain-mix-high-tweet-1m-en-sft
  Text Generation • 8B • Updated • 1
- AmberYifan/qwen2.5-7b-instruct-full-pretrain-mix-mid-tweet-1m-en-sft
  Text Generation • 8B • Updated • 1
DRIFT
Learning from Abundant User Dissatisfaction in Real-World Preference Learning
models 639
AmberYifan/Qwen3-4B-MATH-GRPO-len-control-tuned-test • Updated
AmberYifan/Qwen3-4B-MATH-GRPO-len-control-tuned • Updated
AmberYifan/Qwen3-4B-OpenR1Math-MARL-structure-v2 • Updated
AmberYifan/Qwen3-4B-Polaris-MARL-structure-v2 • Updated
AmberYifan/Qwen3-4B-MATH-MARL-structure-v2 • Updated
AmberYifan/Qwen3-1.7B-MATH-MARL-structure • Updated
AmberYifan/Qwen3-1.7B-MATH-MARL-tuned • Updated
AmberYifan/Qwen3-4B-MATH-MARL-tuned • Updated
AmberYifan/Qwen3-4B-MATH-GRPO-tuned • Updated
AmberYifan/Qwen3-4B-MATH-MARL-structure-loop-penalty-v2-32 • Updated
datasets 28
AmberYifan/seed-data • Viewer • Updated • 491 • 40
AmberYifan/dsat-data • Viewer • Updated • 10.6k • 19
AmberYifan/sat-data • Viewer • Updated • 4.43k • 22
AmberYifan/mistral-v0.1-spin-hhrlhf • Viewer • Updated • 5.5k • 21
AmberYifan/sft-spin-filter • Updated • 6
AmberYifan/sft-spin-kcenter-5k • Viewer • Updated • 5.5k • 4
AmberYifan/gsm8k-sft • Viewer • Updated • 8.79k • 6
AmberYifan/sft-spin-v • Viewer • Updated • 50.5k • 10
AmberYifan/safeRLHF-SFT • Viewer • Updated • 83.4k • 17
AmberYifan/SPIN-trans-DPOformat • Viewer • Updated • 55k • 7