Xin Liu

xinliucs

AI & ML interests

None yet

Recent Activity

updated a Space about 6 hours ago

launch/factrbench

liked a dataset about 1 month ago

facebook/factual_reasoning

upvoted a paper 2 months ago

MMTok: Multimodal Coverage Maximization for Efficient Inference of VLMs



updated a Space about 6 hours ago

FactRBench

🏆

View and analyze long-form factuality leaderboard

liked a dataset about 1 month ago

facebook/factual_reasoning

Preview • Updated Sep 25 • 40 • 2

upvoted 2 papers 2 months ago

MMTok: Multimodal Coverage Maximization for Efficient Inference of VLMs

Paper • 2508.18264 • Published Aug 25 • 25

LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries

Paper • 2508.15760 • Published Aug 21 • 46

upvoted a paper 3 months ago

Complex Logical Instruction Generation

Paper • 2508.09125 • Published Aug 12 • 39

liked a dataset 4 months ago

launch/FactRBench

Viewer • Updated Jun 9 • 1.06k • 26 • 1

upvoted a paper 4 months ago

CLASH: Evaluating Language Models on Judging High-Stakes Dilemmas from Multiple Perspectives

Paper • 2504.10823 • Published Apr 15 • 15

liked a dataset 4 months ago

launch/ExpertLongBench

Preview • Updated Jul 30 • 125 • 10

upvoted a paper 5 months ago

ExpertLongBench: Benchmarking Language Models on Expert-Level Long-Form Generation Tasks with Structured Checklists

Paper • 2506.01241 • Published Jun 2 • 9

updated a dataset 5 months ago

launch/FactRBench

Viewer • Updated Jun 9 • 1.06k • 26 • 1

upvoted a paper 6 months ago

VeriFact: Enhancing Long-Form Factuality Evaluation with Refined Fact Extraction and Reference Facts

Paper • 2505.09701 • Published May 14 • 2

upvoted a paper 7 months ago

MLRC-Bench: Can Language Agents Solve Machine Learning Research Challenges?

Paper • 2504.09702 • Published Apr 13 • 18

published a dataset 7 months ago

launch/FactRBench

Viewer • Updated Jun 9 • 1.06k • 26 • 1

liked a model almost 3 years ago

launch/POLITICS

Fill-Mask • Updated Apr 13 • 44 • 13
