Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding Paper • 2510.06308 • Published 27 days ago • 52
PICABench: How Far Are We from Physically Realistic Image Editing? Paper • 2510.17681 • Published 14 days ago • 61
Factuality Matters: When Image Generation and Editing Meet Structured Visuals Paper • 2510.05091 • Published 28 days ago • 18
HAF-RM: A Hybrid Alignment Framework for Reward Model Training Paper • 2407.04185 • Published Jul 4, 2024
view article Article BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks Jun 18, 2024 • 52
ARKS: Active Retrieval in Knowledge Soup for Code Generation Paper • 2402.12317 • Published Feb 19, 2024
ALaRM: Align Language Models via Hierarchical Rewards Modeling Paper • 2403.06754 • Published Mar 11, 2024
DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation Paper • 2211.11501 • Published Nov 18, 2022