11 22 12

Le Zhuo

JackyZhuo

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

PICABench: How Far Are We from Physically Realistic Image Editing?

updated a dataset 15 days ago

JackyZhuo/PICABenchV1-intermediate

published a dataset 15 days ago

JackyZhuo/PICABenchV1-intermediate

View all activity

Organizations

upvoted a paper 8 days ago

PICABench: How Far Are We from Physically Realistic Image Editing?

Paper • 2510.17681 • Published 9 days ago • 60

upvoted a collection 20 days ago

StructVisuals

Collection

StructBench and StructVisuals (Training Set) • 4 items • Updated 20 days ago • 4

upvoted a paper 22 days ago

Factuality Matters: When Image Generation and Editing Meet Structured Visuals

Paper • 2510.05091 • Published 23 days ago • 18

upvoted a paper 27 days ago

Video Background Music Generation: Dataset, Method and Evaluation

Paper • 2211.11248 • Published Nov 21, 2022 • 1

upvoted a paper 6 months ago

T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT

Paper • 2505.00703 • Published May 1 • 44

upvoted a collection 6 months ago

ReflectionFlow release

Collection

https://diffusion-cot.github.io/reflection2perfection/ • 6 items • Updated Apr 23 • 13

upvoted 2 papers 6 months ago

From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning

Paper • 2504.16080 • Published Apr 22 • 15

Packing Input Frame Context in Next-Frame Prediction Models for Video Generation

Paper • 2504.12626 • Published Apr 17 • 51

upvoted 2 papers 7 months ago

VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning

Paper • 2504.07960 • Published Apr 10 • 50

Lumina-Image 2.0: A Unified and Efficient Image Generative Framework

Paper • 2503.21758 • Published Mar 27 • 22

upvoted a collection 9 months ago

Open Image Preferences

Collection

Containing all artifacts for the Stable Diffusion 3.5L vs Flux Dev image preference community sprint. • 14 items • Updated Dec 19, 2024 • 11

upvoted a paper 9 months ago

IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models

Paper • 2501.13920 • Published Jan 23 • 19

upvoted a paper 10 months ago

Cosmos World Foundation Model Platform for Physical AI

Paper • 2501.03575 • Published Jan 7 • 81

upvoted 2 papers 11 months ago

Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation

Paper • 2412.09428 • Published Dec 12, 2024 • 7

VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection

Paper • 2411.14794 • Published Nov 22, 2024 • 13

upvoted 3 papers about 1 year ago

PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions

Paper • 2409.15278 • Published Sep 23, 2024 • 25

LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation

Paper • 2408.15881 • Published Aug 28, 2024 • 21

Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining

Paper • 2408.02657 • Published Aug 5, 2024 • 35

upvoted a paper about 2 years ago

3D-GPT: Procedural 3D Modeling with Large Language Models

Paper • 2310.12945 • Published Oct 19, 2023 • 59

upvoted a paper over 2 years ago

Brain2Music: Reconstructing Music from Human Brain Activity

Paper • 2307.11078 • Published Jul 20, 2023 • 41

Le Zhuo

AI & ML interests

Recent Activity

Organizations

JackyZhuo's activity