Sayak Paul's picture

Sayak Paul PRO

sayakpaul

·

https://sayak.dev

AI & ML interests

Diffusion models, representation learning

Recent Activity

updated a dataset about 4 hours ago

huggingface/diffusers-metadata

replied to their post 3 days ago

Diffusers supports a good variety of quantization backends. It can be challenging to navigate through them, given the complex nature of diffusion pipelines in general. So, @derekl35 set out to write a comprehensive guide that puts users in the front seat. Explore the different backends we support, learn the trade-offs they offer, and finally, check out the cool space we built that lets you compare quantization results. Give it a go here: https://lnkd.in/gf8Pi4-2

upvoted a paper 7 days ago

FineVision: Open Data Is All You Need

View all activity

Organizations

updated a dataset about 4 hours ago

huggingface/diffusers-metadata

Viewer • Updated about 4 hours ago • 79 • 952 • 10

replied to their post 3 days ago

https://github.com/huggingface/diffusers/pull/12207

Cannot do much beyond this at this point. There are a couple of things very unclear.

upvoted a paper 7 days ago

FineVision: Open Data Is All You Need

Paper • 2510.17269 • Published 8 days ago • 57

updated a model 7 days ago

diffusers-internal-dev/qwen-prompt-expander

Updated 7 days ago

published a model 7 days ago

diffusers-internal-dev/qwen-prompt-expander

Updated 7 days ago

liked a Space 11 days ago

VLM Object Understanding

Explore object detection, visual grounding, keypoint Detecti

updated a Space 13 days ago

Benchmark Analyzer

Analyze Diffusers benchmarks

updated a dataset 13 days ago

diffusers/benchmarks

Viewer • Updated 13 days ago • 13 • 49 • 14

liked a dataset 19 days ago

JackyZhuo/SructVisuals

Viewer • Updated 19 days ago • 1.34M • 1.31k • 3

authored a paper 21 days ago

Factuality Matters: When Image Generation and Editing Meet Structured Visuals

Paper • 2510.05091 • Published 22 days ago • 17

upvoted a collection 21 days ago

StructVisuals

StructBench and StructVisuals (Training Set) • 4 items • Updated 19 days ago • 4

upvoted a paper 21 days ago

Factuality Matters: When Image Generation and Editing Meet Structured Visuals

Paper • 2510.05091 • Published 22 days ago • 17

liked a Space 27 days ago

Qwen Image Edit 2509

Generate edited images based on prompts and input images

upvoted a paper 28 days ago

SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer

Paper • 2509.24695 • Published 29 days ago • 43

updated a Space about 1 month ago

Optimized Diffusers Code

Optimize Diffusers Code on your hardware.

updated a dataset about 1 month ago

huggingface/diffusers-metadata

Viewer • Updated about 4 hours ago • 79 • 952 • 10

updated 2 models about 1 month ago

diffusers-internal-dev/gemini-prompt-expander

Updated Sep 18 • 5 • 1

diffusers-internal-dev/canny-filtering

updated a collection about 1 month ago

Modular Diffusers Custom Blocks

Custom blocks for Modular Diffusers • 8 items • Updated 27 days ago • 2