Inui

Norm

https://normxu.github.io/

AI & ML interests

Video Diffusion; Large Language Model; Object Detection; OCR

Recent Activity

liked a model 1 day ago

meituan-longcat/LongCat-Flash-Omni

Text Generation • 561B • Updated 3 days ago • 88 • 62

upvoted a paper 22 days ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published 28 days ago • 462

liked a model about 1 month ago

rednote-hilab/dots.ocr

Image-Text-to-Text • 3B • Updated 3 days ago • 1.13M • 1.11k

liked a model 2 months ago

meituan-longcat/LongCat-Flash-Chat

Text Generation • 562B • Updated Sep 24 • 22.7k • 498

upvoted a paper 2 months ago

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26 • 123

liked a model 2 months ago

microsoft/VibeVoice-1.5B

Text-to-Speech • 3B • Updated Sep 1 • 210k • 1.94k

upvoted a paper 3 months ago

Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models

Paper • 2508.09138 • Published Aug 12 • 36

liked a model 3 months ago

Qwen/Qwen-Image-Edit

Image-to-Image • Updated Aug 25 • 199k • 2.08k

upvoted 3 papers 3 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 306

GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning

Paper • 2507.19457 • Published Jul 25 • 28

Step-3 is Large yet Affordable: Model-system Co-design for Cost-effective Decoding

Paper • 2507.19427 • Published Jul 25 • 18

upvoted 2 papers 4 months ago

SingLoRA: Low Rank Adaptation Using a Single Matrix

Paper • 2507.05566 • Published Jul 8 • 112

Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

Paper • 2506.16406 • Published Jun 19 • 126

upvoted 5 papers 5 months ago

Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model

Paper • 2506.13642 • Published Jun 16 • 26

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 268

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 262

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Paper • 2506.03147 • Published Jun 3 • 58

One-shot Entropy Minimization

Paper • 2505.20282 • Published May 26 • 6

upvoted 2 papers 6 months ago

MMaDA: Multimodal Large Diffusion Language Models

Paper • 2505.15809 • Published May 21 • 96

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published May 14 • 98
