Marcos Henrique

wakeupmh

wakeupmh

AI & ML interests

None yet

Recent Activity

upvoted a paper 20 days ago

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

upvoted a paper 2 months ago

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

liked a model 2 months ago

zju-community/matchanything_eloftr

View all activity

Organizations

upvoted a paper 20 days ago

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

Paper • 2510.04618 • Published 27 days ago • 112

upvoted a paper 2 months ago

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6 • 127

liked a model 2 months ago

zju-community/matchanything_eloftr

16.1M • Updated Aug 21 • 2.66k • 75

upvoted 11 papers 4 months ago

Training for X-Ray Vision: Amodal Segmentation, Amodal Content Completion, and View-Invariant Object Representation from Multi-Camera Video

Paper • 2507.00339 • Published Jul 1 • 12

MARBLE: A Hard Benchmark for Multimodal Spatial Reasoning and Planning

Paper • 2506.22992 • Published Jun 28 • 12

Consistent Time-of-Flight Depth Denoising via Graph-Informed Geometric Attention

Paper • 2506.23542 • Published Jun 30 • 14

BlenderFusion: 3D-Grounded Visual Editing and Generative Compositing

Paper • 2506.17450 • Published Jun 20 • 63

PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers

Paper • 2506.05573 • Published Jun 5 • 79

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2 • 140

Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

Paper • 2506.07044 • Published Jun 8 • 113

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30 • 274

Wait, We Don't Need to "Wait"! Removing Thinking Tokens Improves Reasoning Efficiency

Paper • 2506.08343 • Published Jun 10 • 54

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published Jun 17 • 49

DoTA-RAG: Dynamic of Thought Aggregation RAG

Paper • 2506.12571 • Published Jun 14 • 50

liked a Space 4 months ago

555

Image to Music v2

🎺

Get a music sample inspired by the mood of an image

upvoted 2 articles 5 months ago

Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

Jun 3

• 271

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

May 21

• 225

liked a Space 6 months ago

Blazing Fast Whisper

👁

Blazing Fast Whisper Deployed on HF Inference Endpoints

upvoted an article 6 months ago

Article

Blazingly fast whisper transcriptions with Inference Endpoints

May 13

• 79

liked a model 6 months ago

HuggingFaceTB/SmolVLM-500M-Instruct

Image-Text-to-Text • 0.5B • Updated Apr 8 • 85.2k • 182

Marcos Henrique

AI & ML interests

Recent Activity

Organizations

wakeupmh's activity

Image to Music v2

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

nanoVLM: The simplest repository to train your VLM in pure PyTorch

Blazing Fast Whisper

Blazingly fast whisper transcriptions with Inference Endpoints