SPLADE-Tiny-MSMARCO Collection SPLADE sparse retrieval models based on BERT-Tiny (4M) and BERT-Mini (11M) distilled from a Cross-Encoder on the MSMARCO dataset • 6 items • Updated 6 days ago • 1
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model Paper • 2510.14528 • Published 13 days ago • 72
view article Article Introducing the Synthetic Data Generator - Build Datasets with Natural Language Dec 16, 2024 • 145
CWM: An Open-Weights LLM for Research on Code Generation with World Models Paper • 2510.02387 • Published 28 days ago • 7