-
Language models are weak learners
Paper • 2306.14101 • Published • 10 -
Large Language Models as Tax Attorneys: A Case Study in Legal Capabilities Emergence
Paper • 2306.07075 • Published • 10 -
TableGPT: Towards Unifying Tables, Nature Language and Commands into One GPT
Paper • 2307.08674 • Published • 48 -
Nougat: Neural Optical Understanding for Academic Documents
Paper • 2308.13418 • Published • 40
Collections
Discover the best community collections!
Collections including paper arxiv:2309.11419
-
Kosmos-2: Grounding Multimodal Large Language Models to the World
Paper • 2306.14824 • Published • 34 -
Kosmos-G: Generating Images in Context with Multimodal Large Language Models
Paper • 2310.02992 • Published • 4 -
Kosmos-2.5: A Multimodal Literate Model
Paper • 2309.11419 • Published • 55 -
AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model
Paper • 2309.16058 • Published • 56
-
microsoft/Phi-4-mini-flash-reasoning
Text Generation • 4B • Updated • 4.36k • 244 -
Promptomatix: An Automatic Prompt Optimization Framework for Large Language Models
Paper • 2507.14241 • Published • 17 -
Kosmos-2.5: A Multimodal Literate Model
Paper • 2309.11419 • Published • 55 -
Self-Adapting Language Models
Paper • 2506.10943 • Published • 6
-
Scalable Diffusion Models with Transformers
Paper • 2212.09748 • Published • 18 -
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets
Paper • 2311.15127 • Published • 15 -
Learning Transferable Visual Models From Natural Language Supervision
Paper • 2103.00020 • Published • 17 -
U-Net: Convolutional Networks for Biomedical Image Segmentation
Paper • 1505.04597 • Published • 14
-
MEGA: Multilingual Evaluation of Generative AI
Paper • 2303.12528 • Published -
MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks
Paper • 2311.07463 • Published • 15 -
Kosmos-2.5: A Multimodal Literate Model
Paper • 2309.11419 • Published • 55 -
A Unified View of Masked Image Modeling
Paper • 2210.10615 • Published
-
Language models are weak learners
Paper • 2306.14101 • Published • 10 -
Large Language Models as Tax Attorneys: A Case Study in Legal Capabilities Emergence
Paper • 2306.07075 • Published • 10 -
TableGPT: Towards Unifying Tables, Nature Language and Commands into One GPT
Paper • 2307.08674 • Published • 48 -
Nougat: Neural Optical Understanding for Academic Documents
Paper • 2308.13418 • Published • 40
-
microsoft/Phi-4-mini-flash-reasoning
Text Generation • 4B • Updated • 4.36k • 244 -
Promptomatix: An Automatic Prompt Optimization Framework for Large Language Models
Paper • 2507.14241 • Published • 17 -
Kosmos-2.5: A Multimodal Literate Model
Paper • 2309.11419 • Published • 55 -
Self-Adapting Language Models
Paper • 2506.10943 • Published • 6
-
Scalable Diffusion Models with Transformers
Paper • 2212.09748 • Published • 18 -
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets
Paper • 2311.15127 • Published • 15 -
Learning Transferable Visual Models From Natural Language Supervision
Paper • 2103.00020 • Published • 17 -
U-Net: Convolutional Networks for Biomedical Image Segmentation
Paper • 1505.04597 • Published • 14
-
MEGA: Multilingual Evaluation of Generative AI
Paper • 2303.12528 • Published -
MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks
Paper • 2311.07463 • Published • 15 -
Kosmos-2.5: A Multimodal Literate Model
Paper • 2309.11419 • Published • 55 -
A Unified View of Masked Image Modeling
Paper • 2210.10615 • Published
-
Kosmos-2: Grounding Multimodal Large Language Models to the World
Paper • 2306.14824 • Published • 34 -
Kosmos-G: Generating Images in Context with Multimodal Large Language Models
Paper • 2310.02992 • Published • 4 -
Kosmos-2.5: A Multimodal Literate Model
Paper • 2309.11419 • Published • 55 -
AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model
Paper • 2309.16058 • Published • 56