Generative AI Beyond LLMs: System Implications of Multi-Modal Generation • Paper • arXiv:2312.14385 • Published Dec 22, 2023
LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding • Paper • arXiv:2404.16710 • Published Apr 25, 2024