Nirvana Nirvana: A Specialized Generalist Model With Task-Aware Memory Mechanism YuhuaJiang/Nirvana-pro 2B • Updated 20 days ago • 23 • 2 YuhuaJiang/Nirvana-simple 2B • Updated 20 days ago • 18 • 1 YuhuaJiang/Nirvana 2B • Updated 20 days ago • 17 • 1
SDAR The models without suffixes use the default block size = 4. JetLM/SDAR-1.7B-Chat Text Generation • 2B • Updated 13 days ago • 1.27k • 7 JetLM/SDAR-4B-Chat Text Generation • 4B • Updated 13 days ago • 1.88k • 2 JetLM/SDAR-8B-Chat Text Generation • 8B • Updated 13 days ago • 122 • 2 JetLM/SDAR-30B-A3B-Chat Text Generation • 31B • Updated 13 days ago • 60 • 2
Nirvana Nirvana: A Specialized Generalist Model With Task-Aware Memory Mechanism YuhuaJiang/Nirvana-pro 2B • Updated 20 days ago • 23 • 2 YuhuaJiang/Nirvana-simple 2B • Updated 20 days ago • 18 • 1 YuhuaJiang/Nirvana 2B • Updated 20 days ago • 17 • 1
SDAR The models without suffixes use the default block size = 4. JetLM/SDAR-1.7B-Chat Text Generation • 2B • Updated 13 days ago • 1.27k • 7 JetLM/SDAR-4B-Chat Text Generation • 4B • Updated 13 days ago • 1.88k • 2 JetLM/SDAR-8B-Chat Text Generation • 8B • Updated 13 days ago • 122 • 2 JetLM/SDAR-30B-A3B-Chat Text Generation • 31B • Updated 13 days ago • 60 • 2