Nirvana: A Specialized Generalist Model With Task-Aware Memory Mechanism
AI & ML interests
Language Model, Diffusion Language Model
Recent Activity
models
22
JetLM/SDAR-30B-A3B-Sci
Text Generation
•
31B
•
Updated
•
33
JetLM/SDAR-30B-A3B-Chat
Text Generation
•
31B
•
Updated
•
60
•
2
JetLM/SDAR-8B-Chat
Text Generation
•
8B
•
Updated
•
122
•
2
JetLM/SDAR-4B-Chat
Text Generation
•
4B
•
Updated
•
1.46k
•
2
JetLM/SDAR-1.7B-Chat
Text Generation
•
2B
•
Updated
•
1.25k
•
7
JetLM/SDAR-30B-A3B-Chat-b8
Text Generation
•
31B
•
Updated
•
12
JetLM/SDAR-30B-A3B-Chat-b64
Text Generation
•
31B
•
Updated
•
13
JetLM/SDAR-30B-A3B-Chat-b16
Text Generation
•
31B
•
Updated
•
15
JetLM/SDAR-30B-A3B-Chat-b32
Text Generation
•
31B
•
Updated
•
11
JetLM/SDAR-8B-Chat-b64
Text Generation
•
8B
•
Updated
•
17
datasets
0
None public yet