Giters
myshell-ai
/
JetMoE
Reaching LLaMA2 Performance with 0.1M Dollars
Geek Repo:
Geek Repo
Github PK Tool:
Github PK Tool
Stargazers:
936
Watchers:
8
Issues:
8
Forks:
72
myshell-ai/JetMoE Issues
Question about the Chat_template
Updated
2 months ago
Pretraining dataset and code request
Updated
2 months ago
Comments count
5
Training script
Updated
2 months ago
Comments count
2
Why not mixtral arch?
Closed
2 months ago
Comments count
1
Finetuning
Updated
2 months ago
Comments count
1
where are those downloaded model stored?
Updated
2 months ago
KeyError: 'jetmoe' for jetmoe-8b-chat
Updated
2 months ago
Comments count
7
What is the minimum GPU configurations for training?
Updated
2 months ago