Giters
myshell-ai
/
JetMoE
Reaching LLaMA2 Performance with 0.1M Dollars
Geek Repo:
Geek Repo
Github PK Tool:
Github PK Tool
Stargazers:
947
Watchers:
8
Issues:
9
Forks:
75
myshell-ai/JetMoE Issues
Parameter mapping
Updated
2 months ago
Question about the Chat_template
Updated
3 months ago
Pretraining dataset and code request
Updated
3 months ago
Comments count
5
Training script
Updated
3 months ago
Comments count
2
Why not mixtral arch?
Closed
3 months ago
Comments count
1
Finetuning
Updated
3 months ago
Comments count
1
where are those downloaded model stored?
Updated
4 months ago
KeyError: 'jetmoe' for jetmoe-8b-chat
Updated
4 months ago
Comments count
7
What is the minimum GPU configurations for training?
Updated
4 months ago