Giters
XuezheMax
/
megalodon
Reference implementation of Megalodon 7B model
Geek Repo:
Geek Repo
Github PK Tool:
Github PK Tool
Stargazers:
499
Watchers:
14
Issues:
7
Forks:
51
XuezheMax/megalodon Issues
The project is very interesting, thanks for publishing it, now I have a few questions
Updated
a month ago
Comments count
1
any opensource weights?
Updated
2 months ago
Comments count
2
Cuda 11.8/12.1
Closed
2 months ago
Comments count
4
Question about the number of attention chunks in the paper
Closed
3 months ago
Comments count
3
Flash Attention V2 vs Megalodon Swift Attention
Closed
3 months ago
Comments count
2
How to save model and evaluate on downstream LLM
Updated
3 months ago
Failed to install megalodon on V100.
Closed
3 months ago
Comments count
4