Giters
neoremind
/
llama2.java
Inference Llama 2 in one file of pure Java
Geek Repo:
Geek Repo
Github PK Tool:
Github PK Tool
Stargazers:
14
Watchers:
2
Issues:
1
Forks:
2
neoremind/llama2.java Issues
[Question] Can Jdk 21 's Vector API improve inference performance under single threads?
Closed
7 months ago
Comments count
3