Giters
yandex
/
YaLM-100B
Pretrained language model with 100B parameters
Geek Repo:
Geek Repo
Github PK Tool:
Github PK Tool
Stargazers:
3745
Watchers:
48
Issues:
28
Forks:
298
yandex/YaLM-100B Issues
How to use it with LangChain?
Updated
7 months ago
Comments count
2
gguf / mlx format?
Updated
10 months ago
Why usage ssh-agent and openssh-client package in docker
Updated
a year ago
Timeout on 8 x RTX A6000
Updated
a year ago
Comments count
2
Request to Open "Russian Pile" Dataset for Public Access
Updated
2 years ago
Provide pruned version for weaker hardware
Updated
2 years ago
Comments count
2
Citation bibtex?
Closed
2 years ago
Comments count
2
CUDA out of memory
Closed
2 years ago
Comments count
6
Has anyone deployed it on 10x 3090 ? Or any similar configuration?
Updated
2 years ago
Comments count
1
Is there any plans for making cloud service?
Updated
2 years ago
Comments count
1
PCI x1 or PCI x16 for GPU
Updated
2 years ago
NCCL error
Closed
2 years ago
Comments count
1
Would it be possible to run the model on single A100 (40GB) or 2xV100 (32GB) ?
Closed
2 years ago
Comments count
2
ZeRO 3 NVMe Offload?
Closed
2 years ago
Comments count
9
No mention of `bfloat16` in source, and yet weights are `bfloat16`
Updated
2 years ago
Could you share the md5 value for those checkpoints?
Closed
2 years ago
Comments count
2
Can it be launched on usual VPS? For example, 6 CPU 16 RAM (usual chips)
Updated
2 years ago
Comments count
2
Online examples
Updated
2 years ago
Comments count
9
AWS
Updated
2 years ago
Comments count
1
Run on networked nodes
Updated
2 years ago
Dataset information
Closed
2 years ago
Comments count
2
[NL] token
Closed
2 years ago
Comments count
2
How did you used LAMB optimizer with ZeRO CPU offload?
Closed
2 years ago
Comments count
2
Possible to run on 8 x 24GB 3090?
Updated
2 years ago
Comments count
3
Привет
Closed
2 years ago
Comments count
2
Model dataset irregularity
Closed
2 years ago
YaLm
Closed
2 years ago
Comments count
1
Evaluation benchmarks (lm-eval-harness)
Updated
2 years ago