Giters
AI-Hypercomputer
/
maxtext
A simple, performant and scalable Jax LLM!
Geek Repo:
Geek Repo
Github PK Tool:
Github PK Tool
Stargazers:
1491
Watchers:
32
Issues:
92
Forks:
279
AI-Hypercomputer/maxtext Issues
A pip error occurs when running setup.sh.
Closed
8 months ago
Comments count
1
Problems with a parameter checkpoint after training llama2-7b
Closed
8 months ago
Comments count
1
Issues running decode example from readme
Closed
8 months ago
Comments count
1
Issues running end_to_end/test_mistral.sh
Closed
8 months ago
Comments count
7
[request] bloom (alibi) model implementation
Closed
8 months ago
Comments count
1
Should non-pod multihost be possible on TPU v2s/v3s?
Closed
8 months ago
Comments count
3
`nextrng` not checkpointed, consider using `fold_in(config.seed, step)`
Closed
9 months ago
Comments count
2
Long sequences are dropped rather than trimmed
Closed
8 months ago
Comments count
2
XlaRuntimeError when training with bfloat16 activations on TPU v3-8
Closed
a year ago
Comments count
3
Local development instructions don't work
Closed
a year ago
Comments count
5
Do the Attentions / MLPs run in parallel?
Closed
a year ago
Comments count
1
Jobs in kubernetes exceeds the limit of 40 characters
Closed
a year ago
Comments count
4
TPUv2-8 multislice
Closed
a year ago
Comments count
2
You don't have to
Closed
a year ago
Comments count
1
maxtext on Colab TPU
Closed
a year ago
Comments count
1
load_parameters_path=gs:// deletes directory
Closed
a year ago
pip install maxtext
Closed
a year ago
Comments count
2
README is missing instructions for `dataset_path` flag
Closed
a year ago
Comments count
1
52B example sharding error
Closed
a year ago
Comments count
3
multihost_runner.py: number of devices does not match the product of the parallelism
Closed
a year ago
Comments count
3
FAILED_PRECONDITION: TPU platform already registered for platform hardware version
Closed
a year ago
Comments count
1
Training spikes with FSDP
Closed
a year ago
Comments count
2
any larger model test?
Closed
a year ago
Comments count
1
How to calculate MFU as shown in readme?
Closed
a year ago
Comments count
5
Previous
Next