Datasets scripts
kyegomez opened this issue · comments
Eternal Reclaimer commented
pile v2 + redpajama is what RKWV is training on rn. that's a 1.7T token dataset.
An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fast
kyegomez opened this issue · comments
pile v2 + redpajama is what RKWV is training on rn. that's a 1.7T token dataset.