kyegomez / BitNet

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch

Home Page:https://discord.gg/qUtxnK2NMf

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

need a distributed training example

sosofun opened this issue · comments

Thank you for your innovative work, can you provide a distributed training example?
then can quickly reproduct and verify thesis work。

@sosofun yes I can

Stale issue message

@sosofun try it out! it's been created