kyegomez / BitNet

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in PyTorch

Home Page: https://discord.gg/qUtxnK2NMf

Consider techniques from official training paper

EwoutH opened this issue

Microsoft released a new paper, which contains details and tips on training a ternary LLM. Might be useful!

@EwoutH Yes, I have integrated the new code!
