kyegomez / BitNet

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in PyTorch

Home Page: https://discord.gg/qUtxnK2NMf

[Question] How did you implement 1-bit tensor?

Beomi opened this issue

Hi, I found this repository while planning to implement BitNet.

BitLinear uses 1-bit weights, but PyTorch's native dtypes do not include a 1-bit tensor,
so I thought I would need to implement it with a custom CUDA kernel (bit packing and unpacking).

How did you implement the 1-bit tensor?
(I can't find any CUDA files yet.)
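
To make "bit packing and unpacking" concrete, here is a minimal pure-Python sketch of the idea, not this repository's implementation. The function names `pack_signs` and `unpack_signs` are hypothetical, and the bit convention (bit 1 for +1, bit 0 for -1, with zero treated as +1) is an assumption chosen for illustration. A real kernel would do the same thing on the GPU over tensors.

```python
def pack_signs(weights):
    """Pack the signs of the weights into bytes, 8 weights per byte."""
    packed = bytearray((len(weights) + 7) // 8)  # round up to whole bytes
    for i, w in enumerate(weights):
        if w >= 0:  # assumed convention: bit 1 encodes +1, bit 0 encodes -1
            packed[i // 8] |= 1 << (i % 8)
    return bytes(packed)


def unpack_signs(packed, n):
    """Recover the first n signs (+1 or -1) from the packed bytes."""
    return [1 if (packed[i // 8] >> (i % 8)) & 1 else -1 for i in range(n)]


weights = [0.3, -1.2, 0.0, 0.7, -0.1, -0.5, 2.0, -3.0, 0.9, -0.4]
packed = pack_signs(weights)                # 10 weights stored in 2 bytes
signs = unpack_signs(packed, len(weights))  # list of +1 / -1 values
```

Storing one sign per bit uses 8x less memory than a one-byte-per-element tensor, at the cost of an unpack step before each matrix multiply unless the kernel operates on the packed bits directly.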

I'd be interested to know this as well 🤔
