optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool