huggingface / pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Home Page: https://huggingface.co/docs/timm

[FEATURE] Add Hiera

raulcarlomagno opened this issue

Add a vision model from Meta

"Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles"
https://github.com/facebookresearch/hiera/tree/main

@raulcarlomagno I like this model quite a bit, neat ideas, but they've marked both the code and the weights as non-commercial (NC). I can deal with the weights, since I handle them under separate licenses on the HF hub, but I cannot bring NC code into timm...

Given that, it would take more effort to do a clean-room implementation from first principles, and I have a lot of things in progress right now. Or you could bug them to drop the NC license on the code and just keep it for the weights...