HuiminWuHuiminWu's starred repositories
xlstm-resources
Resources about xLSTM by Sepp Hochreiter
FlowDiffusion_pytorch
Unofficial pytorch implementation of DDVM.
MIM-Depth-Estimation
This is an official implementation of our CVPR 2023 paper "Revealing the Dark Secrets of Masked Image Modeling" on Depth Estimation.
Awesome_Prompting_Papers_in_Computer_Vision
A curated list of prompt-based paper in computer vision and vision-language learning.
pytorchviz
A small package to create visualizations of PyTorch execution graphs
Transformer_Relative_Position_PyTorch
Implement the paper "Self-Attention with Relative Position Representations"
video2dataset
Easily create large video dataset from video urls
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Transformer-in-Vision
Recent Transformer-based CV and related works.
random_quantize
a novel data augmentation method across data modalities