fly2fly's repositories
Cross-modal-retrieval
媒体计算实践作业:图像——文本跨模态搜索
arxivbox
Web interface for browsing arXiv papers
awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
Awesome-pytorch-list
A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.
awesome-self-supervised-learning
A curated list of awesome self-supervised methods
BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
ChatPaper
Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文总结+润色+审稿+审稿回复
Cross-modal_Retrieval_Tutorial
The Paper List of Cross-Modal Matching for Preliminary Insight.
cs228-material
Teaching materials for the probabilistic graphical models and deep learning classes at Stanford
cs536_compiler
CS536 Intro to PLs and Compilers lab
Fly2flies.github.io
personal blog
fun-rec
推荐系统入门教程,在线阅读地址:https://datawhalechina.github.io/fun-rec/
grid-feats-vqa
Grid features pre-training code for visual question answering
lihang-code
《统计学习方法》的代码实现
MachineLearningNotes
My personal notes
ML_Notes
机器学习算法的公式推导以及numpy实现
MNAD
An official implementation of "Learning Memory-guided Normality for Anomaly Detection" (CVPR 2020) in PyTorch.
NExT-QA
NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR2021)
pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
PAT
🍭 浙江大学PAT题解(C/C++/Java/Python) - 努力成为萌萌的程序媛~
py-bottom-up-attention
PyTorch bottom-up attention with Detectron2
pytorch-cnn-visualizations
Pytorch implementation of convolutional neural network visualization techniques
ResnetGPT
用Resnet101+GPT搭建一个玩王者荣耀的AI
stable-diffusion-webui
Stable Diffusion web UI
TCL
code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022
TRAR-VQA
This is the official pytorch implementation for our ICCV 2021 paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering" on VQA Task
Z-Lab
Z Lab数据实验室开源代码汇总