Kamino's repositories
watermark-tracer
一个基于可视水印检测识别的数字媒体溯源应用系统,是我的大作业项目,包含这个系统以及一个开源的大规模常见水印图像数据集(Large-scale Common Watermark Dataset, LCWD)。 输入一个带有可视水印的图片或视频,系统会检测定位到水印所在的区域,然后将其提取出来,然后借助百度AI开放平台的OCR和logo识别以及Bing搜索引擎,溯源到这个图片或视频的源头。
Video-Captioning-Transformer
这是一个基于Pytorch平台、Transformer框架实现的视频描述生成 (Video Captioning) 深度学习模型。 视频描述生成任务指的是:输入一个视频,输出一句描述整个视频内容的文字(前提是视频较短且可以用一句话来描述)。本repo主要目的是帮助视力障碍者欣赏网络视频、感知周围环境,促进“无障碍视频”的发展。
S2VT-video-caption
An implementation of paper "Sequence to Sequence – Video to Text". This implementation uses the S2VT model to do video captioning(or video description) task.
video_features
Extract video features from raw videos using multiple GPUs. We support RAFT and PWC flow frames as well as I3D, R(2+1)D, VGGish, ResNet, CLIP features.
dangdang-analyse
爬取、分析当当网的图书评论数据,用来做大作业的
shabidianlu
**放大电路简便计算
vatex-downloader
A simple vatex dataset downloader. 一个简单的VATEX数据集(或其他YouTube视频数据集)的下载器,特别为国内网络环境优化(其实就是断点下载和加上代理的参数)。
163MusicSpider
一个获取网易云音乐歌手、专辑、歌曲、评论、歌词等数据的Python爬虫
a-PyTorch-Tutorial-to-Image-Captioning
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
CBIR
🏞 A content-based image retrieval (CBIR) system
ChatGPT-Next-Web
一键拥有你自己的 ChatGPT 网页服务。 One-Click to deploy your own ChatGPT web UI.
cifar-pytorch-learning
LeNet5、AlexNet、VGG、GoogleNet、ResNet不同网络结构的尝试
CreationEngine
C++ OpenGL 模仿我的世界,内容相对完善,随机地图,支持双人联机,代码注释多
LAVIS-MMVCT
LAVIS - A One-stop Library for Language-Vision Intelligence
learn_cryptography
The Python3 implementation of MD5, SHA1 algorithms. Used for learning cryptography.
Machine-Learning-Notes
入门机器学习的笔记库
mmselfsup
OpenMMLab Self-Supervised Learning Toolbox and Benchmark
mmt
Multi-Modal Transformer for Video Retrieval
PEL4VAD
Official code for "Learning Prompt-Enhanced Context features for Weakly-Supervised Video Anomlay Detection"
pycocoevalcap
Python 3 support for the MS COCO caption evaluation tools
pytorch-book
PyTorch tutorials and fun projects including neural talk, neural style, poem writing, anime generation (《深度学习框架PyTorch:入门与实战》)
RecNet
A Pytorch implementation of "Reconstruction Network for Video Captioning", CVPR 2018
torch_videovision
Transforms for video datasets in pytorch
torchvggish
Pytorch port of Google Research's VGGish model used for extracting audio features.
vidat
Video Annotation Tool
wx-challenge
微信大赛baseline