xujinbao282's repositories
AnalysisAVP
音视频学习,相关文件格式/协议分析,框架学习等。encode decode;rgb yuv h264 aac flv mp4 rtmp;libyuv x264 openh264 faac faad2 fdk-aac librtmp ffmpeg sdl2 webrtc;android ios capture videotoolbox;
Barbershop
Barbershop: GAN-based Image Compositing using Segmentation Masks (SIGGRAPH Asia 2021)
CogVLM
a state-of-the-art-level open visual language model | 多模态预训练模型
CV
✔(已完结)最全面的 深度学习 笔记【土堆 Pytorch】【李沐 动手学深度学习】【吴恩达 深度学习】
deepface
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
DeepLearning-500-questions
深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系scutjy2015@163.com 版权所有,违权必究 Tan 2018.06
ffhq-dataset
Flickr-Faces-HQ Dataset (FFHQ)
gost
GO Simple Tunnel - a simple tunnel written in golang
grok-1
Grok open release
imgaug
Image augmentation for machine learning experiments.
Inpaint-Anything
Inpaint anything using Segment Anything and inpainting models.
Multi-LoRA-Composition
Repository for the Paper "Multi-LoRA Composition for Image Generation"
OMG
OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Models
Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
opencv
Open Source Computer Vision Library
pumpkin-book
《机器学习》(西瓜书)公式详解
roop
one-click deepfake (face swap)
srs
SRS is a simple, high-efficiency, real-time video server supporting RTMP, WebRTC, HLS, HTTP-FLV, SRT, MPEG-DASH, and GB28181.
stable-diffusion
A latent text-to-image diffusion model
stable-diffusion-webui
Stable Diffusion web UI
whisper
Robust Speech Recognition via Large-Scale Weak Supervision
whisper.cpp
Port of OpenAI's Whisper model in C/C++
x-ui
支持多协议多用户的 xray 面板
Xray-core
Xray, Penetrates Everything. Also the best v2ray-core, with XTLS support. Fully compatible configuration.
Xray-examples
Some examples of uses for Xray-core.