카이's repositories
ITstyler-Image-optimized-Text-based-Style-Transfer
Unofficial https://arxiv.org/abs/2301.10916
annotated_deep_learning_paper_implementations
🧑🏫 59 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
awesome-3D-generation
A curated list of awesome 3d generation papers
content-moderation-deep-learning
Deep learning based content moderation from text, audio, video & image input modalities.
ControlNet
Let us control diffusion models
DAVAR-Lab-OCR
OCR toolbox from Davar-Lab
DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
DiffSynth-Studio
Enjoy the magic of Diffusion models!
diffusion-models-class
Materials for the Hugging Face Diffusion Models Course
DIS
This is the repo for our new project Highly Accurate Dichotomous Image Segmentation
docformer
Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task of Visual Document Understanding (VDU)
kiuikit
A toolkit for 3D computer vision tasks.
langchain
⚡ Building applications with LLMs through composability ⚡
latent-nerf
Official Implementation for "Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures"
LION
Latent Point Diffusion Models for 3D Shape Generation
multinerf
A Code Release for Mip-NeRF 360, Ref-NeRF, and RawNeRF
onnx2tf
Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massive Transpose extrapolation problem in onnx-tensorflow (onnx-tf). I don't need a Star, but give me a pull request.
pix2pix3D
pix2pix3D: Generating 3D Objects from 2D User Inputs
stable-dreamfusion
A pytorch implementation of text-to-3D dreamfusion, powered by stable diffusion.
stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
STEAL
STEAL - Learning Semantic Boundaries from Noisy Annotations (CVPR 2019)
synthtiger
Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021
TEXTurePaper
Official Implementation for "TEXTure: Semantic Texture Transfer using Text Tokens"
uvicorn-gunicorn-fastapi-docker
Docker image with Uvicorn managed by Gunicorn for high-performance FastAPI web applications in Python with performance auto-tuning. Optionally with Alpine Linux.
volumegan
CVPR 2022 VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations
yolov10
YOLOv10: Real-Time End-to-End Object Detection
yolov5-seg-ncnn
c++ version of yolov5 segmentation with ncnn
yolov7
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors