buiduchanh

Kaiden's repositories

AdvancedLiterateMachinery

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Alibaba DAMO Academy.

Language: C++ | License: Apache-2.0

CatVTON

CatVTON is a simple and efficient virtual try-on diffusion model with 1) a Lightweight Network (899.06M parameters in total), 2) Parameter-Efficient Training (49.57M trainable parameters) and 3) Simplified Inference (< 8G VRAM at 1024×768 resolution).

License: NOASSERTION

chatbox

Chatbox is a desktop app for GPT/LLM that supports Windows, Mac, Linux & Web Online

Language: TypeScript | License: GPL-3.0

Cloth2Tex

Cloth2Tex: A Customized Cloth Texture Generation Pipeline for 3D Virtual Try-On

Language: Python | License: GPL-3.0

CnSTD

CnSTD: A Python 3 package, based on PyTorch/MXNet, for Chinese/English Scene Text Detection, Mathematical Formula Detection (MFD), and Layout Analysis

Language: Python | License: Apache-2.0

decord

An efficient video loader for deep learning with smart shuffling that's super easy to digest

Language: C++ | License: Apache-2.0

Deep-Live-Cam

Real-time face swap and one-click video deepfake with only a single image (uncensored)

License: AGPL-3.0

DemoFusion

Let us democratise high-resolution generation! (arXiv 2023)

Language: Jupyter Notebook

DifFace

DifFace: Blind Face Restoration with Diffused Error Contraction (PyTorch)

License: NOASSERTION

doctr

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

License: Apache-2.0

DragGAN

Official Code for DragGAN (SIGGRAPH 2023)

Language: Python | License: NOASSERTION

face-sdk

3DiVi Face SDK is a set of software components (code libraries) for building face recognition solutions

Language: C++ | License: NOASSERTION

FaceRecognizer

Face recognition application

Language: C++ | License: GPL-3.0

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.

Language: Python | License: Apache-2.0

FETNet

FETNet: Feature Erasing and Transferring Network for Scene Text Removal


lit-gpt

Implementation of Falcon, StableLM, Pythia, INCITE language models based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Language: Python | License: Apache-2.0