Beast code in Giters

alvin zheng's repositories

EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.

Language: Python · Apache-2.0 · 100

peclr

This is the pretraining code for PeCLR, an equivariant contrastive learning framework for 3D hand pose estimation. The paper was presented at ICCV 2021.

100

yolov5

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Language: Python · GPL-3.0 · 100

3D-art-gallery

This is an interactive 3D art gallery made with Three.js, perfect for artists or designers to exhibit their portfolio of artworks and projects.

000

AI-generated-characters

AI-generated-character

000

chinese-poetry

The most comprehensive database of Chinese poetry 🧶: nearly 14,000 poets from the Tang and Song dynasties, close to 55,000 Tang poems and 260,000 Song poems, plus 1,564 ci poets of the Song period with 21,050 ci.

MIT · 000

first-order-model

This repository contains the source code for the paper "First Order Motion Model for Image Animation".

MIT · 000

ICON

[CVPR'22] ICON: Implicit Clothed humans Obtained from Normals

NOASSERTION · 000

Imatch-P

A demo that uses SuperGlue and SuperPoint for image matching, based on PaddlePaddle.

000

KAIR

Image Restoration Toolbox (PyTorch). Training and testing codes for DPIR, USRNet, DnCNN, FFDNet, SRMD, DPSR, BSRGAN, SwinIR

MIT · 000

LightGlue

LightGlue: Local Feature Matching at Light Speed (ICCV 2023)

Apache-2.0 · 000

magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

BSD-3-Clause · 000

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Apache-2.0 · 000

python-qrcode

Python QR Code image generator

NOASSERTION · 000

Real-Time-Violence-Detection-in-Video-

000

segment-anything

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Apache-2.0 · 000

SimSwap

An arbitrary face-swapping framework on images and videos with one single trained model!

NOASSERTION · 000

stylegan2

StyleGAN2 - Official TensorFlow Implementation

NOASSERTION · 000

Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

MIT · 000

Swin-Transformer-Semantic-Segmentation

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Semantic Segmentation.

Apache-2.0 · 000

SwinTransformer

torch implementation of SwinTransformer

MIT · 000

Text2Video

ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with phonetic dictionary".

000

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Apache-2.0 · 000

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

MPL-2.0 · 000

undetected-chromedriver

Custom Selenium Chromedriver | Zero-Config | Passes ALL bot mitigation systems (like Distil / Imperva / DataDome / Cloudflare IUAM)

GPL-3.0 · 000

VGen

Official repo for VGen: a holistic video generation ecosystem built on diffusion models

000

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Apache-2.0 · 000

Wav2Lip

This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, try Sync Labs.

000

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

MIT · 000

yolov5_obb

yolov5 + csl_label (Oriented Object Detection) (Rotation Detection) (Rotated BBox): rotated object detection based on yolov5

GPL-3.0 · 000