alvin zheng's starred repositories

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Language:PythonLicense:Apache-2.0Stargazers:8175Issues:0Issues:0

selenium-with-fingerprints

Anonymous automation via selenium with fingerprint replacement technology.

Language:JavaScriptLicense:MITStargazers:68Issues:0Issues:0

ICON

[CVPR'22] ICON: Implicit Clothed humans Obtained from Normals

Language:PythonLicense:NOASSERTIONStargazers:1576Issues:0Issues:0

Text2Video

ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with phonetic dictionary".

Language:PythonStargazers:416Issues:0Issues:0

Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Language:PythonStargazers:9925Issues:0Issues:0

first-order-model

This repository contains the source code for the paper First Order Motion Model for Image Animation

Language:Jupyter NotebookLicense:MITStargazers:14368Issues:0Issues:0

AI-generated-characters

AI-generated-character

Language:Jupyter NotebookStargazers:450Issues:0Issues:0

chinese-poetry

The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。

License:MITStargazers:2Issues:0Issues:0

KAIR

Image Restoration Toolbox (PyTorch). Training and testing codes for DPIR, USRNet, DnCNN, FFDNet, SRMD, DPSR, BSRGAN, SwinIR

Language:PythonLicense:MITStargazers:2837Issues:0Issues:0

stylegan2

StyleGAN2 - Official TensorFlow Implementation

Language:PythonLicense:NOASSERTIONStargazers:10910Issues:0Issues:0

VGen

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

Language:PythonStargazers:2818Issues:0Issues:0

magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Language:PythonLicense:BSD-3-ClauseStargazers:10283Issues:0Issues:0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonLicense:MPL-2.0Stargazers:32474Issues:0Issues:0

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Language:PythonLicense:Apache-2.0Stargazers:6210Issues:0Issues:0

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:46027Issues:0Issues:0

dream-textures

Stable Diffusion built-in to Blender

Language:PythonLicense:GPL-3.0Stargazers:7728Issues:0Issues:0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:65575Issues:0Issues:0

Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Language:PythonLicense:MITStargazers:13421Issues:0Issues:0
Language:PythonLicense:MITStargazers:54Issues:0Issues:0
Language:Jupyter NotebookStargazers:122Issues:0Issues:0

content-moderation-deep-learning

Deep learning based content moderation from text, audio, video & image input modalities.

License:MITStargazers:298Issues:0Issues:0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonLicense:Apache-2.0Stargazers:24424Issues:0Issues:0

state-of-open-source-ai

:closed_book: Clarity in the current fast-paced mess of Open Source innovation

Language:TeXLicense:NOASSERTIONStargazers:1483Issues:0Issues:0

LightGlue

LightGlue: Local Feature Matching at Light Speed (ICCV 2023)

Language:PythonLicense:Apache-2.0Stargazers:3214Issues:0Issues:0

GlueStick

Joint Deep Matcher for Points and Lines 🖼️💥🖼️ (ICCV 2023)

Language:Jupyter NotebookLicense:MITStargazers:535Issues:0Issues:0

Imatch-P

A demo using SuperGlue and SuperPoint to do the image matching task based PaddlePaddle.

Language:PythonStargazers:21Issues:0Issues:0

Hierarchical-Localization

Visual localization made easy with hloc

Language:PythonLicense:Apache-2.0Stargazers:3012Issues:0Issues:0

SuperGluePretrainedNetwork

SuperGlue: Learning Feature Matching with Graph Neural Networks (CVPR 2020, Oral)

Language:PythonLicense:NOASSERTIONStargazers:3179Issues:0Issues:0

selenium-wire

Extends Selenium's Python bindings to give you the ability to inspect requests made by the browser.

Language:PythonLicense:MITStargazers:1876Issues:0Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:130190Issues:0Issues:0