王鹤男 (whn09)

whn09

Geek Repo

Company:AWS

Location:Beijing, China

Github PK Tool:Github PK Tool

王鹤男's repositories

table_structure_recognition

Table detection (TD) and table structure recognition (TSR) using Yolov5/Yolov8, cand you can get the same (even better) result compared with Table Transformer (TATR) with smaller models.

Language:Jupyter NotebookStargazers:41Issues:4Issues:13
Language:Jupyter NotebookStargazers:1Issues:2Issues:0

Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.

Language:PythonStargazers:1Issues:1Issues:0
Language:Jupyter NotebookStargazers:1Issues:2Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

CodeFormer

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

darknet

Convolutional Neural Networks

Language:CLicense:NOASSERTIONStargazers:0Issues:3Issues:0

Easy-Wav2Lip

Colab for making Wav2Lip high quality and easy to use

Language:PythonStargazers:0Issues:0Issues:0

facefusion

Next generation face swapper and enhancer

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:MIT-0Stargazers:0Issues:0Issues:0

InstantID

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

iTransformer

Official implementation for "iTransformer: Inverted Transformers Are Effective for Time Series Forecasting".

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

langchain

⚡ Building applications with LLMs through composability ⚡

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

Llama2-Chinese

Llama中文社区,最好的中文Llama大模型,完全开源可商用

Language:PythonStargazers:0Issues:1Issues:0

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

mmdetection3d

OpenMMLab's next-generation platform for general 3D object detection.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

OpenCastKit

The open-source solutions of FourCastNet and GraphCast

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

pangu-pytorch

Weather forecast at 1/3/6/24-hour horizon

Language:PythonStargazers:0Issues:0Issues:0

SadTalker

[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0
Language:Jupyter NotebookStargazers:0Issues:2Issues:0

table-transformer

Model training and evaluation code for our dataset PubTables-1M, developed to support the task of table extraction from unstructured documents.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

torchrec

Pytorch domain library for recommendation systems

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

TriplaneGaussian

TriplaneGaussian: A new hybrid representation for single-view 3D reconstruction.

Language:PythonStargazers:0Issues:1Issues:0

ultralytics

NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:1Issues:0

Wav2Lip-HD

High-Fidelity Lip-Syncing with Wav2Lip and Real-ESRGAN

Language:PythonStargazers:0Issues:1Issues:0

whisper.cpp

Port of OpenAI's Whisper model in C/C++

Language:CLicense:MITStargazers:0Issues:1Issues:0

YogaPoseEstimation

Using Pose Estimation to Judge Yoga Form

Language:Jupyter NotebookStargazers:0Issues:1Issues:0

YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0