王鹤男's repositories
table_structure_recognition
Table detection (TD) and table structure recognition (TSR) using Yolov5/Yolov8, cand you can get the same (even better) result compared with Table Transformer (TATR) with smaller models.
CodeFormer
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
Easy-Wav2Lip
Colab for making Wav2Lip high quality and easy to use
facefusion
Next generation face swapper and enhancer
iTransformer
Official implementation for "iTransformer: Inverted Transformers Are Effective for Time Series Forecasting".
Llama2-Chinese
Llama中文社区,最好的中文Llama大模型,完全开源可商用
MiniCPM-V
MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone
mmdetection3d
OpenMMLab's next-generation platform for general 3D object detection.
OpenCastKit
The open-source solutions of FourCastNet and GraphCast
pangu-pytorch
Weather forecast at 1/3/6/24-hour horizon
table-transformer
Model training and evaluation code for our dataset PubTables-1M, developed to support the task of table extraction from unstructured documents.
torchrec
Pytorch domain library for recommendation systems
TriplaneGaussian
TriplaneGaussian: A new hybrid representation for single-view 3D reconstruction.
ultralytics
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
Wav2Lip-HD
High-Fidelity Lip-Syncing with Wav2Lip and Real-ESRGAN
whisper.cpp
Port of OpenAI's Whisper model in C/C++
YogaPoseEstimation
Using Pose Estimation to Judge Yoga Form
YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection