mqchen1993's starred repositories

imgaug

Image augmentation for machine learning experiments.

Language:PythonLicense:MITStargazers:14287Issues:231Issues:514

albumentations

Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

Language:PythonLicense:MITStargazers:13804Issues:129Issues:962

Llama-Chinese

Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用

ludwig

Low-code framework for building custom LLMs, neural networks, and other AI models

Language:PythonLicense:Apache-2.0Stargazers:11016Issues:193Issues:1061

snipe-it

A free open source IT asset/license management system

Language:PHPLicense:AGPL-3.0Stargazers:10498Issues:345Issues:10621

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:9290Issues:97Issues:630

Parsr

Transforms PDF, Documents and Images into Enriched Structured Data

Language:JavaScriptLicense:Apache-2.0Stargazers:5724Issues:81Issues:163

donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Language:PythonLicense:MITStargazers:5563Issues:46Issues:291

kalidokit

Blendshape and kinematics calculator for Mediapipe/Tensorflow.js Face, Eyes, Pose, and Finger tracking models.

Language:TypeScriptLicense:MITStargazers:5271Issues:82Issues:0

sumy

Module for automatic summarization of text documents and HTML pages.

Language:PythonLicense:Apache-2.0Stargazers:3464Issues:112Issues:122

SysMocap

A real-time motion capture system for 3D virtual character animating.

Language:JavaScriptLicense:MPL-2.0Stargazers:2437Issues:35Issues:55

deepdoctection

A Repo For Document AI

Language:PythonLicense:Apache-2.0Stargazers:2396Issues:17Issues:173

gemma

Open weights LLM from Google DeepMind.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2269Issues:32Issues:27

SadTalker-Video-Lip-Sync

本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形,设置面部区域可配置的增强方式进行合成唇形(人脸)区域画面增强,提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧,补充帧间合成唇形的动作过渡,使合成的唇形更为流畅、真实以及自然。

pet

This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"

Language:PythonLicense:Apache-2.0Stargazers:1617Issues:47Issues:96

awesome-industrial-anomaly-detection

Paper list and datasets for industrial image anomaly/defect detection (updating). 工业异常/瑕疵检测论文及数据集检索库(持续更新)。

SystemAnimatorOnline

XR Animator, AI-based Full Body Motion Capture and Extended Reality (XR) solution, powered by System Animator Online

avatars4all

Live real-time avatars from your webcam in the browser. No dedicated hardware or software installation needed. A pure Google Colab wrapper for live First-order-motion-model, aka Avatarify in the browser. And other Colabs providing an accessible interface for using FOMM, Wav2Lip and Liquid-warping-GAN with your own media and a rich GUI.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:325Issues:24Issues:4

Audio2Head

code for paper "Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion" in the conference of IJCAI 2021

Data_Augmentation_Zoo_for_Object_Detection

Includes: Learning data augmentation strategies for object detection | GridMask data augmentation | Augmentation for small object detection in Numpy. Use RetinaNet with ResNet-18 to test these methods on VOC and KITTI.

Awesome-LLM-Tabular

Awesome-LLM-Tabular: a curated list of Large Language Model applied to Tabular Data

MetaDialog

Platform for few-shot natural language processing: Text Classification, Sequene Labeling.

Language:PythonLicense:Apache-2.0Stargazers:220Issues:9Issues:13

DiffGesture

[CVPR 2023] Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation

Language:PythonLicense:GPL-3.0Stargazers:215Issues:12Issues:24

Data-Augmentation-for-Object-Detection-YOLO-

This is a python based library to augment the training dataset for object detection using YOLO.

data_generator_object_detection_2d

A data generator for 2D object detection

Language:PythonLicense:GPL-3.0Stargazers:86Issues:3Issues:1

AFGIC

Awesome Fine-Grained Image Classification

Language:JavaScriptLicense:MITStargazers:58Issues:2Issues:1

VU-VRM

Lip-sync VRM avatar client for zero-webcam mic-based vtubing

Monica

an amazing virtual anchor with MediaPipe and Kalidokit

Language:JavaScriptStargazers:9Issues:2Issues:0

llm-open-domain-table-reasoner

Official implementation of OpenTab (ICLR2024)

Language:PythonLicense:Apache-2.0Stargazers:6Issues:2Issues:0