snjstudent's starred repositories

sd-webui-controlnet

WebUI extension for ControlNet

Language:PythonLicense:GPL-3.0Stargazers:16580Issues:149Issues:1473

dvc

🦉 ML Experiments and Data Management with Git

Language:PythonLicense:Apache-2.0Stargazers:13444Issues:141Issues:4654

yolov7

Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:13082Issues:113Issues:1806

pytorch-template

PyTorch deep learning projects made easy.

Language:PythonLicense:MITStargazers:4656Issues:55Issues:63

Tune-A-Video

[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

Language:PythonLicense:Apache-2.0Stargazers:4157Issues:49Issues:95

arXivTimes

repository to research & share the machine learning articles

pyppeteer

Headless chrome/chromium automation library (unofficial port of puppeteer)

Language:PythonLicense:NOASSERTIONStargazers:3552Issues:48Issues:326

vall-e

An unofficial PyTorch implementation of the audio LM VALL-E

Language:PythonLicense:MITStargazers:2923Issues:89Issues:97

sam

SAM: Sharpness-Aware Minimization (PyTorch)

Language:PythonLicense:MITStargazers:1712Issues:12Issues:81

PantoMatrix

PantoMatrix: Co-Speech Talking Head and Gestures Generation

Language:PythonLicense:NOASSERTIONStargazers:882Issues:50Issues:150

inaSpeechSegmenter

CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.

Language:PythonLicense:MITStargazers:726Issues:23Issues:72

awesome-japanese-nlp-resources

A curated list of resources dedicated to Python libraries, LLMs, dictionaries, and corpora of NLP for Japanese

AnimeInterp

The code for CVPR21 paper "Deep Animation Video Interpolation in the Wild"

anime-face-detector

Anime Face Detector using mmdet and mmpose

Language:PythonLicense:MITStargazers:392Issues:8Issues:20

DiffGAN-TTS

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

Language:PythonLicense:MITStargazers:304Issues:9Issues:27

bizarre-pose-estimator

WACV2022: Transfer Learning for Pose Estimation of Illustrated Characters

Language:PythonLicense:AGPL-3.0Stargazers:215Issues:12Issues:9

ReazonSpeech

Massive open Japanese speech corpus

Language:PythonLicense:Apache-2.0Stargazers:209Issues:7Issues:20

HA2G

[CVPR 2022] Code for "Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation"

Language:PythonLicense:GPL-3.0Stargazers:124Issues:4Issues:20

infogcn

Official implementation for "InfoGCN: Representation Learning for Human Skeleton-Based Action Recognition"

segmentation-kit

Speech Segmentation Toolkit using Julius

Language:PerlLicense:MITStargazers:85Issues:11Issues:5

apex

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

Language:PythonLicense:BSD-3-ClauseStargazers:42Issues:1Issues:0
Language:PythonLicense:Apache-2.0Stargazers:31Issues:1Issues:1
Language:PythonLicense:MITStargazers:17Issues:4Issues:1

segmentation-kit

Speech Segmentation Toolkit using Julius

Language:PerlLicense:MITStargazers:17Issues:3Issues:0

TextGridConverter

convert .lab files to .TextGrid files, which can be used in Praat

Language:PythonLicense:MITStargazers:14Issues:1Issues:1

FairseqTutorial

Fairseq初心者のための日本語チュートリアルです.

Language:Jupyter NotebookStargazers:10Issues:1Issues:1

jvs_r9y9

JVS (Japanese versatile speech) コーパスの自作のラベル

Language:ShellStargazers:4Issues:1Issues:0