忧郁的狗子's starred repositories

ohmyzsh

🙃 A delightful community-driven (with 2,300+ contributors) framework for managing your zsh configuration. Includes 300+ optional plugins (rails, git, macOS, hub, docker, homebrew, node, php, python, etc), 140+ themes to spice up your morning, and an auto-update tool so that makes it easy to keep up with the latest updates from the community.

leetcode-master

《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀

yolov5

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Language:PythonLicense:AGPL-3.0Stargazers:48746Issues:364Issues:9179

faiss

A library for efficient similarity search and clustering of dense vectors.

Mask_RCNN

Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow

Language:PythonLicense:NOASSERTIONStargazers:24427Issues:582Issues:2749

insightface

State-of-the-art 2D and 3D Face Analysis Project

Language:PythonLicense:MITStargazers:22121Issues:503Issues:2439

ml-visuals

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonLicense:Apache-2.0Stargazers:11015Issues:200Issues:2162

pydub

Manipulate audio with a simple and easy high level interface

Language:PythonLicense:MITStargazers:8618Issues:135Issues:571

tensorboardX

tensorboard for pytorch (and chainer, mxnet, numpy, ...)

Language:PythonLicense:MITStargazers:7835Issues:85Issues:450

bertviz

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

Language:PythonLicense:Apache-2.0Stargazers:6620Issues:71Issues:123

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language:Jupyter NotebookLicense:MITStargazers:5574Issues:70Issues:977

3D-ResNets-PyTorch

3D ResNets for Action Recognition (CVPR 2018)

Language:PythonLicense:MITStargazers:3849Issues:58Issues:269

pytorch-yolo-v3

A PyTorch implementation of the YOLO v3 object detection algorithm

Pytorch_Retinaface

Retinaface get 80.99% in widerface hard val using mobilenet0.25.

Language:PythonLicense:MITStargazers:2553Issues:42Issues:198

kenlm

KenLM: Faster and Smaller Language Model Queries

Language:C++License:NOASSERTIONStargazers:2458Issues:69Issues:366

imageio

Python library for reading and writing image data

Language:PythonLicense:BSD-2-ClauseStargazers:1443Issues:31Issues:596

k2

FSA/FST algorithms, differentiable, with PyTorch compatibility.

Language:CudaLicense:Apache-2.0Stargazers:1086Issues:76Issues:374

torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Language:PythonLicense:MITStargazers:908Issues:11Issues:105

pyannote-video

Face detection, tracking and clustering in videos

Language:PythonLicense:MITStargazers:431Issues:21Issues:36

TalkNet-ASD

ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'

Language:PythonLicense:MITStargazers:280Issues:7Issues:66

fast_rnnt

A torch implementation of a recursion which turns out to be useful for RNN-T.

Language:PythonLicense:NOASSERTIONStargazers:119Issues:8Issues:16

awesome-python-machine-learning-resources

a collection of awesome machine learning and deep learning Python libraries&tools. 热门实用机器学习和深入学习Python库和工具的集合

gantt

simple gantt chart in python

Language:PythonLicense:MITStargazers:84Issues:10Issues:7

text

Text utilities, including beam search decoding, tokenizing, and more, built for use in Flashlight.

Language:C++License:MITStargazers:62Issues:13Issues:20

MSDWILD

[INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.

Language:HTMLLicense:NOASSERTIONStargazers:32Issues:4Issues:3

DyViSE

Dynamic vision-guided speaker embedding for audio-visual speaker diarization