Dominik Schiller's starred repositories

ML-For-Beginners

12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all

llama

Inference code for Llama models

Language: Python | License: NOASSERTION | Stargazers: 56089 | Issues: 527 | Issues: 970

Deep-Live-Cam

real time face swap and one-click video deepfake with only a single image

Language: Python | License: AGPL-3.0 | Stargazers: 38918 | Issues: 247 | Issues: 494

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Language: Python | License: MIT | Stargazers: 20185 | Issues: 153 | Issues: 265

minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Language: Python | License: MIT | Stargazers: 20028 | Issues: 257 | Issues: 72

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language: Python | License: BSD-2-Clause | Stargazers: 11978 | Issues: 136 | Issues: 698

dolly

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Language: Python | License: Apache-2.0 | Stargazers: 10812 | Issues: 137 | Issues: 162

ffmpeg-python

Python bindings for FFmpeg - with complex filtering support

Language: Python | License: Apache-2.0 | Stargazers: 9976 | Issues: 113 | Issues: 709

speechbrain

A PyTorch-based Speech Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 8760 | Issues: 134 | Issues: 1093

mm-cot

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

Language: Python | License: Apache-2.0 | Stargazers: 3787 | Issues: 55 | Issues: 52

whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Language: Jupyter Notebook | License: BSD-2-Clause | Stargazers: 3563 | Issues: 42 | Issues: 187

TensorFlow.NET

.NET Standard bindings for Google's TensorFlow for developing, training and deploying Machine Learning models in C# and F#.

Language: C# | License: Apache-2.0 | Stargazers: 3242 | Issues: 123 | Issues: 835

musiclm-pytorch

Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch

Language: Python | License: MIT | Stargazers: 3143 | Issues: 98 | Issues: 53

llama2-webui

Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Use `llama2-wrapper` as your local llama2 backend for Generative Agents/Apps.

Language: Jupyter Notebook | License: MIT | Stargazers: 1963 | Issues: 24 | Issues: 46

toolformer-pytorch

Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI

Language: Python | License: MIT | Stargazers: 1953 | Issues: 38 | Issues: 16

awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

CLAP

Contrastive Language-Audio Pretraining

Language: Python | License: CC0-1.0 | Stargazers: 1379 | Issues: 29 | Issues: 92

lhotse

Tools for handling speech data in machine learning projects.

Language: Python | License: Apache-2.0 | Stargazers: 941 | Issues: 44 | Issues: 413

mellotron

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 854 | Issues: 30 | Issues: 95

gridfinity-catalog

Catalog of Gridfinity Designs and Other Resources

Quantus

Quantus is an eXplainable AI toolkit for responsible evaluation of neural network explanations

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 545 | Issues: 10 | Issues: 131

CLAP

Learning audio concepts from natural language supervision

Language: Python | License: MIT | Stargazers: 470 | Issues: 14 | Issues: 21

Awesome-LLM-hallucination

LLM hallucination paper list

ViTPose_pytorch

An unofficial implementation of ViTPose [Y. Xu et al., 2022]

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 102 | Issues: 1 | Issues: 18

deep_audio_features

Pytorch implementation of deep audio embedding calculation

Language: Python | License: MIT | Stargazers: 95 | Issues: 6 | Issues: 33

SyncPy

SyncPy is a novel open-source analytic library for investigating synchrony in a fast and exhaustive way.

LexC-Gen

Generate synthetic labeled data for extremely low-resource languages using bilingual lexicons.

Language: Python | Stargazers: 12 | Issues: 3 | Issues: 0

Language: Python | Stargazers: 3 | Issues: 0 | Issues: 0