Matthieu Le Cauchois's starred repositories

professional-programming

A collection of learning resources for curious software engineers

Language: Python | License: MIT | Stargazers: 46115 | Issues: 989 | Issues: 28

bark

🔊 Text-Prompted Generative Audio Model

Language: Jupyter Notebook | License: MIT | Stargazers: 34913 | Issues: 320 | Issues: 428

darknet

Convolutional Neural Networks

Language: C | License: NOASSERTION | Stargazers: 25630 | Issues: 913 | Issues: 2367

so-vits-svc

SoftVC VITS Singing Voice Conversion

Language: Python | License: AGPL-3.0 | Stargazers: 25130 | Issues: 174 | Issues: 130

EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.

Language: Python | License: Apache-2.0 | Stargazers: 23394 | Issues: 312 | Issues: 975

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python | License: Apache-2.0 | Stargazers: 18793 | Issues: 158 | Issues: 1446

py-spy

Sampling profiler for Python programs

Language: Rust | License: MIT | Stargazers: 12338 | Issues: 112 | Issues: 351

AnimateDiff

Official implementation of AnimateDiff.

Language: Python | License: Apache-2.0 | Stargazers: 10089 | Issues: 103 | Issues: 341

surya

OCR, layout analysis, reading order, line detection in 90+ languages

Language: Python | License: GPL-3.0 | Stargazers: 9588 | Issues: 78 | Issues: 117

StreamDiffusion

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Language: Python | License: Apache-2.0 | Stargazers: 9400 | Issues: 77 | Issues: 112

PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

Language: C++ | License: MIT | Stargazers: 7787 | Issues: 76 | Issues: 154

StableCascade

Official Code for Stable Cascade

Language: Jupyter Notebook | License: MIT | Stargazers: 6494 | Issues: 62 | Issues: 121

donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Language: Python | License: MIT | Stargazers: 5634 | Issues: 46 | Issues: 292

Tensor-Puzzles

Solve puzzles. Improve your pytorch.

Language: Jupyter Notebook | License: MIT | Stargazers: 2995 | Issues: 12 | Issues: 19

DiffSinger

An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism

Language: Python | License: Apache-2.0 | Stargazers: 2654 | Issues: 36 | Issues: 98

sd-webui-deforum

Deforum extension for AUTOMATIC1111's Stable Diffusion webui

Language: Python | License: NOASSERTION | Stargazers: 2650 | Issues: 41 | Issues: 414

stable-audio-tools

Generative models for conditional audio generation

Language: Python | License: MIT | Stargazers: 2431 | Issues: 43 | Issues: 82

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️ 🍸 🍹 🍷

Language: Python | License: Apache-2.0 | Stargazers: 1319 | Issues: 12 | Issues: 118

WikiChat

WikiChat stops the hallucination of large language models by retrieving data from Wikipedia.

Language: Python | License: Apache-2.0 | Stargazers: 955 | Issues: 15 | Issues: 17

LLM-Training-Puzzles

What would you do with 1000 H100s...

Language: Jupyter Notebook | License: MIT | Stargazers: 809 | Issues: 11 | Issues: 3

augraphy

Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes

Language: Python | License: MIT | Stargazers: 325 | Issues: 11 | Issues: 135

fm-cheatsheet

Website for hosting the Open Foundation Models Cheat Sheet.

Language: Python | License: LGPL-2.1 | Stargazers: 251 | Issues: 9 | Issues: 21

so-vits-svc-4.0

SoftVC VITS Singing Voice Conversion

Language: Python | License: BSD-3-Clause | Stargazers: 236 | Issues: 2 | Issues: 0

ThaumatoAnakalyptor

Automatic Scroll Segmentation Pipeline for CT Scans of Herculaneum papyri

Language: Python | License: MIT | Stargazers: 71 | Issues: 5 | Issues: 4

UNMT

Code inspired by Unsupervised Machine Translation Using Monolingual Corpora Only

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 50 | Issues: 3 | Issues: 2

lotemplate

LOTemplate is a document generator used to create documents programmatically (ODT, DOCX, PDF) from a template (DOCX or ODT) and a JSON file.

Language: Rich Text Format | License: AGPL-3.0 | Stargazers: 23 | Issues: 6 | Issues: 7

streetview-diffusion

Google Street View with Stable Diffusion + ControlNet

Language: TypeScript | License: MIT | Stargazers: 19 | Issues: 2 | Issues: 1
Language: C++ | Stargazers: 3 | Issues: 0 | Issues: 0