linzai1992

followers

following

stars

@Microsoft

Suzhou

Yunlin Chen's starred repositories

dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

Language:TypeScriptNOASSERTION29390 234 1892

llama3

The official Meta Llama 3 GitHub site

Language:PythonNOASSERTION20213 163 145

reflex

🕸️ Web apps in pure Python 🐍

Language:PythonApache-2.017066 127 1340

bilibili-API-collect

哔哩哔哩-API收集整理【不断更新中....】

Language:JavaScriptNOASSERTION13387 101 533

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language:Jupyter NotebookNOASSERTION6835 85 95

storm

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Language:PythonMIT4320 37 25

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonMIT3505 108 53

img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Language:PythonMIT3302 30 248

champ

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Language:PythonApache-2.03222 174 86

torchtune

A Native-PyTorch Library for LLM Fine-tuning

Language:PythonBSD-3-Clause3184 38 267

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language:PythonApache-2.03010 25 110

aiXcoder-7B

official repository of aiXcoder-7B Code Large Language Model

Language:PythonApache-2.02112 19 27

StableSR

Exploiting Diffusion Prior for Real-World Image Super-Resolution

Language:PythonNOASSERTION1871 22 129

XNNPACK

High-efficiency floating-point neural network inference operators for mobile, server, and Web

Language:CNOASSERTION1721 53 206

Realtime_PyAudio_FFT

Realtime audio analysis in Python, using PyAudio and Numpy to extract and visualize FFT features from streaming audio.

Language:PythonMIT932 30 23

mmc4

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

Language:PythonMIT871 9 17

Matcha-TTS

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Language:Jupyter NotebookMIT401 12 40

scrapetube

A YouTube scraper for scraping channels, playlists, and searching 🔎

Language:PythonMIT282 14 47

StableTTS

Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3

Language:PythonMIT252 26 12

VoiceFlow-TTS

[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"

Language:Python244 16 12

dataspeech

Language:PythonMIT194 12 8

Awesome-LLM-related-Papers-Comprehensive-Topics

Awesome LLM-related papers and repos on very comprehensive topics.

shutterscrape

Web scrapper for Shutterstock

Language:PythonMIT139 7 14

VideoRecap

Language:PythonMIT133 2 10

EnCLAP

Official Implementation of EnCLAP

Language:PythonMIT67 3 6

ppgs

High-Fidelity Neural Phonetic Posteriorgrams

Language:PythonMIT55 6 11

Open-Sora-Dataset

Language:Python50 6 5

HQ-Edit

HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing

Language:PythonNOASSERTION4400

audio-pipeline

Language:PythonApache-2.0900

Defuzers

Image generation UI for diffusers.

Language:PythonNOASSERTION800