MikeLuck (mikeluck)

mikeluck

Geek Repo

Location:shenzhen

Github PK Tool:Github PK Tool

MikeLuck's starred repositories

pipecat

Open Source framework for voice and multimodal conversational AI

Language:PythonLicense:BSD-2-ClauseStargazers:2589Issues:0Issues:0

sglang

SGLang is yet another fast serving framework for large language models and vision language models.

Language:PythonLicense:Apache-2.0Stargazers:3439Issues:0Issues:0

SpeechGPT

SpeechGPT Series: Speech Large Language Models

Language:PythonLicense:Apache-2.0Stargazers:1118Issues:0Issues:0

descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Language:PythonLicense:MITStargazers:1048Issues:0Issues:0

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonLicense:MITStargazers:20317Issues:0Issues:0

build-nanogpt

Video+code lecture on building nanoGPT from scratch

Language:PythonStargazers:3134Issues:0Issues:0

LLM101n

LLM101n: Let's build a Storyteller

Stargazers:26019Issues:0Issues:0

MobileLLM

MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.

Language:PythonLicense:NOASSERTIONStargazers:866Issues:0Issues:0

AudioNotes

快速提取音视频内容,整理成一份结构化的markdown笔记

Language:PythonLicense:MITStargazers:576Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:2567Issues:0Issues:0

open_lm

A repository for research on medium sized language models.

Language:PythonLicense:MITStargazers:432Issues:0Issues:0

UltraPixel

Implementation of UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks

Language:PythonLicense:AGPL-3.0Stargazers:352Issues:0Issues:0

florence2-finetuning

Quick exploration into fine tuning florence 2

Language:Jupyter NotebookLicense:MITStargazers:215Issues:0Issues:0

hammal

docker-registry proxy run in cloudflare workers

Language:TypeScriptStargazers:64Issues:0Issues:0

hammal

docker-registry proxy run in cloudflare workers

Language:TypeScriptStargazers:519Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:85Issues:0Issues:0

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language:Jupyter NotebookLicense:MITStargazers:23906Issues:0Issues:0

OneLLM

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

Language:PythonLicense:NOASSERTIONStargazers:526Issues:0Issues:0

LLaMA-Adapter

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Language:PythonLicense:GPL-3.0Stargazers:5626Issues:0Issues:0

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

Stargazers:10914Issues:0Issues:0

CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Language:PythonLicense:Apache-2.0Stargazers:2891Issues:0Issues:0

SenseVoice

Multilingual Voice Understanding Model

Language:PythonLicense:NOASSERTIONStargazers:1727Issues:0Issues:0

graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Language:PythonLicense:MITStargazers:13287Issues:0Issues:0
Language:PythonStargazers:231Issues:0Issues:0

cloudflare-docker-proxy

A docker registry proxy run on cloudflare worker.

Language:JavaScriptStargazers:968Issues:0Issues:0

LAS

LAS Specification

Language:PythonStargazers:141Issues:0Issues:0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:18419Issues:0Issues:0

python-markdownify

Convert HTML to Markdown

Language:PythonLicense:MITStargazers:911Issues:0Issues:0

RAGatouille

Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.

Language:PythonLicense:Apache-2.0Stargazers:2548Issues:0Issues:0

piccolo-embedding

code for piccolo embedding model from SenseTime

Language:PythonStargazers:65Issues:0Issues:0