Qingsong Liu (pineking)

pineking

Geek Repo

Company:@Unisound @unisound-ail

Location:China

Github PK Tool:Github PK Tool


Organizations
kubeflow

Qingsong Liu's repositories

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

dreamtalk

Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

License:MITStargazers:0Issues:0Issues:0

Emu

Emu: An Open Multimodal Generalist

Stargazers:0Issues:0Issues:0

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

FiT

FiT: Flexible Vision Transformer for Diffusion Model

License:Apache-2.0Stargazers:0Issues:0Issues:0

generative-models

Generative Models by Stability AI

License:MITStargazers:0Issues:0Issues:0

genmusic_demo_list

a list of demo websites for automatic music generation research

Stargazers:0Issues:0Issues:0

GPTCache

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

GPTQ-for-LLaMa

4 bits quantization of LLaMA using GPTQ

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

llama.cpp

Port of Facebook's LLaMA model in C/C++

License:MITStargazers:0Issues:0Issues:0

LLaMA2-Accessory

An Open-source Toolkit for LLM Development

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

LLaVA

Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.

License:Apache-2.0Stargazers:0Issues:0Issues:0

LLM-groundedDiffusion

LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusion: LMD)

Stargazers:0Issues:0Issues:0
License:NOASSERTIONStargazers:0Issues:0Issues:0

Monkey

Monkey (LMM); 多模态大模型 华科小猴子

License:MITStargazers:0Issues:0Issues:0

Multi-Modality-Arena

Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, BLIP-2, and many more!

Language:PythonStargazers:0Issues:1Issues:0

MultimodalOCR

On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)

Stargazers:0Issues:0Issues:0

nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

License:MITStargazers:0Issues:0Issues:0

Open-AnimateAnyone

Unofficial Implementation of Animate Anyone

Language:PythonStargazers:0Issues:0Issues:0

open_flamingo

An open-source framework for training large multimodal models.

License:MITStargazers:0Issues:0Issues:0

PhotoMaker

PhotoMaker

License:NOASSERTIONStargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:0Issues:0

Prompt-Engineering-Guide

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

License:MITStargazers:0Issues:0Issues:0

text-generation-inference

Large Language Model Text Generation Inference

License:Apache-2.0Stargazers:0Issues:0Issues:0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonLicense:MITStargazers:0Issues:1Issues:0