Kunkka (Captain-F)

Captain-F

Geek Repo

Company:Nanjing University

Location:Nanjing, China

Github PK Tool:Github PK Tool

Kunkka's starred repositories

QuantumSpeech-QCNN

IEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition

Language:Jupyter NotebookStargazers:90Issues:0Issues:0

llama2-webui

Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Use `llama2-wrapper` as your local llama2 backend for Generative Agents/Apps.

Language:Jupyter NotebookLicense:MITStargazers:1957Issues:0Issues:0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonLicense:Apache-2.0Stargazers:24500Issues:0Issues:0

ControlVideo

[ICLR 2024] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"

Language:PythonLicense:MITStargazers:747Issues:0Issues:0

milvus

A cloud-native vector database, storage for next generation AI applications

Language:GoLicense:Apache-2.0Stargazers:28754Issues:0Issues:0

haystack

:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

Language:PythonLicense:Apache-2.0Stargazers:15057Issues:0Issues:0

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language:PythonLicense:BSD-3-ClauseStargazers:25217Issues:0Issues:0

Linly

Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集

Language:PythonStargazers:3018Issues:0Issues:0

Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Language:PythonLicense:Apache-2.0Stargazers:18025Issues:0Issues:0

MMSA

MMSA is a unified framework for Multimodal Sentiment Analysis.

Language:PythonLicense:MITStargazers:627Issues:0Issues:0

S.A.T.U.R.D.A.Y

A toolbox for working with WebRTC, Audio and AI

Language:GoLicense:MITStargazers:660Issues:0Issues:0

detectron2

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Language:PythonLicense:Apache-2.0Stargazers:29680Issues:0Issues:0

LxgwNeoZhiSong

A Chinese serif font derived from IPAmj Mincho. 一款衍生于「IPAmj明朝」的中文宋体字型。

License:NOASSERTIONStargazers:610Issues:0Issues:0

Douyin_TikTok_Download_API

🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音、快手、TikTok、Bilibili数据爬取工具,支持API调用,在线批量解析及下载。

Language:PythonLicense:Apache-2.0Stargazers:8239Issues:0Issues:0

pennylane

PennyLane is a cross-platform Python library for quantum computing, quantum machine learning, and quantum chemistry. Train a quantum computer the same way as a neural network.

Language:PythonLicense:Apache-2.0Stargazers:2230Issues:0Issues:0

quantum

Hybrid Quantum-Classical Machine Learning in TensorFlow

Language:PythonLicense:Apache-2.0Stargazers:1770Issues:0Issues:0

qtransformer

Quantum-enhanced transformer neural network

Language:PythonStargazers:106Issues:0Issues:0

VALOR

Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset

Language:PythonLicense:MITStargazers:252Issues:0Issues:0

musicaiz

A python framework for symbolic music generation, evaluation and analysis

Language:PythonLicense:AGPL-3.0Stargazers:162Issues:0Issues:0

LLM-in-Vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

Stargazers:811Issues:0Issues:0

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

Stargazers:11039Issues:0Issues:0

Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Language:PythonLicense:BSD-3-ClauseStargazers:2644Issues:0Issues:0

MelGAN-VC

MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms

Language:Jupyter NotebookLicense:MITStargazers:226Issues:0Issues:0

groove2groove

Code for "Groove2Groove: One-Shot Music Style Transfer with Supervision from Synthetic Data"

Language:PythonLicense:BSD-3-ClauseStargazers:152Issues:0Issues:0

MusicTransformer-Pytorch

MusicTransformer written for MaestroV2 using the Pytorch framework for music generation

Language:PythonLicense:MITStargazers:226Issues:0Issues:0

Multimodal-GPT

Multimodal-GPT

Language:PythonLicense:Apache-2.0Stargazers:1457Issues:0Issues:0

EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Language:PythonLicense:Apache-2.0Stargazers:23141Issues:0Issues:0

ImageBind

ImageBind One Embedding Space to Bind Them All

Language:PythonLicense:NOASSERTIONStargazers:8139Issues:0Issues:0

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:54939Issues:0Issues:0

chineseocr

yolo3+ocr

Language:PythonLicense:MITStargazers:5889Issues:0Issues:0