Zhijian's starred repositories

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language: Python | License: MIT | Stargazers: 31658 | Issues: 0

AI-Vtuber

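AI Vtuber is a virtual streamer [Live2D/UE/xuniren] driven by [ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/Wenda/Qwen/kimi/ollama]. It can interact with viewers in real time during live streams on [Bilibili/Douyin/Kuaishou/WeChat Channels/Pinduoduo/Douyu/YouTube/twitch/TikTok], or chat directly on the local machine. It uses TTS technology [edge-tts/VITS/elevenlabs/bark/bert-vits2/Reecho (睿声)] to generate replies, with optional voice conversion via [so-vits-svc/DDSP-SVC]; commands can also trigger SD image generation.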

Language: Python | License: GPL-3.0 | Stargazers: 2712 | Issues: 0

llama.cpp

LLM inference in C/C++

Language: C++ | License: MIT | Stargazers: 64137 | Issues: 0

InternLM

Official release of InternLM2.5 base and chat models. 1M context support

Language: Python | License: Apache-2.0 | Stargazers: 6144 | Issues: 0

wild-gaussians

WildGaussians: 3D Gaussian Splatting In the Wild

Language: Python | License: NOASSERTION | Stargazers: 234 | Issues: 0

cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Language: Python | License: Apache-2.0 | Stargazers: 1670 | Issues: 0

Marigold

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Language: Python | License: Apache-2.0 | Stargazers: 2139 | Issues: 0

Megatron-LLM

distributed trainer for LLMs

Language: Python | License: NOASSERTION | Stargazers: 517 | Issues: 0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language: Python | License: MIT | Stargazers: 19414 | Issues: 0

MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Language: Python | License: Apache-2.0 | Stargazers: 11147 | Issues: 0

Firefly

Firefly: a large-model training tool that supports training Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models

Language: Python | Stargazers: 5555 | Issues: 0

unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language: Python | License: Apache-2.0 | Stargazers: 14802 | Issues: 0

Megatron-LM

Ongoing research training transformer models at scale

Language: Python | License: NOASSERTION | Stargazers: 9776 | Issues: 0

gradio-pdf

Source code of the gradio_pdf custom component.

Language: JavaScript | License: MIT | Stargazers: 16 | Issues: 0

TextBox

TextBox 2.0 is a text generation library with pre-trained language models

Language: Python | License: MIT | Stargazers: 1068 | Issues: 0

CDial-GPT

A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models

Language: Python | License: MIT | Stargazers: 1750 | Issues: 0

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language: Python | License: Apache-2.0 | Stargazers: 29632 | Issues: 0

coco-cn

Enriching MS-COCO with Chinese sentences and tags for cross-lingual multimedia tasks

Language: OpenEdge ABL | License: MIT | Stargazers: 177 | Issues: 0

awesome-grounding

awesome grounding: A curated list of research papers in visual grounding

License: MIT | Stargazers: 993 | Issues: 0

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 9543 | Issues: 0

Language: Jupyter Notebook | License: MIT | Stargazers: 959 | Issues: 0

Document-Layout-Analysis

Object Detection Model for Scanned Documents

Language: Jupyter Notebook | License: MIT | Stargazers: 72 | Issues: 0

Chinese-LangChain

Chinese LangChain project | 小必应 (Little Bing), Q.Talk, 强聊, QiangTalk

Language: Python | Stargazers: 2669 | Issues: 0

Baichuan-7B

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Language: Python | License: Apache-2.0 | Stargazers: 5668 | Issues: 0

lit-llama

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Language: Python | License: Apache-2.0 | Stargazers: 5933 | Issues: 0

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python | License: MIT | Stargazers: 35906 | Issues: 0

Language: Jupyter Notebook | Stargazers: 95 | Issues: 0

VisualGLM-6B

Chinese and English multimodal conversational language model

Language: Python | License: Apache-2.0 | Stargazers: 4064 | Issues: 0