anbo724's starred repositories

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Language:PythonLicense:Apache-2.0Stargazers:40154Issues:392Issues:1291

Langchain-Chatchat

Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain

Language:TypeScriptLicense:Apache-2.0Stargazers:30285Issues:281Issues:3642

Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Language:PythonLicense:Apache-2.0Stargazers:18000Issues:185Issues:730

pyecharts

🎨 Python Echarts Plotting Library

Language:PythonLicense:MITStargazers:14688Issues:378Issues:1890

PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Language:PythonLicense:Apache-2.0Stargazers:10720Issues:184Issues:1894

fish-speech

Brand new TTS solution

Language:PythonLicense:NOASSERTIONStargazers:6737Issues:58Issues:270

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Language:PythonLicense:MITStargazers:6543Issues:56Issues:201

Firefly

Firefly: 大模型训练工具,支持训练Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

AugLy

A data augmentations library for audio, image, text, and video.

Language:PythonLicense:NOASSERTIONStargazers:4926Issues:78Issues:74

cube-studio

cube studio开源云原生一站式机器学习/深度学习/大模型AI平台,支持sso登录,多租户,大数据平台对接,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU,边缘计算,serverless,标注平台,自动化标注,数据集管理,大模型微调,vllm大模型推理,llmops,私有知识库,AI模型应用商店,支持模型一键开发/推理/微调,支持国产cpu/gpu/npu芯片,支持RDMA,支持pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/spark/ray/volcano分布式

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:3275Issues:73Issues:142

MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。

Language:PythonLicense:Apache-2.0Stargazers:3038Issues:33Issues:370

bark-with-voice-clone

🔊 Text-prompted Generative Audio Model - With the ability to clone voices

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:2984Issues:47Issues:77

chatglm_finetuning

chatglm 6b finetuning and alpaca finetuning

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

LLM-Tuning

Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.

TCM-Ancient-Books

中医药古籍文本,近700项

SmartCharts

🔥数据可视化,大屏, 支持Echarts,SQL,API,VUE,可用于Jupyter, 比pyecharts容易, 极低门槛,拿来即用,比拖拽方便,项目插件或独立平台皆可, 简单, 敏捷, 高效, 通用化, 高度可定制化,为你完全打通前后端, 图形数据联动, 筛选开发毫无压力, 数据缓存处理机制让报表快人一步

Language:HTMLLicense:Apache-2.0Stargazers:619Issues:10Issues:2

emotion2vec

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Text-Image-Augmentation-python

Python implementation of Text-Image-Augmentation

Language:PythonLicense:Apache-2.0Stargazers:240Issues:4Issues:6

Awesome-Medical-Healthcare-Dataset-For-LLM

A curated list of popular Datasets, Models and Papers for LLMs in Medical/Healthcare

License:MITStargazers:130Issues:3Issues:0

wav2vec2-large-xlsr-53-th

Finetune wav2vec2-large-xlsr-53 with Thai Common Voice Corpus 7.0

Language:Jupyter NotebookLicense:CC-BY-SA-4.0Stargazers:45Issues:8Issues:3

OpenConcepts

中文概念图谱OpenConcepts

License:Apache-2.0Stargazers:39Issues:2Issues:0

wav2vec_finetune

ASR: fine-tune wav2vec 2.0 with transformers

Language:PythonStargazers:15Issues:1Issues:0

Tibetan-Computational-linguistics

藏语计算语言学开放资源

Stargazers:7Issues:0Issues:0

Evol

code for Disentangling the cultural evolution of ancient China: a digital humanities perspective

Language:PythonLicense:MITStargazers:6Issues:0Issues:0

phd_model

Multilingual Wav2Vec2 XLSR Phone Recognition Model optimized with Common Phone

Language:PythonLicense:CC0-1.0Stargazers:2Issues:0Issues:0

search-engine

基于django和elasticsearch的简单搜索引擎,具有摘要和高亮效果

Language:PythonStargazers:2Issues:0Issues:0