Mingming-Yin (KelleyYin)

KelleyYin

Geek Repo

Company:Sogou MT, Tencent

Location:Hanzhou & Suzhou & Beijing

Github PK Tool:Github PK Tool

Mingming-Yin's starred repositories

llama_index

LlamaIndex is a data framework for your LLM applications

Language:PythonLicense:MITStargazers:35126Issues:246Issues:4951

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:31948Issues:195Issues:1147

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:29968Issues:195Issues:4695

insightface

State-of-the-art 2D and 3D Face Analysis Project

Language:PythonLicense:MITStargazers:22692Issues:510Issues:2457

analysis-ik

🚌 The IK Analysis plugin integrates Lucene IK analyzer into Elasticsearch and OpenSearch, support customized dictionary.

Language:JavaLicense:Apache-2.0Stargazers:16441Issues:595Issues:958

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Language:PythonLicense:MITStargazers:11655Issues:169Issues:229

mistral-inference

Official inference library for Mistral models

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:9473Issues:121Issues:136

accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Language:PythonLicense:Apache-2.0Stargazers:7607Issues:95Issues:1549

AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Language:PythonLicense:MITStargazers:4288Issues:32Issues:449

NeMo-Guardrails

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

Language:PythonLicense:NOASSERTIONStargazers:3928Issues:35Issues:329

wikiextractor

A tool for extracting plain text from Wikipedia dumps

Language:PythonLicense:AGPL-3.0Stargazers:3723Issues:74Issues:243

work-in-australia

Work in Australia as a Developer / 程序员如何申请到澳洲工作

GenerativeAIExamples

Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

Language:PythonLicense:Apache-2.0Stargazers:2058Issues:55Issues:39

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language:PythonLicense:Apache-2.0Stargazers:1991Issues:19Issues:79

DeepSpeed-MII

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Language:PythonLicense:Apache-2.0Stargazers:1836Issues:41Issues:294

FinGLM

FinGLM: 致力于构建一个开放的、公益的、持久的金融大模型项目,利用开源开放来促进「AI+金融」。

pyserini

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.

Language:PythonLicense:Apache-2.0Stargazers:1618Issues:18Issues:540

yarn

YaRN: Efficient Context Window Extension of Large Language Models

Language:PythonLicense:MITStargazers:1294Issues:14Issues:55

bigscience

Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.

Language:ShellLicense:NOASSERTIONStargazers:971Issues:38Issues:19

HuggingFace-Download-Accelerator

利用HuggingFace的官方下载工具从镜像网站进行高速下载。

papermage

library supporting NLP and CV research on scientific papers

Language:PythonLicense:Apache-2.0Stargazers:662Issues:9Issues:32

Chinese-Mixtral-8x7B

中文Mixtral-8x7B(Chinese-Mixtral-8x7B)

Language:PythonLicense:Apache-2.0Stargazers:636Issues:15Issues:28

Pai-Megatron-Patch

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Language:PythonLicense:Apache-2.0Stargazers:626Issues:9Issues:125

Wandb_Tutorial

How to use wandb?

Language:PythonLicense:Apache-2.0Stargazers:580Issues:3Issues:1

DISC-LawLLM

DISC-LawLLM, an intelligent legal system utilizing large language models (LLMs) to provide a wide range of legal services

Language:PythonLicense:Apache-2.0Stargazers:503Issues:10Issues:48

LEval

[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark

Language:PythonLicense:GPL-3.0Stargazers:336Issues:4Issues:17
Language:RustLicense:Apache-2.0Stargazers:280Issues:32Issues:16

vllm-client

vLLM client with minimal dependencies

Language:PythonLicense:Apache-2.0Stargazers:7Issues:1Issues:0