Yanshuang (YanShuang17)

Location: Shanghai, China

Yanshuang's starred repositories

llama

Inference code for Llama models

Language: Python | License: NOASSERTION | Stargazers: 53595 | Issues: 509 | Issues: 923

docker_practice

Learn and understand Docker & container technologies, with real DevOps practice!

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language: Python | License: Apache-2.0 | Stargazers: 22861 | Issues: 156 | Issues: 3562

sentence-transformers

Multilingual Sentence & Image Embeddings with BERT

Language: Python | License: Apache-2.0 | Stargazers: 14030 | Issues: 132 | Issues: 1963

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supports a number of inference solutions, such as HF TGI and vLLM, for local or cloud deployment. Includes demo apps that showcase Meta Llama3 for WhatsApp & Messenger.

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 9834 | Issues: 81 | Issues: 279

starcoder

Home of StarCoder: fine-tuning & inference!

Language: Python | License: Apache-2.0 | Stargazers: 7144 | Issues: 68 | Issues: 140

Chinese-LLaMA-Alpaca-2

Phase two of the Chinese LLaMA-2 & Alpaca-2 LLM project, plus 64K long-context models

Language: Python | License: Apache-2.0 | Stargazers: 6932 | Issues: 75 | Issues: 381

ChatLaw

Chinese legal large language model

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language: Python | License: MIT | Stargazers: 5326 | Issues: 33 | Issues: 740

Firefly

Firefly: a large-model training tool that supports training Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models

ToolBench

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language models for tool learning.

Language: Python | License: Apache-2.0 | Stargazers: 4490 | Issues: 49 | Issues: 258

text2vec

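text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models such as Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.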

Language: Python | License: Apache-2.0 | Stargazers: 4133 | Issues: 28 | Issues: 146

Baichuan2

A series of large language models developed by Baichuan Intelligent Technology

Language: Python | License: Apache-2.0 | Stargazers: 3972 | Issues: 40 | Issues: 384

simpleaichat

Python package for easily interfacing with chat apps, with robust features and minimal code complexity.

Language: Python | License: MIT | Stargazers: 3402 | Issues: 37 | Issues: 81

SimCSE

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Language: Python | License: MIT | Stargazers: 3272 | Issues: 27 | Issues: 264

BMTools

Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins

Language: Python | License: Apache-2.0 | Stargazers: 2851 | Issues: 35 | Issues: 37

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language: Python | License: Apache-2.0 | Stargazers: 2638 | Issues: 26 | Issues: 826

Chinese-LangChain

Chinese LangChain project | 小必应, Q.Talk, 强聊, QiangTalk

BERT-NER-Pytorch

Chinese NER(Named Entity Recognition) using BERT(Softmax, CRF, Span)

Language: Python | License: MIT | Stargazers: 1997 | Issues: 13 | Issues: 104

EasyNLP

EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 1961 | Issues: 36 | Issues: 121

mteb

MTEB: Massive Text Embedding Benchmark

Language: Python | License: Apache-2.0 | Stargazers: 1483 | Issues: 8 | Issues: 246

entity-recognition-datasets

A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.

Language: Python | License: MIT | Stargazers: 1442 | Issues: 41 | Issues: 12

OpenBuddy

Open Multilingual Chatbot for Everyone

uniem

unified embedding model

Language: Python | License: Apache-2.0 | Stargazers: 764 | Issues: 7 | Issues: 98

Chinese-LlaMA2

Repo for adapting Meta LlaMA2 to Chinese! A Chinese-localized version of Meta's newly released LlaMA2 (fully open source and available for commercial use).

LongBench

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding

Language: Python | License: MIT | Stargazers: 492 | Issues: 6 | Issues: 58

PIXIU

This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artificial intelligence (AI).

Language: Jupyter Notebook | License: MIT | Stargazers: 416 | Issues: 8 | Issues: 9

Visual-Chinese-LLaMA-Alpaca

Multimodal Chinese LLaMA & Alpaca large language model (VisualCLA)

Language: Python | License: Apache-2.0 | Stargazers: 374 | Issues: 9 | Issues: 12

FinEval

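FinEval is a collection of high-quality multiple-choice questions covering finance, economics, accounting, and professional certification.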

Language: Python | License: Apache-2.0 | Stargazers: 133 | Issues: 3 | Issues: 5

NER-Pytorch-Chinese

Implementation of an NER model on a Chinese dataset.

Language: Python | License: MIT | Stargazers: 60 | Issues: 4 | Issues: 9