Yanshuang (YanShuang17)

YanShuang17

Geek Repo

Location:Shanghai,China

Github PK Tool:Github PK Tool

Yanshuang's starred repositories

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:54662Issues:516Issues:942

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:27820Issues:188Issues:4393

docker_practice

Learn and understand Docker&Container technologies, with real DevOps practice!

jina

☁️ Build multimodal AI applications with cloud-native stack

Language:PythonLicense:Apache-2.0Stargazers:20655Issues:210Issues:1940

unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonLicense:Apache-2.0Stargazers:13168Issues:91Issues:631

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language:PythonLicense:Apache-2.0Stargazers:12671Issues:80Issues:820

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:10831Issues:88Issues:299

QAnything

Question and Answer based on Anything.

Language:PythonLicense:Apache-2.0Stargazers:10827Issues:97Issues:346

Chinese-LLaMA-Alpaca-2

中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)

Language:PythonLicense:Apache-2.0Stargazers:7019Issues:77Issues:387

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language:PythonLicense:MITStargazers:6206Issues:38Issues:893

text2vec

text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。

Language:PythonLicense:Apache-2.0Stargazers:4295Issues:30Issues:146

Baichuan2

A series of large language models developed by Baichuan Intelligent Technology

Language:PythonLicense:Apache-2.0Stargazers:4041Issues:40Issues:387

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language:PythonLicense:Apache-2.0Stargazers:3535Issues:33Issues:1157

SimCSE

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Language:PythonLicense:MITStargazers:3325Issues:27Issues:265

BERT-NER-Pytorch

Chinese NER(Named Entity Recognition) using BERT(Softmax, CRF, Span)

Language:PythonLicense:MITStargazers:2040Issues:13Issues:104

EasyNLP

EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

Language:PythonLicense:Apache-2.0Stargazers:2011Issues:36Issues:122

entity-recognition-datasets

A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.

Language:PythonLicense:MITStargazers:1464Issues:41Issues:13

Awesome-Text2SQL

Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.

BCEmbedding

Netease Youdao's open-source embedding and reranker models for RAG products.

Language:PythonLicense:Apache-2.0Stargazers:1239Issues:9Issues:68

OpenBuddy

Open Multilingual Chatbot for Everyone

pymilvus

Python SDK for Milvus.

Language:PythonLicense:Apache-2.0Stargazers:955Issues:19Issues:832

rank_bm25

A Collection of BM25 Algorithms in Python

Language:PythonLicense:Apache-2.0Stargazers:930Issues:10Issues:31

Chinese-LlaMA2

Repo for adapting Meta LlaMA2 in Chinese! META最新发布的LlaMA2的汉化版! (完全开源可商用)

LongBench

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding

Language:PythonLicense:MITStargazers:556Issues:6Issues:63

EasyContext

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Language:PythonLicense:Apache-2.0Stargazers:555Issues:9Issues:36

PIXIU

This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artificial intelligence (AI).

Language:Jupyter NotebookLicense:MITStargazers:465Issues:7Issues:9

RAG-Retrieval

Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT,Cross Encoder

Language:PythonLicense:MITStargazers:376Issues:6Issues:22

FinEval

FinEval是一个中文金融领域高质量多项选择与文本问答题的集合。

Language:PythonLicense:Apache-2.0Stargazers:146Issues:3Issues:5

NER-Pytorch-Chinese

Implemention of NER model on chinese dataset.

Language:PythonLicense:MITStargazers:61Issues:4Issues:9