xiaj1011

xiaj1011's starred repositories

Megatron-LLaMA

Best practice for training LLaMA models in Megatron-LM

Language: Python | License: NOASSERTION | Stargazers: 572 | Issues: 0

Llama-Chinese

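Llama Chinese community. The Llama3 online demo and fine-tuned models are now available, the latest Llama3 learning resources are aggregated in real time, and all code has been updated for Llama3. The goal is to build the best Chinese Llama large model, fully open source and available for commercial use.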

Language: Python | Stargazers: 12995 | Issues: 0
Language: Python | License: MIT | Stargazers: 673 | Issues: 0

OpenQA-eval

ACL 2023: Evaluating Open-Domain Question Answering in the Era of Large Language Models

Language: Python | License: MIT | Stargazers: 36 | Issues: 0

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Language: Python | License: Apache-2.0 | Stargazers: 129581 | Issues: 0

LLMZoo

⚡LLM Zoo is a project that provides data, models, and an evaluation benchmark for large language models.⚡

Language: Python | License: Apache-2.0 | Stargazers: 2907 | Issues: 0

lit-llama

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Language: Python | License: Apache-2.0 | Stargazers: 5898 | Issues: 0

RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Language: Python | License: Apache-2.0 | Stargazers: 4460 | Issues: 0

DeepSpeedExamples

Example models using DeepSpeed

Language: Python | License: Apache-2.0 | Stargazers: 5862 | Issues: 0

transpeeder

Train LLaMA on a single A100 80G node using 🤗 Transformers and 🚀 DeepSpeed pipeline parallelism

Language: Python | License: Apache-2.0 | Stargazers: 206 | Issues: 0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language: Python | License: Apache-2.0 | Stargazers: 29183 | Issues: 0

evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Language: Python | License: NOASSERTION | Stargazers: 14411 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 9018 | Issues: 0

awesome-chatgpt

Curated list of awesome tools, demos, docs for ChatGPT and GPT-3

Stargazers: 8181 | Issues: 0

MNBVC

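MNBVC (Massive Never-ending BT Vast Chinese corpus) is a very large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) internet slang. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripted dialogue, forum posts, wiki, classical poetry, song lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.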

License: MIT | Stargazers: 3240 | Issues: 0

PyChatGPT

⚡️ Python client for the unofficial ChatGPT API with auto token regeneration, conversation tracking, proxy support and more.

Language: Python | License: MIT | Stargazers: 4225 | Issues: 0

AESOP

Code for Aesop: Paraphrase Generation with Adaptive Syntactic Control (EMNLP 2021)

Language: Python | Stargazers: 27 | Issues: 0

quality-controlled-paraphrase-generation

Quality Controlled Paraphrase Generation (ACL 2022)

Language: Python | License: Apache-2.0 | Stargazers: 68 | Issues: 0

cyac

High-performance Trie and Aho-Corasick automaton (AC automaton) keyword match and replace tool for Python

Language: Cython | License: MIT | Stargazers: 94 | Issues: 0

AhoCorasickDoubleArrayTrie

An extremely fast implementation of the Aho-Corasick algorithm based on a Double Array Trie.

Language: Java | Stargazers: 938 | Issues: 0

CODER

CODER: Knowledge infused cross-lingual medical term embedding for term normalization. [JBI, ACL-BioNLP 2022]

Language: Python | Stargazers: 71 | Issues: 0

BERT-NER-Pytorch

Chinese NER (Named Entity Recognition) using BERT (Softmax, CRF, Span)

Language: Python | License: MIT | Stargazers: 2035 | Issues: 0

Snorkel-NER

Named Entity Recognition using Snorkel

Language: Jupyter Notebook | Stargazers: 9 | Issues: 0

pyahocorasick

Python module (C extension and plain Python) implementing the Aho-Corasick algorithm

Language: C | License: BSD-3-Clause | Stargazers: 916 | Issues: 0

awesome-knowledge-graph

A curated collection of learning resources on knowledge graphs

Stargazers: 4459 | Issues: 0

NLP_ability

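A summary of the knowledge a natural language processing (NLP) engineer needs to build up, including interview questions, fundamentals, and engineering skills, to strengthen core competitiveness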

Language: Python | Stargazers: 6500 | Issues: 0

CCKS2019-CKBQA

A system for CCKS2019-CKBQA; the single system reaches 0.69 and the ensemble system reaches 0.73

Language: Jupyter Notebook | Stargazers: 41 | Issues: 0

ccks2019-ckbqa-4th-codes

Code for Chinese knowledge base question answering: the fourth-place solution in the CCKS2019 CKBQA evaluation

Language: Python | Stargazers: 475 | Issues: 0

DeepEventMine

DeepEventMine: End-to-end Neural Nested Event Extraction from Biomedical Texts

Language: Python | License: Apache-2.0 | Stargazers: 94 | Issues: 0

GEANet-BioMed-Event-Extraction

Code for the paper Biomedical Event Extraction with Hierarchical Knowledge Graphs

Language: Python | License: MIT | Stargazers: 59 | Issues: 0