xiaj1011's starred repositories
Megatron-LLaMA
Best practice for training LLaMA models in Megatron-LM
Llama-Chinese
Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用
OpenQA-eval
ACL 2023: Evaluating Open-Domain Question Answering in the Era of Large Language Models
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
DeepSpeedExamples
Example models using DeepSpeed
transpeeder
train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
awesome-chatgpt
Curated list of awesome tools, demos, docs for ChatGPT and GPT-3
quality-controlled-paraphrase-generation
Quality Controlled Paraphrase Generation (ACL 2022)
AhoCorasickDoubleArrayTrie
An extremely fast implementation of Aho Corasick algorithm based on Double Array Trie.
BERT-NER-Pytorch
Chinese NER(Named Entity Recognition) using BERT(Softmax, CRF, Span)
Snorkel-NER
Named Entity Recognition using Snorkel
pyahocorasick
Python module (C extension and plain python) implementing Aho-Corasick algorithm
awesome-knowledge-graph
整理知识图谱相关学习资料
NLP_ability
总结梳理自然语言处理工程师(NLP)需要积累的各方面知识,包括面试题,各种基础知识,工程能力等等,提升核心竞争力
CCKS2019-CKBQA
A system for CCKS2019-CKBQA, whose single system reach 0.69 and ensemble system reach 0.73
ccks2019-ckbqa-4th-codes
中文知识库问答代码,CCKS2019 CKBQA评测第四名解决方案
DeepEventMine
DeepEventMine: End-to-end Neural Nested Event Extraction from Biomedical Texts
GEANet-BioMed-Event-Extraction
Code for the paper Biomedical Event Extraction with Hierarchical Knowledge Graphs