Jac Zhao's repositories
Langchain-Chatchat
Langchain-Chatchat (formerly Langchain-ChatGLM): a local-knowledge-based RAG and Agent application built with Langchain and language models such as ChatGLM, Qwen, and Llama
ColossalAI
Making large AI models cheaper, faster and more accessible
nlp_chinese_corpus
Large Scale Chinese Corpus for NLP
GLM
GLM (General Language Model)
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
DialogVED
Code and released pre-trained model for our ACL 2022 paper: "DialogVED: A Pre-trained Latent Variable Encoder-Decoder Model for Dialog Response Generation"
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
SpecAugment
An implementation of SpecAugment (introduced by Google Brain) with TensorFlow and PyTorch
OKD-Reading-List
Papers for Open Knowledge Discovery
NREPapers
Must-read papers on neural relation extraction (NRE)
THUIARWeb
All files used to build the official website of THUIAR.
awesome-self-supervised-learning
A curated list of awesome self-supervised methods
xlnet
XLNet: Generalized Autoregressive Pretraining for Language Understanding
weightagnostic.github.io
nothing to see here yet
Enterprise-Registration-Data-of-Chinese-Mainland
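A dataset of over 10,000,000 enterprise registration records from 31 provinces of mainland China, covering 1978 to 2019. Each record includes details such as company name, registered address, unified social credit code, region, registration date, business scope, legal representative, registered capital, and enterprise type. Keywords: industrial and commercial big data, enterprise information, enterprise registration data.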
Global-Encoding
Global Encoding for Abstractive Summarization (ACL 2018)
cmrc2019
The Third Evaluation Workshop on Chinese Machine Reading Comprehension (CMRC 2019)
minimalRL
Implementations of basic RL algorithms with minimal lines of code! (PyTorch based)
Machine-Learning
Principles of machine learning
CloserLookFewShot
Source code for the ICLR 2019 paper "A Closer Look at Few-shot Classification"
nlp-tutorial
Natural Language Processing Tutorial for Deep Learning Researchers