Jac Zhao's repositories
awesome-self-supervised-learning
A curated list of awesome self-supervised methods
nlp-tutorial
Natural Language Processing Tutorial for Deep Learning Researchers
CloserLookFewShot
Source code for the ICLR'19 paper "A Closer Look at Few-shot Classification"
cmrc2019
The Third Evaluation Workshop on Chinese Machine Reading Comprehension (CMRC 2019)
ColossalAI
Making large AI models cheaper, faster and more accessible
DialogVED
Code and released pre-trained model for our ACL 2022 paper: "DialogVED: A Pre-trained Latent Variable Encoder-Decoder Model for Dialog Response Generation"
Enterprise-Registration-Data-of-Chinese-Mainland
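A dataset of over 10,000,000 enterprise registration records from 31 provinces in mainland China, covering 1978 to 2019. Each record includes details such as enterprise name, registered address, unified social credit code, region, registration date, business scope, legal representative, registered capital, and enterprise type. Keywords: business registration big data, enterprise information, enterprise registration data.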
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
GLM
GLM (General Language Model)
Global-Encoding
Global Encoding for Abstractive Summarization (ACL 2018)
Langchain-Chatchat
Langchain-Chatchat (formerly Langchain-ChatGLM): a local-knowledge-based RAG and Agent application built with Langchain and language models such as ChatGLM, Qwen, and Llama
Machine-Learning
Principles of machine learning
minimalRL
Implementations of basic RL algorithms with minimal lines of code! (PyTorch-based)
nlp_chinese_corpus
Large-scale Chinese corpus for NLP
NREPapers
Must-read papers on neural relation extraction (NRE)
OKD-Reading-List
Papers for Open Knowledge Discovery
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
SpecAugment
An implementation of SpecAugment (introduced by Google Brain) with TensorFlow & PyTorch
THUIARWeb
All files used to build the official website of THUIAR.
weightagnostic.github.io
nothing to see here yet
xlnet
XLNet: Generalized Autoregressive Pretraining for Language Understanding