magic282's repositories
cnndm_acl18
Code to obtain the training data for the ACL 2018 paper "Neural Document Summarization by Jointly Learning to Score and Select Sentences"
PyTorch_seq2seq
Sequence-to-sequence model with attention, implemented in PyTorch
v2ray-core
A platform for building proxies to bypass network restrictions.
Automatic-Corpus-Generation
This repository is for the paper "A Hybrid Approach to Automatic Corpus Generation for Chinese Spelling Check"
CValues
Research on evaluating and aligning the values of Chinese large language models
DeepSpeedExamples
Example models using DeepSpeed
finetune-transformer-lm
Code and model for the paper "Improving Language Understanding by Generative Pre-Training"
GoogleScholarMap
A collection of Python scripts that generates a scholar's impact chart and helps create a map of countries from which the scholar is cited
NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
openbilibili-go-common
Reportedly this is the "bilibili website backend project source code" from https://github.com/openbilibili/go-common/, though I don't really know what it is.
OpenNMT-py
Open-Source Neural Machine Translation in PyTorch http://opennmt.net/
ria_news_dataset
"Rossiya Segodnya" news dataset
Safety-Prompts
Chinese safety prompts for evaluating and improving the safety of LLMs.
shadowsocks-libev
libev port of shadowsocks
TOEFL-Sentence-Insertion-Dataset
The TOEFL sentence insertion dataset used in InsertGNN.
wikiextractor
A tool for extracting plain text from Wikipedia dumps