madehong

Company: Peking University

Location: Beijing

madehong's repositories

Seq2Seq4ATE

Code for the paper "Exploring Sequence-to-Sequence Learning for Aspect Term Extraction".

WuDaoCorpus

The largest Chinese corpus in the world to date

bert-finetune

Code for fine-tuning BERT on various tasks.

Language: Python | Stargazers: 1 | Issues: 0 | Issues: 0

ALBERT

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

madehong.github.io

Code for my homepage.

Language: HTML | Stargazers: 0 | Issues: 2 | Issues: 0

alpaca-chinese-dataset

Alpaca Chinese instruction fine-tuning dataset

Stargazers: 0 | Issues: 0 | Issues: 0

Alpaca-CoT

We extend CoT data to Alpaca to boost its reasoning ability. We are constantly expanding our collection of instruction-tuning data. The instruction collection can be found at https://huggingface.co/datasets/QingyiSi/Alpaca-CoT/tree/main

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

BELLE-prompt

BELLE: Bloom-Enhanced Large Language model Engine (open-source Chinese dialogue large language model, 7 billion parameters)

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

CDial-GPT

A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Chinese-alpaca-lora

Luotuo (骆驼): A Chinese instruction-finetuned LLaMA. Developed by 陈启源 @ Central China Normal University & 李鲁鲁 @ SenseTime & 冷子昂 @ SenseTime

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

License: AGPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

CLUECorpus2020

Large-scale pre-training corpus for Chinese (100 GB)

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

ERNIE2Pytorch

ERNIE Pytorch Version

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

fast-bert

Super easy library for BERT based NLP models

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

GPT2-Chinese

Chinese version of GPT2 training code, using BERT tokenizer.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

LSH_Attention

Calculate the softmax layer of attention in O(L log L) (L = sequence length) instead of O(L^2) using polytope Locality-Sensitive Hashing (https://arxiv.org/abs/1802.05751).

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

NeuroNLP2

Deep neural models for core NLP tasks (PyTorch version)

License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

nlp_chinese_corpus

Large Scale Chinese Corpus for NLP

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

pkuthss

LaTeX template for dissertations in Peking University

Stargazers: 0 | Issues: 0 | Issues: 0

pyGAT

PyTorch implementation of the Graph Attention Network model by Veličković et al. (2017, https://arxiv.org/abs/1710.10903)

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

RLHF

Implementation of Chinese ChatGPT

Stargazers: 0 | Issues: 0 | Issues: 0

SemBERT

Semantics-aware BERT for Language Understanding (AAAI 2020)

Stargazers: 0 | Issues: 0 | Issues: 0

SG-Net

AAAI2020: SG-Net: Syntax-guided machine reading comprehension

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

Statistical-Learning-Methods

Implement Li Hang's Statistical Learning Methods the hard way: a hardcore Python implementation of the book 《统计学习方法》.

Stargazers: 0 | Issues: 0 | Issues: 0

summarize-from-feedback-RL

Code for "Learning to summarize from human feedback"

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

Tencent2020_Rank1st

Code for the 2020 Tencent College Algorithm Contest; the online result ranked 1st.

Stargazers: 0 | Issues: 0 | Issues: 0

transformer-xl-chinese

An attempt at Chinese text generation with Transformer-XL (can write novels and classical poetry)

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

TwinBert

PyTorch implementation of the TwinBert paper

Stargazers: 0 | Issues: 0 | Issues: 0

vimrc

The ultimate Vim configuration (vimrc)

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0