Tobey Yang (TobeyYang)

TobeyYang

Geek Repo

Location:Beijing, China

Github PK Tool:Github PK Tool

Tobey Yang's starred repositories

DeCLUTR

The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to open an issue if you run into any trouble!

Language:PythonLicense:Apache-2.0Stargazers:377Issues:0Issues:0

HMNet

Official Implementation of "A Hierarchical Network for Abstractive Meeting Summarization with Cross-Domain Pretraining""

Language:PythonLicense:NOASSERTIONStargazers:78Issues:0Issues:0

text-to-text-transfer-transformer

Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

Language:PythonLicense:Apache-2.0Stargazers:6057Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1601Issues:0Issues:0

StyleDGPT

The code for ``STYLEDGPT: Stylized Response Generation with Pre-trained LanguageModels'' (Findings of EMNLP2020)

Language:PythonStargazers:21Issues:0Issues:0

datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Language:PythonLicense:Apache-2.0Stargazers:18838Issues:0Issues:0

tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Language:RustLicense:Apache-2.0Stargazers:8778Issues:0Issues:0

xla

Enabling PyTorch on XLA Devices (e.g. Google TPU)

Language:C++License:NOASSERTIONStargazers:2400Issues:0Issues:0

UER-py

Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo

Language:PythonLicense:Apache-2.0Stargazers:2947Issues:0Issues:0

CLUECorpus2020

Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料

License:MITStargazers:904Issues:0Issues:0

gdown.pl

Google Drive direct download of big files

Language:PerlLicense:GPL-3.0Stargazers:933Issues:0Issues:0

gpt2-ml

GPT2 for Multiple Languages, including pretrained models. GPT2 多语言支持, 15亿参数中文预训练模型

Language:PythonLicense:Apache-2.0Stargazers:1714Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:2025Issues:0Issues:0

S2S_Temp

Code for EMNLP2019 paper "Low-Resource Response Generation with Template Prior"

Language:PythonLicense:Apache-2.0Stargazers:13Issues:0Issues:0

ParlAI

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

Language:PythonLicense:MITStargazers:10452Issues:0Issues:0

DPR

Dense Passage Retriever - is a set of tools and models for open domain Q&A task.

Language:PythonLicense:NOASSERTIONStargazers:1674Issues:0Issues:0

releasing-research-code

Tips for releasing research code in Machine Learning (with official NeurIPS 2020 recommendations)

License:MITStargazers:2559Issues:0Issues:0

pigeonXT

🐦 Quickly annotate data from the comfort of your Jupyter notebook

Language:PythonLicense:Apache-2.0Stargazers:271Issues:0Issues:0

google-research

Google Research

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:33574Issues:0Issues:0

py-corenlp

Python wrapper for Stanford CoreNLP

Language:PythonStargazers:353Issues:0Issues:0

nlg-eval

Evaluation code for various unsupervised automated metrics for Natural Language Generation.

Language:PythonLicense:NOASSERTIONStargazers:1327Issues:0Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:130203Issues:0Issues:0

Yahoo-News-Dataset

Yahoo! news dataset of DeepCom (EMNLP2019)

Stargazers:17Issues:0Issues:0

awesome-deep-learning-papers

The most cited deep learning papers

Language:TeXStargazers:25320Issues:0Issues:0

awesome-text-generation

A curated list of recent models of text generation and application

Stargazers:497Issues:0Issues:0

papers

:paperclip: Summaries of papers on deep learning

Stargazers:571Issues:0Issues:0

texar

Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/

Language:PythonLicense:Apache-2.0Stargazers:2383Issues:0Issues:0

texar-pytorch

Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/

Language:PythonLicense:Apache-2.0Stargazers:745Issues:0Issues:0

sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

Language:C++License:Apache-2.0Stargazers:9916Issues:0Issues:0

xlnet

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Language:PythonLicense:Apache-2.0Stargazers:6166Issues:0Issues:0