yongtso

Company:Tibet University

Location:TIBET

Home Page:yongtso@163.com

yongtso's repositories

TSTD

Tibetan Sentiment Tweets Dataset

Stargazers:1 Issues:0

C0A2DD042

A parallel corpus to train machine translation models

Stargazers:0 Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

License:MIT Stargazers:0 Issues:0

WeiboSpider_SentimentAnalysis

Uses Python to scrape Weibo data and runs sentiment analysis on the scraped data

Stargazers:0 Issues:0

sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

License:Apache-2.0 Stargazers:0 Issues:0

vietnamese_spelling_error_correction

Detects misspelled words with an LSTM and replaces them using the XLM-R masked language model

License:MIT Stargazers:0 Issues:0

SentiWordNet

The SentiWordNet sentiment lexicon

Stargazers:0 Issues:0

bert

TensorFlow code and pre-trained models for BERT

License:Apache-2.0 Stargazers:0 Issues:0

nlp-beginner

A getting-started tutorial for NLP

Stargazers:0 Issues:0

SentiLARE

Code for the paper "SentiLARE: Sentiment-Aware Language Representation Learning with Linguistic Knowledge" (EMNLP 2020)

Stargazers:0 Issues:0

covost

CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)

License:NOASSERTION Stargazers:0 Issues:0

bertviz

Tool for visualizing attention in the Transformer model (BERT, GPT-2, Albert, XLNet, RoBERTa, CTRL, etc.)

License:Apache-2.0 Stargazers:0 Issues:0

ml-visuals

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

License:MIT Stargazers:0 Issues:0

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

License:Apache-2.0 Stargazers:0 Issues:0

examples

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

License:BSD-3-Clause Stargazers:0 Issues:0

Bo-Eng-Machine-Transation

Tibetan to English Machine Translation

Stargazers:0 Issues:0

PhoBERT

PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)

License:MIT Stargazers:0 Issues:0

bonlp-dataset

Tibetan NLP training dataset for various NLP tasks

Stargazers:0 Issues:0

LOTClass

[EMNLP 2020] Text Classification Using Label Names Only: A Language Model Self-Training Approach

License:Apache-2.0 Stargazers:0 Issues:0

bonltk

BoNLTK aims to provide out-of-the-box support for the various NLP tasks that an application developer might need for Bokey, the Tibetan language.

License:Apache-2.0 Stargazers:0 Issues:0

CharBERT

CharBERT: Character-aware Pre-trained Language Model (COLING2020)

License:Apache-2.0 Stargazers:0 Issues:0

fastText

Library for fast text representation and classification.

License:MIT Stargazers:0 Issues:0

albert_zh

A Lite BERT for Self-Supervised Learning of Language Representations; large-scale Chinese pre-trained ALBERT models

Stargazers:0 Issues:0

botok

🏷 བོད་རྟོགས། [pʰøtɔk̚] Tibetan word tokenizer in Python

License:Apache-2.0 Stargazers:0 Issues:0

namsel

An OCR application focused on machine-print Tibetan text

License:MIT Stargazers:0 Issues:0

lstm_next_sequence_prediction

Implements a recurrent neural network and a long short-term memory network from scratch, without frameworks

Stargazers:0 Issues:0

MUSE

A library for Multilingual Unsupervised or Supervised word Embeddings

License:NOASSERTION Stargazers:0 Issues:0

BabelNet-Sememe-Prediction

Code and data of the AAAI-20 paper "Towards Building a Multilingual Sememe Knowledge Base: Predicting Sememes for BabelNet Synsets"

License:MIT Stargazers:0 Issues:0