Shintaro Harada's starred repositories

mbrs

A library for minimum Bayes risk (MBR) decoding

Language:PythonLicense:MITStargazers:27Issues:0Issues:0

gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Language:PythonLicense:Apache-2.0Stargazers:6925Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:2673Issues:0Issues:0

seqio

Task-based datasets, preprocessing, and evaluation for sequence models.

Language:PythonLicense:Apache-2.0Stargazers:558Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:177Issues:0Issues:0
Language:PythonLicense:MITStargazers:46Issues:0Issues:0

trans-encoder

Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations

Language:PythonLicense:Apache-2.0Stargazers:133Issues:0Issues:0

PromCSE

Code for "Improved Universal Sentence Embeddings with Prompt-based Contrastive Learning and Energy-based Learning (EMNLP 2022)"

Language:PythonStargazers:134Issues:0Issues:0

DiffCSE

Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"

Language:PythonLicense:MITStargazers:290Issues:0Issues:0

kwja

An integrated Japanese analyzer based on foundation models

Language:PythonLicense:MITStargazers:129Issues:0Issues:0

SentEval

A python tool for evaluating the quality of sentence embeddings.

Language:PythonLicense:NOASSERTIONStargazers:2087Issues:0Issues:0

orbax

Orbax provides common checkpointing and persistence utilities for JAX users

Language:PythonLicense:Apache-2.0Stargazers:296Issues:0Issues:0

n-grammer-flax

Implementation of N-Grammer in Flax

Language:PythonLicense:Apache-2.0Stargazers:16Issues:0Issues:0

jax-rl

JAX implementations of core Deep RL algorithms

Language:PythonLicense:MITStargazers:79Issues:0Issues:0

ott

Optimal transport tools implemented with the JAX framework, to get differentiable, parallel and jit-able computations.

Language:PythonLicense:Apache-2.0Stargazers:521Issues:0Issues:0

Contrastive-Learning-NLP-Papers

Paper List for Contrastive Learning for Natural Language Processing

Stargazers:539Issues:0Issues:0

konoha

🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.

Language:PythonLicense:MITStargazers:230Issues:0Issues:0

jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Language:PythonLicense:Apache-2.0Stargazers:30402Issues:0Issues:0

daachorse

🐎 A fast implementation of the Aho-Corasick algorithm using the compact double-array data structure in Rust.

Language:RustLicense:Apache-2.0Stargazers:201Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:3Issues:0Issues:0

TUPE

Transformer with Untied Positional Encoding (TUPE). Code of paper "Rethinking Positional Encoding in Language Pre-training". Improve existing models like BERT.

Language:PythonLicense:MITStargazers:250Issues:0Issues:0

attach-juxtapose-parser

Code for the paper "Strongly Incremental Constituency Parsing with Graph Neural Networks"

Language:PythonLicense:BSD-2-ClauseStargazers:34Issues:0Issues:0

vaporetto

🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer

Language:RustLicense:Apache-2.0Stargazers:229Issues:0Issues:0

fairseq-tagging

a Fairseq fork for sequence tagging/labeling tasks

Language:PythonLicense:MITStargazers:31Issues:0Issues:0

compare-mt

A tool for holistic analysis of language generations systems

Language:PythonLicense:BSD-3-ClauseStargazers:465Issues:0Issues:0
Language:PythonStargazers:5Issues:0Issues:0

NLP-progress

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Language:PythonLicense:MITStargazers:22684Issues:0Issues:0

language-models-are-knowledge-graphs-pytorch

Language models are open knowledge graphs ( non official implementation )

License:MITStargazers:13Issues:0Issues:0

awd-lstm-lm

LSTM and QRNN Language Model Toolkit for PyTorch

Language:PythonLicense:BSD-3-ClauseStargazers:1959Issues:0Issues:0

PILM

Language model with phrase induction

Language:PythonLicense:BSD-3-ClauseStargazers:14Issues:0Issues:0