Shunsuke Kanda's repositories
tongrams-rs
Rust library providing fast language model queries in compressed space
sif-embedding
Rust implementation of SIF and uSIF: Simple and fast sentence embedding
wordfreq-rs
Yet another Rust port of wordfreq
simplearrayhash
Just a fast hash table for string keys
abseil.github.io
Abseil documentation (abseil.io)
aho-corasick
A fast implementation of Aho-Corasick in Rust.
building-search-app-w-ml
Sample code repository for the book 『機械学習による検索ランキング改善ガイド』 (A Guide to Improving Search Ranking with Machine Learning)
esci-data
Shopping Queries Dataset: A Large-Scale ESCI Benchmark for Improving Product Search
kampersanda.github.io
Personal Homepage
lindera
A morphological analysis library.
ml-road
Machine Learning Resources, Practice and Research
rust-csv
A CSV parser for Rust, with Serde support.
rust-pcre2
High level Rust bindings to PCRE2.
SIF
Sentence embedding by the Smooth Inverse Frequency weighting scheme
SIF-1
Sentence embedding using Smooth Inverse Frequency weighting scheme
SimCSE
EMNLP'2021: SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
simple-simcse
A simple implementation of SimCSE
sudachi.rs
An official Sudachi clone in Rust 🦀
suffix
Fast suffix arrays for Rust (with Unicode support).
wordfreq
Access a database of word frequencies, in various natural languages.