Sotaro Takeshita / 竹下 颯太郎's starred repositories
Perplexica
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
semantic-grep
grep for words with similar meaning to the query
DSI-transformers
A huggingface transformers implementation of "Transformer Memory as a Differentiable Search Index"
GenIR-Survey
This is the repository for the GenIR survey.
summary-of-a-haystack
Codebase accompanying the Summary of a Haystack paper.
mbr-anomaly
Code for "On the True Distribution Approximation of Minimum Bayes-Risk Decoding," NAACL 2024
py-setproctitle
A Python module to customize the process title
improved-t5
Experiments for efforts to train a new and improved t5
tasksource
Datasets collection and preprocessings framework for NLP extreme multitask learning
ocr-fileformat
Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)