Leonid Boytsov (searchivarius)

Company: This is my personal account

Location: Pittsburgh

Home Page: http://searchivarius.org/about

Twitter: @srchvrs


Organizations
nmslib
oaqa

Leonid Boytsov's repositories

PyFastPFor

Python bindings for the fast integer compression library FastPFor.

Language: C++ | License: Apache-2.0 | Stargazers: 57 | Issues: 3 | Issues: 0

AccurateLuceneBM25

Improving the effectiveness of Lucene's BM25 (and testing it using Yahoo! Answers and Stack Overflow collections)

PermTest

Permutation algorithms to test statistical significance of experimental results.

BlogCode

Code used in Leonid Boytsov's blog: http://searchivarius.org

EphyraQuestionAnalysis

A collection of OpenEphyra components necessary for question analysis

inpars_light

Scripts to reproduce the InPars light paper

License: Apache-2.0 | Stargazers: 3 | Issues: 1 | Issues: 0

pytorch-pretrained-BERT-mod

A slightly modified copy of an older version of the transformer library pytorch-pretrained-BERT

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2 | Issues: 2 | Issues: 0

clearnlp-clearnlp-2.0.2.mod

A patched clearnlp 2.0.2

Language: Java | License: NOASSERTION | Stargazers: 1 | Issues: 1 | Issues: 0

OpenNMT-py

Open Source Neural Machine Translation in PyTorch

Language: Python | License: MIT | Stargazers: 1 | Issues: 2 | Issues: 0

XMLIterator

SAX sux: an XMLIterator solution for XML documents with iterative structure.

Language: Java | Stargazers: 1 | Issues: 1 | Issues: 0

accelerate

🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

ann-benchmarks

Benchmarks of approximate nearest neighbor libraries in Python

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

annographix

Structured information retrieval using SOLR (archival version)

Language: Java | Stargazers: 0 | Issues: 1 | Issues: 0

anserini

A Lucene toolkit for replicable information retrieval research

Language: Java | Stargazers: 0 | Issues: 1 | Issues: 0

DeepNLP-models-Pytorch

PyTorch implementations of various Deep NLP models in cs-224n (Stanford Univ)

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

fastscancount

Fast implementations of the scancount algorithm: C++ header-only library

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

medline-query-with-entities

pubmed-query-with-entities

Language: Java | Stargazers: 0 | Issues: 1 | Issues: 0

metric-learn

Metric learning algorithms in Python

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

mgiza

A word alignment tool based on the famous GIZA++, extended to support multi-threading, resuming training, and incremental training.

Language: C++ | Stargazers: 0 | Issues: 1 | Issues: 0

MSMARCO-Document-Ranking-Submissions

Submission archive for the MS MARCO document ranking leaderboard

Language: Python | License: CC-BY-4.0 | Stargazers: 0 | Issues: 0 | Issues: 0

MSMARCO-Passage-Ranking-Submissions

Submission archive for the MS MARCO passage ranking leaderboard

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

n2

TOROS N2 - lightweight approximate Nearest Neighbor library which runs faster even with large datasets

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

pystruct

Simple structured learning framework for Python

Language: Python | License: BSD-2-Clause | Stargazers: 0 | Issues: 1 | Issues: 0

sparse_text_util

A Python writer for a nearly SVMLight format (but without the class label)

Language: C++ | Stargazers: 0 | Issues: 1 | Issues: 0

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

wikiextractor

A tool for extracting plain text from Wikipedia dumps

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 1 | Issues: 0