PISA (pisa-engine)

Performant Indexes and Search for Academia

PISA's repositories

pisa

PISA: Performant Indexes and Search for Academia

Language: C++ | License: Apache-2.0 | Stargazers: 880 | Issues: 23 | Issues: 210

ciff

The inverted index exchange format defined as part of the Open-Source IR Replicability Challenge (OSIRRC) initiative

Language: Rust | License: Apache-2.0 | Stargazers: 8 | Issues: 4 | Issues: 8

Porter2

Porter2 stemming library

Language: C++ | License: Apache-2.0 | Stargazers: 5 | Issues: 3 | Issues: 2

pypisa

A Python interface to the PISA IR engine

Language: Python | License: Apache-2.0 | Stargazers: 3 | Issues: 5 | Issues: 2

ecir19-bisection

Experiments for "Compressing Inverted Indexes with Recursive Graph Bisection: A Reproducibility Study".

Language: C++ | Stargazers: 2 | Issues: 5 | Issues: 0

taily

Implementation of the Taily algorithm as described by Aly et al. in the 2013 paper "Taily: shard selection using the tail of score distributions."

Language: C++ | License: MIT | Stargazers: 2 | Issues: 3 | Issues: 4

topk-threshold-estimation

Experiments for "A Comparison of Top-k Threshold Estimation Techniques for Disjunctive Query Processing"

License: Apache-2.0 | Stargazers: 2 | Issues: 5 | Issues: 0

warcpp

A C++ parser for the Web Archive (WARC) format.

Language: C++ | License: Apache-2.0 | Stargazers: 2 | Issues: 4 | Issues: 3

accumulator

Benchmarking several score accumulators used in IR

Language: C++ | Stargazers: 1 | Issues: 5 | Issues: 0

BMP

Faster Learned Sparse Retrieval with Block-Max Pruning. ACM SIGIR 2024.

Stargazers: 1 | Issues: 0 | Issues: 0

KrovetzStemmer

Krovetz stemming library

Language: C++ | License: Apache-2.0 | Stargazers: 1 | Issues: 3 | Issues: 1

raxpp

C++ bindings for rax: https://github.com/antirez/rax

Language: CMake | License: Apache-2.0 | Stargazers: 1 | Issues: 5 | Issues: 3

Language: C++ | License: Apache-2.0 | Stargazers: 1 | Issues: 4 | Issues: 0

docker

Docker image for PISA

Language: Dockerfile | License: Apache-2.0 | Stargazers: 0 | Issues: 4 | Issues: 1

trecpp

A C++ parser for the TREC document format.

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 5 | Issues: 1

wapopp

A C++ parser for the Washington Post (WaPo) format.

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 4 | Issues: 0

ciff-hub

Hosting some useful CIFFs

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

mln

An implementation of the Most-Likely-Next algorithm

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0

nyt-corpus-reader

A parser and MongoDB-backed store for searching the New York Times Annotated Corpus (LDC2008T19)

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 3 | Issues: 0

nytpp

A C++ parser for the New York Times (NYT) format.

License: Apache-2.0 | Stargazers: 0 | Issues: 4 | Issues: 0

Language: HTML | Stargazers: 0 | Issues: 3 | Issues: 0

pisa-jr

Minimal implementation of PISA in Rust

Language: Rust | License: Apache-2.0 | Stargazers: 0 | Issues: 4 | Issues: 0

pyciff

Python bindings for the CIFF library at https://github.com/pisa-engine/ciff

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 5 | Issues: 1

Language: Rust | Stargazers: 0 | Issues: 2 | Issues: 0

standard-benchmark

Standard speed regression test for PISA

Language: Rust | Stargazers: 0 | Issues: 4 | Issues: 19

trec-text-rs

TREC Text collection format parser

Language: Rust | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0