callingforhelp / cord-19-search

Vespa application making an index of the CORD-19 dataset.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Vespa Cloud logo

Vespa Cloud sample applications - CORD-19 Search

Vespa application creating an index of the CORD-19 dataset.

Continuous deployment to the Vespa Cloud

Demonstrated Vespa features

  • Accelerated retrieval using weakAnd
  • Custom ranking, both BM25, and advanced neural ColBERT
  • Document and query time inference using language models (ColBERT)
  • Vespa Grouping to allow users to drill down into the result set
  • Vespa approximate nearest neighbor search for similar articles functionality
  • Vespa de-duping, using model inference to compute N * N document to document similarity, at query time to eliminate near duplicates
$ curl "https://api.cord19.vespa.ai/search/?query=sars-cov-2"

About

Vespa application making an index of the CORD-19 dataset.

License:Apache License 2.0


Languages

Language:Jupyter Notebook 81.4%Language:Java 15.9%Language:Python 2.4%Language:Shell 0.2%