alephdata / aleph

Search and browse documents and data; find the people and companies you look for.

Home Page: http://docs.aleph.occrp.org

FEATURE: Use Elasticsearch multi-search to fetch xref candidates in parallel

tillprochaska opened this issue

Is your feature request related to a problem? Please describe.
When computing xref matches for a collection, Aleph sends each candidate query to Elasticsearch as its own separate HTTP request to find entities that could be similar.

Describe the solution you'd like
We should batch several of those queries into a single HTTP request to the Elasticsearch multi-search endpoint (`_msearch`). This lets Elasticsearch process the queries in parallel and reduces network overhead, because fewer separate HTTP requests are sent.

The batch size should be configurable.
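A minimal sketch of how this batching could look, not Aleph's actual implementation. The names `batched`, `build_msearch_body`, `iter_candidates`, `build_query`, and `send_msearch` are hypothetical, as is the default batch size of 50. The sketch assumes only the documented `_msearch` format: a newline-delimited body with one header line and one query line per search, and a `responses` array returned in request order.

```python
import json
from itertools import islice
from typing import Any, Callable, Dict, Iterable, Iterator, List, Tuple

Query = Dict[str, Any]


def batched(items: Iterable[Any], size: int) -> Iterator[List[Any]]:
    """Yield consecutive lists of at most `size` items."""
    if size < 1:
        raise ValueError("batch size must be at least 1")
    iterator = iter(items)
    while True:
        batch = list(islice(iterator, size))
        if not batch:
            return
        yield batch


def build_msearch_body(index: str, queries: Iterable[Query]) -> str:
    """Serialize queries as NDJSON: a header line, then a body line, per search."""
    lines = []
    for query in queries:
        lines.append(json.dumps({"index": index}))
        lines.append(json.dumps(query))
    # The multi-search body must end with a newline.
    return "\n".join(lines) + "\n"


def iter_candidates(
    entities: Iterable[Any],
    build_query: Callable[[Any], Query],    # hypothetical: entity -> search body
    send_msearch: Callable[[str], Dict[str, Any]],  # hypothetical: POSTs to _msearch
    index: str,
    batch_size: int = 50,                   # assumed default; meant to be configurable
) -> Iterator[Tuple[Any, List[Dict[str, Any]]]]:
    """Yield (entity, hits) pairs, sending one HTTP request per batch of entities."""
    for batch in batched(entities, batch_size):
        body = build_msearch_body(index, [build_query(e) for e in batch])
        response = send_msearch(body)
        # Responses come back in the same order as the queries in the request.
        for entity, result in zip(batch, response["responses"]):
            # A failed search carries an "error" key instead of "hits".
            yield entity, result.get("hits", {}).get("hits", [])
```

With this shape, `batch_size` could be read from a setting or environment variable, and `send_msearch` could wrap whichever Elasticsearch client call the project already uses. Larger batches mean fewer round trips but larger requests and responses, so the right value likely depends on the deployment.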

Describe alternatives you've considered
-/-

Additional context
-/-