ChatNoir (chatnoir-eu)

ChatNoir

chatnoir-eu

Geek Repo

ChatNoir Research Web Search Engine

Home Page:https://www.chatnoir.eu/

Github PK Tool:Github PK Tool

ChatNoir's repositories

chatnoir-resiliparse

A robust web archive analytics toolkit

Language:CythonLicense:Apache-2.0Stargazers:80Issues:9Issues:26

web-content-extraction-benchmark

Web Content Extraction Benchmark

Language:PythonLicense:Apache-2.0Stargazers:14Issues:4Issues:4

chatnoir2-indexer

ChatNoir Indexer

Language:JavaLicense:MITStargazers:9Issues:5Issues:0

chatnoir-copycat

CopyCat is a resource for deduplication in TREC-style experimental setups.

Language:ArcLicense:MITStargazers:8Issues:4Issues:1

chatnoir2-webclient

ChatNoir Web Frontend

Language:JavaLicense:MITStargazers:8Issues:5Issues:0

chatnoir-warc-dl

This pipeline allows extracting data from WARC files on a CPU cluster and streaming it to a GPU server, where it is processed.

Language:PythonLicense:MITStargazers:7Issues:2Issues:1

chatnoir-api

🔍 Simple, type-safe access to the ChatNoir search API.

Language:PythonLicense:MITStargazers:6Issues:4Issues:3

chatnoir2-mapfile-generator

ChatNoir HDFS Map File Generator

Language:JavaLicense:Apache-2.0Stargazers:5Issues:4Issues:0

chatnoir-pyterrier

🔍 Use the ChatNoir search engine in PyTerrier.

Language:PythonLicense:MITStargazers:3Issues:3Issues:1

webis-uuid

Webis UUID Generation Tool

Language:JavaLicense:MITStargazers:2Issues:4Issues:0
Language:JavaStargazers:0Issues:3Issues:0

chatnoir-warc-indexer

ChatNoir Indexer

Language:PythonStargazers:0Issues:4Issues:0