Yves Maurer (ymaurer)

ymaurer

Geek Repo

Location:Luxembourg

Github PK Tool:Github PK Tool


Organizations
natliblux

Yves Maurer's repositories

cdx-summarize

Summarize CDX(J) files for MIME analysis per 2nd-level domain

Language:PythonLicense:Apache-2.0Stargazers:8Issues:2Issues:0

cdx-summarize-warc-indexer

Summarize Web Archive holdings using an existing SOLR index

Language:ShellLicense:Apache-2.0Stargazers:1Issues:0Issues:0

eluxemburgensia-opendata-ark

Get the Archival resource keys from eluxemburgensia.lu public opendata set (the text analysis pack)

Language:ShellStargazers:1Issues:1Issues:0

fasttiffcrop

crop multiple jpegs from a single source tiff in a fast and memory-efficient way

Language:C++Stargazers:1Issues:2Issues:0

cdx-index-client

A command-line tool for using CommonCrawl Index API at http://index.commoncrawl.org/

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

common-crawl-dl

Download common crawl data for some top level domains

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

fixit_tiff

fixes some issues in (potentially) baseline tiffs

Language:CStargazers:0Issues:1Issues:0

langchain

⚡ Building applications with LLMs through composability ⚡

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

mets-export-illustrations

Export illustrations from METS files alongside metadata

Language:PerlLicense:GPL-3.0Stargazers:0Issues:1Issues:0

speller-ocr-eval

Evaluate OCR correctness by identifying the language and then running a spell checker

Language:ShellLicense:GPL-3.0Stargazers:0Issues:1Issues:0

warcnet-cdx-summarize-analysis

Generate reports from cdx-summarize files

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0