Tarek Saier's repositories

unarXive

A data set based on all arXiv publications, pre-processed for NLP, including structured full-text and citation network

Language:PythonLicense:MITStargazers:256Issues:6Issues:17

anki_add_pitch_plugin

Anki addon to automatically add pitch accent information to cards.

Language:PythonLicense:MITStargazers:32Issues:4Issues:40

JSONkeeper

A flask web application for storing JSON documents; with some special functions for JSON-LD.

Language:PythonLicense:MITStargazers:8Issues:2Issues:12

Canvas-Indexer

A flask web application that crawls Activity Streams for IIIF Canvases and offers a search API.

Language:PythonLicense:MITStargazers:6Issues:2Issues:6

kanjiplot

plots number of notes and kanji in an Anki deck

Language:HTMLLicense:WTFPLStargazers:6Issues:3Issues:0

ecir2020

Code and evaluation details for ECIR 2020 paper "Semantic Modelling of Citation Contexts for Context-aware Citation Recommendation"

Language:PythonLicense:MITStargazers:5Issues:2Issues:3

anki_add_pitch

Script to automatically add pitch accent information to an Anki deck.

Language:PythonLicense:MITStargazers:4Issues:2Issues:1

hyperpie

Source code, data, and documentation for the ECIR paper “HyperPIE: Hyperparameter Information Extraction from Scientific Publications”

Language:PythonStargazers:4Issues:2Issues:0
Language:PythonStargazers:2Issues:1Issues:0

slf_category_cards

Stadt, Land, Fluss - Kategoriekarten

Language:HTMLLicense:CC-BY-SA-4.0Stargazers:2Issues:1Issues:0

cross-lingual-citations-from-en

Source code, data, and evaluation details for “Cross-Lingual Citations in English Papers: A Large-Scale Analysis of Prevalence, Usage, and Impact”

Language:Jupyter NotebookStargazers:1Issues:2Issues:0

Curation-Tracer

A flask web application for IIIF resource usage analytics with regard to IIIF Curations.

Language:PythonLicense:MITStargazers:1Issues:2Issues:0
Language:TeXLicense:CC-BY-4.0Stargazers:1Issues:1Issues:0

sdp2021

Source code and evaluation details for the SDP 2021 paper “Bootstrapping Multilingual Metadata Extraction: A Showcase in Cyrillic”

Language:PythonStargazers:1Issues:2Issues:0

ulite2022

Code and data for the paper “A Blocking-Based Approach to Enhance Large-Scale Reference Linking” at ULITE@JCDL 2022

Language:PythonStargazers:1Issues:2Issues:0
Language:JavaScriptStargazers:1Issues:2Issues:0

anki_image_overview

Small script to extract images used in Anki cards and create a minimal overview webpage

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

calibre_html2aozora_txt

Clean 縦書き on a Kindle 4 from AZW3 files

Language:PythonStargazers:0Issues:2Issues:0

carcassoonne

carcass∞nne

Language:HTMLStargazers:0Issues:1Issues:0
Language:LuaStargazers:0Issues:2Issues:0
Language:SCSSStargazers:0Issues:1Issues:0
Language:Vim ScriptStargazers:0Issues:2Issues:0

jekyll-tag-pages-via-symlinks

Minimal PoC demonstrating how symlinks can be used to generate per tag pages in Jekyll.

Stargazers:0Issues:2Issues:0
Language:TeXLicense:CC-BY-4.0Stargazers:0Issues:0Issues:0

qdrant-js

JavaScript/Typescript SDK for Qdrant Vector Database

Language:TypeScriptLicense:Apache-2.0Stargazers:0Issues:0Issues:0

SciXGen

Dataset and model in the paper "SciXGen: A Scientific Paper Dataset for Context-Aware Text Generation"

Language:PythonStargazers:0Issues:1Issues:0

sirtetris.com_2015_python

rebuilt in python

Language:HTMLLicense:WTFPLStargazers:0Issues:2Issues:0
Language:HTMLLicense:WTFPLStargazers:0Issues:2Issues:0
Language:JavaScriptStargazers:0Issues:2Issues:0

webring

Make yourself a website

Language:HTMLLicense:MITStargazers:0Issues:1Issues:0