Paul Groth (pgroth)

pgroth

Geek Repo

Location:Amsterdam

Home Page:http://pgroth.com

Github PK Tool:Github PK Tool


Organizations
Data2Semantics
openphacts

Paul Groth's starred repositories

DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Language:C++License:MPL-2.0Stargazers:24858Issues:667Issues:2109

al-folio

A beautiful, simple, clean, and responsive Jekyll theme for academics

Language:HTMLLicense:MITStargazers:10094Issues:24Issues:536

docquery

An easy way to extract information from documents

Language:PythonLicense:MITStargazers:1681Issues:24Issues:46

pyserini

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.

Language:PythonLicense:Apache-2.0Stargazers:1574Issues:19Issues:535

differential-datalog

DDlog is a programming language for incremental computation. It is well suited for writing programs that continuously update their output in response to input changes. A DDlog programmer does not write incremental algorithms; instead they specify the desired input-output mapping in a declarative manner.

Language:JavaLicense:MITStargazers:1353Issues:30Issues:415

BLINK

Entity Linker solution

Language:PythonLicense:MITStargazers:1153Issues:40Issues:94

semantic-python-overview

(subjective) overview of projects which are related both to python and semantic technologies (RDF, OWL, Reasoning, ...)

mandala

A simple & elegant experiment tracking framework that integrates persistence logic & best practices directly into Python

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:470Issues:4Issues:9

grimoirelab

GrimoireLab: platform for software development analytics and insights

Language:RoffLicense:GPL-3.0Stargazers:465Issues:24Issues:375

sgr

sgr (command line client for Splitgraph) and the splitgraph Python library

Language:PythonLicense:NOASSERTIONStargazers:326Issues:9Issues:43

cc2dataset

Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...

Language:PythonLicense:MITStargazers:301Issues:9Issues:33

REL

REL: Radboud Entity Linker

Language:PythonLicense:MITStargazers:300Issues:11Issues:93

potato

potato: portable text annotation tool

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:276Issues:10Issues:41

nlp-labelling

Labelling platform for text using weak supervision.

Language:JavaScriptLicense:GPL-3.0Stargazers:257Issues:4Issues:13

visions

Type System for Data Analysis in Python

Language:PythonLicense:NOASSERTIONStargazers:203Issues:6Issues:61

morph-kgc

Powerful RDF Knowledge Graph Generation with RML Mappings

Language:PythonLicense:Apache-2.0Stargazers:172Issues:14Issues:162

valentine

A tool facilitating matching for any dataset discovery method. Also, an extensible experiment suite for state-of-the-art schema matching methods.

Language:PythonLicense:Apache-2.0Stargazers:78Issues:9Issues:35

mlinspect

Inspect ML Pipelines in Python in the form of a DAG

Language:PythonLicense:Apache-2.0Stargazers:68Issues:5Issues:52

openclean

openclean - Data Cleaning and data profiling library for Python

Language:PythonLicense:BSD-3-ClauseStargazers:64Issues:10Issues:1

record-linkage-tutorial

A tutorial on entity resolution (record linkage or de-duplication)

Language:TeXLicense:GPL-2.0Stargazers:62Issues:9Issues:2

melt

MELT - Matching EvaLuation Toolkit

Language:JavaLicense:MITStargazers:45Issues:11Issues:50

CleanML

A Benchmark for Joint Data Cleaning and Machine Learning

GeeseDB

Graph Engine for Exploration and Search

Language:PythonLicense:MITStargazers:36Issues:9Issues:22

RFC-Security-Research

Paper, data and code from Investigating Potential Security Vulnerability Manifestation through Various Analyses & Inferences Regarding Internet RFCs

Language:HTMLStargazers:18Issues:7Issues:0
Language:JavaLicense:Apache-2.0Stargazers:7Issues:4Issues:0

workshop_data_viz

Data visualization workshop (Ams data science center, 2022Feb)

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:6Issues:2Issues:0

text-alpha

Python implementation of character-level, textual inter-annotator agreement with Krippendorff's alpha.

Language:PythonLicense:MITStargazers:3Issues:2Issues:0

data-discovery

Leveraging table semantics for data or knowledge discovery

Language:Jupyter NotebookStargazers:2Issues:0Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1Issues:0Issues:0

NeuralDB

Database Reasoning Over Text project for ACL paper

Language:PythonLicense:Apache-2.0Stargazers:1Issues:2Issues:0