Suneel Marthi's repositories
anserini
A Lucene toolkit for replicable information retrieval research
Beam-Ludwig
Beam and Ludwig
DataQuality
Spark Notebooks for Data Quality and Anomaly Detection
deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
djl-demo
Demo applications showcasing DJL
flair
A very simple framework for state-of-the-art Natural Language Processing (NLP)
flink-connectors
Apache Flink connectors for Pravega.
handwritten-text-recognition-for-apache-mxnet
This repository lets you train neural network models for end-to-end full-page handwriting recognition using the Apache MXNet deep learning framework on the IAM Dataset.
hops
Hops Hadoop is a distribution of Apache Hadoop with distributed metadata.
incubator-iceberg
Apache Iceberg (Incubating)
ludwig
Ludwig is a toolbox built on top of TensorFlow that allows users to train and test deep learning models without the need to write code.
milan
Milan is a Scala API and runtime infrastructure for building data-oriented systems, built on top of Apache Flink.
milewski-ctfp-pdf
Bartosz Milewski's 'Category Theory for Programmers' unofficial PDF and LaTeX source
snorkel-tutorials
A collection of tutorials for Snorkel
SpaceNetExploration
A sample project demonstrating how to extract building footprints from satellite images using a semantic segmentation model. Data from the SpaceNet Challenge.
stateful-functions
Stateful Functions for Apache Flink
strange
Quantum Computing API for Java
tf-encrypted
A Framework for Machine Learning on Encrypted Data
video-samples
Sample applications for video processing with Pravega