Sato's repositories
airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
30-Days-of-ML-Kaggle
Machine learning beginner to Kaggle competitor in 30 days. Non-coders welcome. The program starts Monday, August 2, and lasts four weeks. It's designed for people who want to learn machine learning.
dask
Parallel computing with task scheduling
lakeFS
Git-like capabilities for your object storage
spark-daria
Essential Spark extensions and helper methods ✨😲
iceberg
Apache Iceberg
delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
mri-deep-learning-tools
Resurces for MRI images processing and deep learning in 3D
spring-xd-ambari
Apache Ambari integration for Spring XD
spark-deep-learning
Deep Learning Pipelines for Apache Spark
trajectory-clustering-methods
Comparing Different Clustering Methods and Similarity Metrics on Trajectory Datasets
python-ambariclient
Python client bindings for the Apache Ambari REST API
system-design-architecture
A collection of awesome software, libraries and frameworks, design and architecture principles, books and videos, important resources and best practices about System Design & Architecture
notebooks
Jupyter notebooks for the Natural Language Processing with Transformers book
deep-forecasting
Perform multivariate time series forecasting using LSTM networks and DeepLIFT for interpretation
CVE-2021-3156
Sudo Baron Samedit Exploit
Multimodal-datasets
This repository is build in association with our position paper on "Multimodality for NLP-Centered Applications: Resources, Advances and Frontiers". As a part of this release we share the information about recent multimodal datasets which are available for research purposes. We found that although 100+ multimodal language resources are available
TransNetV2
TransNet V2: Shot Boundary Detection Neural Network
ubertooth
Software, firmware and hardware designs for Ubertooth
spark-ranger-plugin
ACL Management for Apache Spark SQL with Apache Ranger.
docker-cheat-sheet
Docker Cheat Sheet
spark-notebook
Interactive and Reactive Data Science using Scala and Spark.
AR-Net
A simple Auto-Regressive Neural Network for time-series
Siamese-Network
Simese Network for similarity and ranking
dfhz_hdp_mpack
Install Ambari 2.7.5 with HDP 3.1.4 without using Hortonworks repositories.
Deep-learning-books
Books for machine learning, deep learning, math, NLP, CV, RL, etc
efficientnet
Implementation of EfficientNet model. Keras and TensorFlow Keras.
gr-gsm
Gnuradio blocks and tools for receiving GSM transmissions
katna
Tool for automating common video key-frame extraction, video compression and Image Auto-crop/Image-resize tasks