Vassili's repositories

amazon-a2i-sample-jupyter-notebooks

Sample Jupyter Notebooks for Amazon Augmented AI (A2I)

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 0

amazon-sagemaker-architecting-for-ml

Materials for a 3-day instructor-led course on applying machine learning

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0

amazon-sagemaker-mlops-workshop

Machine Learning Ops Workshop with SageMaker: lab guides and materials.

Language: Jupyter Notebook | License: MIT-0 | Stargazers: 0 | Issues: 0

autogluon

AutoGluon: AutoML Toolkit for Deep Learning

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

awesome-ml-courses

Awesome free machine learning and AI courses with video lectures.

Stargazers: 0 | Issues: 0

aws-data-wrangler

Utility belt to handle data on AWS.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

aws-glue-data-catalog-replication-utility

Replication utility for AWS Glue Data Catalog

License: MIT-0 | Stargazers: 0 | Issues: 0

aws-sagemaker-build

Creates a CloudFormation template that uses AWS Step Functions to automate the building and training of custom SageMaker models based on S3 and GitHub events

License: Apache-2.0 | Stargazers: 0 | Issues: 0

bookmark-utils

This repository contains the code for a utility that provides bookmark functionality in AWS Glue Python jobs

License: NOASSERTION | Stargazers: 0 | Issues: 0
License: MIT | Stargazers: 0 | Issues: 0

dask-tutorial

Dask tutorial

License: BSD-3-Clause | Stargazers: 0 | Issues: 0

data-scientists-guide-apache-spark

Best practices for using Spark, written for practicing data scientists and organized around a data scientist’s standard workflow.

Stargazers: 0 | Issues: 0

datawig

Imputation of missing values in tables.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

delta

An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

end-to-end-transformers

End-to-end Transformers workflow with SageMaker

Stargazers: 0 | Issues: 0

fzf.aws

Using fuzzy finder to perform AWS operations on the command line

License: MIT | Stargazers: 0 | Issues: 0
Stargazers: 0 | Issues: 0
License: MIT-0 | Stargazers: 0 | Issues: 0

handson-ml2

A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

koalas

Koalas: pandas API on Apache Spark

License: Apache-2.0 | Stargazers: 0 | Issues: 0

Machine-Learning-with-Python

Practice and tutorial-style notebooks covering a wide variety of machine learning techniques

License: BSD-2-Clause | Stargazers: 0 | Issues: 0

Mastering-Big-Data-Analytics-with-PySpark

Mastering Big Data Analytics with PySpark, Published by Packt

License: MIT | Stargazers: 0 | Issues: 0

modin

Modin: Speed up your Pandas workflows by changing a single line of code

License: Apache-2.0 | Stargazers: 0 | Issues: 0

ray

An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

serverless-sagemaker-orchestration

This example shows how to build a serverless pipeline to orchestrate the continuous training and deployment of a linear regression model for predicting housing prices using Amazon SageMaker, AWS Step Functions, AWS Lambda, and Amazon CloudWatch Events.

License: MIT-0 | Stargazers: 0 | Issues: 0

Spark-Programming-In-Python

Apache Spark 3 - Spark Programming in Python for Beginners

License: MIT | Stargazers: 0 | Issues: 0

Spark-The-Definitive-Guide

Spark: The Definitive Guide's Code Repository

License: NOASSERTION | Stargazers: 0 | Issues: 0

sparkmagic

Jupyter magics and kernels for working with remote Spark clusters

License: NOASSERTION | Stargazers: 0 | Issues: 0

workshop

AI and Machine Learning with Kubeflow, Amazon EKS, and SageMaker

Stargazers: 0 | Issues: 0