gurunath's repositories
lakehouse-sharing
A table-format-agnostic data sharing framework
streamlit-react-flow
A Streamlit wrapper around the React Flow component
pipelineAPI
A simple data pipeline with a REST API
trino-lucario
Storage connector for Trino
dask-sql
SQL Engine for Dask
aws-sdk-pandas
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
booking_prediction
A Very Quick POC
connectors
Connectors for Delta Lake
creme
Online machine learning in Python
dask
Parallel computing with task scheduling
dask-interop
Integration tests to demonstrate Dask's interoperability with other systems
dask-sql-etl
This is a POC to implement ETL using pure SQL and Dask
delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
delta-rs
A native Rust library for Delta Lake, with bindings into Python and Ruby.
gimme-aws-creds
A CLI that utilizes Okta IdP via SAML to acquire temporary AWS credentials
interpret
Fit interpretable models. Explain blackbox machine learning.
obsei
Obsei is intended to be a workflow automation tool for text segmentation needs.
rajagurunath
Portfolio Repo
ray
An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
sweep
Sweep is an AI junior developer
video-analytics
Engineering video analytics use case using Raspberry Pi nodes