Ishali Jain's repositories
data-accelerator
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
data-pipeline-samples
This repository hosts sample pipelines
examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
hadoop
Apache Hadoop
hadoop-etl-udfs
The Hadoop ETL UDFs are the main way to load data from Hadoop into EXASOL
serve
Serve, optimize and scale PyTorch models in production
serverless-cloudwatch-logs-exporter
AWS Serverless Lambda function that sends log data from CloudWatch Logs and S3 🎓
ssh-spool-source
Prototype SshSpoolSource for Flume - think Spooling Directory Source over SSH