Big Data Journal Projects (AWS-Big-Data-Projects)

These projects are done under the Cloud Tech and BigdataJournal Community Group

Location: Vadodara

Home Page: https://www.linkedin.com/company/thebigdatajournal

Twitter: @thebigdatajour

Big Data Journal Projects' repositories

Airline_Data_Analysis

Gather streaming data from an airline API using NiFi and batch data from AWS Redshift using Sqoop, build a data pipeline to analyse the data with Apache Hive and Druid and compare their performance, discuss Hive optimization techniques, and visualise the data using AWS QuickSight.

License: GPL-3.0, Stargazers: 11, Issues: 2, Issues: 0

HeartRate-Monitoring-using-AWS-IOT-and-AWS-KINESIS

You run a script to mimic multiple sensors publishing messages on an IoT MQTT topic, with one message published every second. The events are sent to AWS IoT, where an IoT rule is configured. The IoT rule captures all messages and sends them to Firehose. From there, Firehose writes the messages in batches to objects stored in S3. Over the S3 data, you set up a table in Athena and use QuickSight to analyze the IoT data.
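The first step of this flow, a script that mimics several sensors publishing one message per second, could be sketched roughly as follows. This is only an illustration and not the repository's actual script: the topic name `sensors/heartrate`, the payload fields, and the `publish` callback (a stand-in for a real MQTT client's publish call) are all assumptions.

```python
import json
import random
import time
from typing import Callable, Iterable, Iterator


def heart_rate_events(sensor_ids: Iterable[str], count: int,
                      seed=None, clock: Callable[[], float] = time.time) -> Iterator[dict]:
    """Yield `count` rounds of simulated readings, one per sensor per round.

    The payload fields below are hypothetical, chosen for illustration only.
    """
    rng = random.Random(seed)
    sensor_ids = list(sensor_ids)
    for _ in range(count):
        for sensor_id in sensor_ids:
            yield {
                "sensor_id": sensor_id,
                "heart_rate": rng.randint(60, 100),  # beats per minute
                "timestamp": int(clock()),
            }


def publish_all(events: Iterable[dict], publish: Callable[[str, str], None],
                topic: str = "sensors/heartrate",  # hypothetical topic name
                interval: float = 1.0,
                sleep: Callable[[float], None] = time.sleep) -> int:
    """Serialize each event to JSON, hand it to `publish`, then wait `interval` seconds."""
    sent = 0
    for event in events:
        publish(topic, json.dumps(event))
        sent += 1
        sleep(interval)
    return sent


if __name__ == "__main__":
    # Print instead of sending to a broker; swap in a real MQTT publish call here.
    simulated = heart_rate_events(["sensor-1", "sensor-2"], count=3, seed=42)
    publish_all(simulated, publish=lambda topic, payload: print(topic, payload))
```

Passing the publisher in as a callback keeps the simulation independent of any particular MQTT library, so the same generator can be pointed at a console, a test double, or a real client.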

Language: Python, License: Apache-2.0, Stargazers: 10, Issues: 2, Issues: 3

awesome-opensource-data-engineering

An Awesome List of Open-Source Data Engineering Projects

License: NOASSERTION, Stargazers: 2, Issues: 1, Issues: 0

AWS_File_Trans_Lamda_S3_SNS

AWS Data Engineering Project using Lambda, S3 and SNS

Language: Python, Stargazers: 2, Issues: 1, Issues: 0
Language: Python, License: NOASSERTION, Stargazers: 1, Issues: 1, Issues: 0

dbt-glue

This repository contains the dbt-glue adapter

Language: Python, License: Apache-2.0, Stargazers: 1, Issues: 1, Issues: 0
Language: JavaScript, License: MIT-0, Stargazers: 1, Issues: 1, Issues: 0
Stargazers: 0, Issues: 1, Issues: 0

amazon-kinesis-data-analytics-blueprints

Kinesis Data Analytics Blueprints are a curated collection of Apache Flink applications. Each blueprint will walk you through how to solve a practical problem related to stream processing using Apache Flink. These blueprints can be leveraged to create more complex applications to solve your business challenges in Apache Flink.

Language: TypeScript, License: MIT-0, Stargazers: 0, Issues: 1, Issues: 0
Language: Python, License: MIT-0, Stargazers: 0, Issues: 1, Issues: 0

arvados

An open source platform for managing and analyzing biomedical big data

Language: Go, License: NOASSERTION, Stargazers: 0, Issues: 1, Issues: 0
Language: HCL, License: MIT-0, Stargazers: 0, Issues: 1, Issues: 0

aws-glue-cdk-cicd

Build, Test and Deploy ETL solutions using AWS Glue and AWS CDK based CI/CD pipelines

Language: Python, License: MIT-0, Stargazers: 0, Issues: 1, Issues: 0

aws-glue-test-data-generator

AWS Glue Configurable Test Data Generator

Language: Python, License: MIT-0, Stargazers: 0, Issues: 1, Issues: 0

aws-security-hub-glue-aggregator-terraform

These Terraform modules aggregate Security Hub findings to a centralized account using Amazon Kinesis Firehose and AWS Glue

Language: HCL, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

bigdata-file-viewer

A cross-platform (Windows, Mac, Linux) desktop application to view common big data binary formats such as Parquet, ORC, and Avro. Supports the local file system, HDFS, AWS S3, Azure Blob Storage, etc.

Language: Java, License: GPL-2.0, Stargazers: 0, Issues: 1, Issues: 0

ClickHouse

ClickHouse® is a free analytics DBMS for big data

Language: C++, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

data-engineering

Construct a modern data stack and orchestrate the workflows to create high-quality data for analytics and ML applications.

Language: Jupyter Notebook, Stargazers: 0, Issues: 1, Issues: 0

data-engineering-zoomcamp

Free Data Engineering course!

Language: Jupyter Notebook, Stargazers: 0, Issues: 1, Issues: 0

data-science-on-aws

AI and Machine Learning with Kubeflow, Amazon EKS, and SageMaker

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

emr-studio-notebook-examples

This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio.

License: MIT-0, Stargazers: 0, Issues: 1, Issues: 0
Language: Scala, License: MIT-0, Stargazers: 0, Issues: 1, Issues: 0
Language: Python, License: MIT-0, Stargazers: 0, Issues: 1, Issues: 0

monitor-serverless-datalake

Alerting and notification in a serverless data lake during failures

Language: Python, License: MIT-0, Stargazers: 0, Issues: 1, Issues: 0
Language: Java, License: MIT-0, Stargazers: 0, Issues: 1, Issues: 0

nextflow

A DSL for data-driven computational pipelines

Language: Groovy, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

querypal

Web UI for Amazon Athena

Language: Vue, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0
Language: Python, License: MIT-0, Stargazers: 0, Issues: 1, Issues: 0
Language: Python, License: MIT, Stargazers: 0, Issues: 1, Issues: 0
Language: Python, License: MIT-0, Stargazers: 0, Issues: 1, Issues: 0