There are 5 repositories under the sqoop topic.
Big Data Getting-Started Guide :star:
Exchangis is a lightweight, highly extensible data exchange platform that supports data transmission between structured and unstructured heterogeneous data sources.
Educational notes and hands-on problems with solutions for the Hadoop ecosystem
Repository used for Spark Trainings
Big data projects implemented by Maniram yadav
This repository gathers a curated list of resources for learning Hadoop for free.
Data cleaning, pre-processing, and analytics on health care data using Spark and Python.
The goal of this project is to build a docker cluster that gives access to Hadoop, HDFS, Hive, PySpark, Sqoop, Airflow, Kafka, Flume, Postgres, Cassandra, Hue, Zeppelin, Kadmin, Kafka Control Center and pgAdmin. This cluster is solely intended for usage in a development environment. Do not use it to run any production workloads.
Cloudera_Material: Study Material to help people preparing for Cloudera CCA Spark and Hadoop Developer Exam (CCA175). Feel free to collaborate.
Code for learning big data components
Docker Big Data Tools: This docker-compose file is configured to run multiple nodes. It sets up a Hadoop cluster containing the tools commonly needed in the big data domain, packaged as a collection of Docker containers that you can use directly.
A PHP script tool for incrementally backing up relational databases (MySQL, PostgreSQL, SQL Server, SQLite, Oracle, etc.) to Hive
Life-cycle: Internal working of HDFS, SQOOP, HIVE, SPARK, HBASE, KAFKA with code.
A data pipeline moving data from a Relational database system (RDBMS) to a Hadoop file system (HDFS).
A Docker setup that runs Airflow with the Hadoop ecosystem (Hive, Spark, and Sqoop)
I implemented various ETL processes, such as loading data from MySQL to HDFS using Sqoop, transforming the data and performing analytics using Spark and Scala, and loading the data back to HDFS.
Big Data
This repository contains all the documents related to HDPCD certification.
Big Data
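🏆 가물가물: a web app service that provides price-index-based food ingredient price information using distributed big data processing. 🥇 1st place, Excellence Award, SSAFY 7th cohort specialization project (2022.10.07)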
Predictive Analysis using Big Data platforms and Machine Learning Libraries
End-to-end big data project that aims to show how to implement the different big data layers, from the infrastructure layer to the end-user layer. [HADOOP][Spark][Kafka][Cassandra][Ansible][Jupyter][Docker]
Pizza orders data pipeline use case solved with SQL, Sqoop, HDFS, Hive, and Airflow.
This project aims to move the data from a Relational database system (RDBMS) to a Hadoop file system (HDFS)
Hadoop Filesystem Driver for Manta