HROlive / PATC-Big-Data-Analytics-BSC

Introduction to the main concepts and technologies related to Big Data and Data Analytics and their applications to real projects.

PATC: Introduction to Big Data Analytics @ BSC

Table of Contents

  1. Description
  2. Information
  3. File descriptions
  4. Certificate

Description

The objective of this course was to introduce the main concepts and technologies related to Big Data and Data Analytics and their applications to real projects. The course brought together key information technologies used in manipulating, storing, and analyzing data.

Students were introduced to systems that can accept, store, and analyze large volumes of unstructured data. The skills learned can be applied in data-intensive application areas.

Information

The overall goals of this course were the following:

  • Introduction to storing and processing unstructured data; main concepts of NoSQL databases;
  • Large-scale processing: Apache Spark and its core libraries for data manipulation, machine learning, data streams and graph analytics;
  • Characterization of a data mining problem and its relation to business intelligence, big data and exploratory statistics;
  • Basics of deep learning techniques in Python with TensorFlow;
  • Basic concepts of data visualization and tools.

More detailed information and links for the course can be found on the course website.

File descriptions

The exercises and assessments can be found in this repository, organized in folders, one for each day of the course.

Certificate

The certificate for the course can be found below:

"Introduction to Big Data Analytics" - Barcelona Supercomputing Center (Issued On: February 2023)


Languages

Jupyter Notebook 97.8%, Shell 1.0%, Batchfile 0.8%, Java 0.4%