Ramses Alexander Coraspe Valdez's repositories

uber-expenses-tracking

The goal of this project is to track the expenses of Uber Rides and Uber Eats through data Engineering processes using technologies such as Apache Airflow, AWS Redshift and Power BI.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:99Issues:7Issues:3

apache-spark-docker

Dockerizing an Apache Spark Standalone Cluster

Language:VBALicense:Apache-2.0Stargazers:40Issues:6Issues:3

csv-schema-inference

A tool to automatically infer columns data types in .csv files

Language:Jupyter NotebookLicense:MITStargazers:33Issues:3Issues:5

pyDag

Scheduling Big Data Workloads and Data Pipelines in the Cloud with pyDag

Language:PythonLicense:Apache-2.0Stargazers:24Issues:3Issues:1

wbz

A parallel implementation of the bzip2 data compressor in python, this data compression pipeline is using algorithms like Burrows–Wheeler transform (BWT) and Move to front (MTF) to improve the Huffman compression. For now, this tool only will be focused on compressing .csv files, and other files on tabular format.

Language:PythonLicense:Apache-2.0Stargazers:13Issues:2Issues:0

D3JS-Dashboard

Building Responsive DashBoard with D3.js and ASP.NET MVC from scratch (SQL SERVER - SSIS - API REST)

Language:C#Stargazers:12Issues:3Issues:0

docker-livy

Dockerizing and Consuming an Apache Livy environment

Language:HTMLLicense:MITStargazers:11Issues:4Issues:0
Language:PythonLicense:MITStargazers:4Issues:2Issues:0

csv-shuffler

A tool to automatically Shuffle lines in .csv files

Language:PythonLicense:MITStargazers:4Issues:2Issues:2

livyc

Apache Spark as a Service with Apache Livy Client

Language:PythonLicense:MITStargazers:3Issues:3Issues:0

RESTful-APIs-Nodejs

Building fast, scalable and secure RESTful services with Node, Express and MongoDB

Language:HTMLLicense:MITStargazers:3Issues:3Issues:0

apache-spark-course

Apache Spark with python

Language:Jupyter NotebookLicense:MITStargazers:2Issues:3Issues:0
Language:PythonLicense:MITStargazers:2Issues:2Issues:0

Wittline

Take a look at my repository

code_challenges

Scripts for different purposes

Language:PythonStargazers:1Issues:3Issues:0
Language:PythonLicense:MITStargazers:1Issues:2Issues:0

csv-splitter

csv-splitter

Language:PythonLicense:MITStargazers:1Issues:2Issues:0

model-catalog-grpc

A gRPC service to consume any machine learning model stored in a model catalog through a single endpoint.

License:MITStargazers:1Issues:2Issues:0

awesome-twitter-data

A list of Twitter datasets and related resources.

License:CC0-1.0Stargazers:0Issues:1Issues:0

bulk_json_sqlite

Efficiently Bulk Import a Large JSON File into SQLite

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Data-Quality

Data Quality

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

dictionary-substitute

Dictionary substitute Python Coding Task

Language:PythonStargazers:0Issues:2Issues:1

fastapi-jwt

Jwt with fastapi

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

fastapi-template

Completely Scalable FastAPI based template for Machine Learning, Deep Learning and any other software project which wants to use Fast API as an API framework.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

github-readme-stats

:zap: Dynamically generated stats for your github readmes

Language:JavaScriptLicense:MITStargazers:0Issues:1Issues:0

learning-golang

Learning golang

License:MITStargazers:0Issues:2Issues:0

nlp-recipes

Natural Language Processing Best Practices & Examples

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

ray-sql

Distributed SQL Query Engine in Python using Ray

Language:RustLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Language:Jupyter NotebookStargazers:0Issues:2Issues:0