There are 5 repositories under map-reduce topic.
Fast, efficient, and scalable distributed map/reduce system, DAG execution, in memory or on disk, written in pure Go, runs standalone or distributedly.
Efficient transducers for Julia
Fundamentals of Spark with Python (using PySpark), code examples
Parallelized Base functions
Demonstration of using Python to process the Common Crawl dataset with the mrjob framework
Data science and Big Data with Python
Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Efficient and scalable parallelism using the message passing interface (MPI) to handle big data and highly computational problems.
Data-parallelism on CUDA using Transducers.jl and for loops (FLoops.jl)
The core parallel and shared memory library used by Hack, Flow, and Pyre
RedisGears python client
There are Python 2.7 codes and learning notes for Spark 2.1.1
Appengine Datastore Mapper in Go
Inverted Indexer, web crawler, sort, search and poster steamer written using Python for information retrieval.
Creating an Inverted Index of words occurring in a large set of documents extracted from web pages using Hadoop MapReduce and Google Dataproc
[EXPERIMENTAL] R package: future.mapreduce - Utility Functions for Future Map-Reduce API Packages
Questa repository contiene tutto il materiale didattico utilizzato durante il corso di "Laboratorio Big Data" in collaborazione con il comune di Rimini.
⏰ 📓 Time series analysis of new york taxi data
Map-Reduce implemented with Scala
An experimental distributed map reduce system based on Google's MapReduce, written in Rust!
Iterable Java8 style Streams for Python
A tool that converts long audio files into a thorough, summarized report. Leverages OpenAI and its API (ChatGPT backend), Langchain for text processing, and Pinecone for vector database facilitation.