map-reduce

There are 5 repositories under map-reduce topic.

chrislusf / gleam
Fast, efficient, and scalable distributed map/reduce system, DAG execution, in memory or on disk, written in pure Go, runs standalone or distributedly.
distributed-computing distributed-systems golang map-reduce
Language:Go 3543
numaproj / numaflow
Kubernetes-native platform to run massively parallel data/streaming jobs
data-processing hacktoberfest k8s kubernetes map-reduce pipeline stream-processing
Language:Rust 2387
Qihoo360 / poseidon
A search engine which can hold 100 trillion lines of log data.
poseidon search-engine golang big-data map-reduce
Language:Go 1985
JuliaFolds / Transducers.jl
Efficient transducers for Julia
julia transducers parallel high-performance map-reduce distributed-computing iterators
Language:Julia 442
Spark-with-Python
tirthajyoti / Spark-with-Python
Fundamentals of Spark with Python (using PySpark), code examples
pyspark spark apache-spark dataframe mlib machine-learning big-data database map-reduce python hdfs analytics hadoop distributed-computing parallel-computing sql apache
Language:Jupyter Notebook 355
tkf / ThreadsX.jl
Parallelized Base functions
julia high-performance map-reduce transducers sorting-algorithms parallel
Language:Julia 333
commoncrawl / cc-mrjob
Demonstration of using Python to process the Common Crawl dataset with the mrjob framework
python map-reduce hadoop commoncrawl
Language:Python 166
phelps-sg / python-bigdata
Data science and Big Data with Python
data-science python hbase numpy numerical-methods notebook-jupyter spark map-reduce
Language:Jupyter Notebook 136
xarray-contrib / flox
Fast & furious GroupBy operations for dask.array
dask xarray map-reduce
Language:Python 133
asavinov / prosto
Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
workflow data-processing map-reduce spark pandas python feature-engineering data-science data-wrangling data-preprocessing data-preparation business-intelligence olap
Language:Python 93
daleroberts / pypar
Efficient and scalable parallelism using the message passing interface (MPI) to handle big data and highly computational problems.
map-reduce mpi python big-data
Language:Python 69
JuliaFolds / FoldsCUDA.jl
Data-parallelism on CUDA using Transducers.jl and for loops (FLoops.jl)
gpu cuda julia transducers parallel high-performance map-reduce iterators
Language:Julia 57
rvantonder / hack_parallel
The core parallel and shared memory library used by Hack, Flow, and Pyre
ocaml parallel shared-memory map-reduce
Language:OCaml 42
dragonly / pingcap_interview
pingcap 面试小作业
interview map-reduce
Language:Go 36
JuliaFolds / data-parallelism
julia parallel high-performance map-reduce distributed-computing iterators transducers franklin
Language:Julia 36
RedisGears / redisgears-py
RedisGears python client
redis redisgears python-client map-reduce stream-processing
Language:Python 27
Assifar-Karim / apollo
A lightweight modern map reduce framework brought to k8s
distributed-systems grpc k8s map-reduce object-storage go s3
Language:Go 26
Cheng-Lin-Li / Spark
There are Python 2.7 codes and learning notes for Spark 2.1.1
python27 kmeans kmeans-clustering als apriori-algorithm minhash-lsh-algorithm minhash uv-decomposition alternating-least-squares savasere-omiecinski-and-navathe apriori-son spark map-reduce tf-idf cosine-similarity
Language:Python 24
nglthu / infoRetrieval
Inverted Indexer, web crawler, sort, search and poster steamer written using Python for information retrieval.
python3 inverted-index map-reduce terms tokens heaps webcrawler stemming-algorithm information-retrieval
Language:HTML 22
CaptainCodeman / datastore-mapper
Appengine Datastore Mapper in Go
datastore-mapper datastore-entities cloud-storage shards bigquery datastore appengine go map-reduce
Language:Go 21
AsadiAhmad / Word-Counter
Word Counter with Haskell for Programming Language Design Course
haskell recursive-algorithm word-counter map-reduce
Language:Haskell 20
ihor / Phadoop
Map/reduce jobs for Hadoop in PHP
hadoop map-reduce php
Language:PHP 18
loveyacper / raft_for_dummies
2017春季MIT分布式系统课程实验
raft consensus-algorithm map-reduce golang
Language:Go 18
open-soql
shellyln / open-soql
Open source implementation of the SOQL.
soql graph-query object-query sql resolvers dml map-reduce javascript typescript library query-engine
Language:TypeScript 16
imehrdadmahdavi / map-reduce-inverted-index
Creating an Inverted Index of words occurring in a large set of documents extracted from web pages using Hadoop MapReduce and Google Dataproc
mapreduce hadoop inverted-index map-reduce information-retrieval gcp dataproc clustering bigdata big-data googlecloud search-engine dataprocessing
Language:Java 14
kalmyk / fox-wamp
Web Application Message Async Server and WAMP/MQTT bridge
mqtt websocket iot map-reduce stream-processing wamp-router async-storage
Language:JavaScript 14
pscosta / go-strm
A rich Map/Reduce API in Go
mapreduce generics golang functional-programming map-reduce
Language:Go 14
fangvv / EdgeLD
Code for paper "Locally Distributed Deep Learning Inference on Edge Device Clusters"
dnn inference cluster parallel-computing speedup deep-learning edge-computing vggnet distributed-computing map-reduce parallel-algorithm workload
Language:Python 13
jsdp
gwr3n / jsdp
A Java Stochastic Dynamic Programming Library
stochastic dynamic programming java uncertainty object-oriented parallel map-reduce lambda-calculus stream inventory optimal control maintenance
Language:Java 13
futureverse / future.mapreduce
[EXPERIMENTAL] R package: future.mapreduce - Utility Functions for Future Map-Reduce API Packages
r package futures map-reduce
Language:R 12
lorenzo-stacchio / Big_Data_Course_Rimini_2021
Questa repository contiene tutto il materiale didattico utilizzato durante il corso di "Laboratorio Big Data" in collaborazione con il comune di Rimini.
data-science big-data nosql-database sql-database machine-learning data-visualization regression classification feature-extraction feature-selection spark hadoop map-reduce
Language:Jupyter Notebook 11
mesqueeb / map-anything
Array.map but for objects with good TypeScript support. A small and simple integration.
map-object object-map object-to-object mapping transform object-mapper compose map-reduce
Language:TypeScript 11
123vivekr / distributed-map-reduce
An experimental distributed map reduce system based on Google's MapReduce, written in Rust!
distributed-systems map-reduce rust-lang
Language:Rust 10
codelibra / Time-series-analysis-nyc-taxi
⏰ 📓 Time series analysis of new york taxi data
map-reduce hadoop hive time-series taxi-data new-york-city machine-learning nightlife
Language:Java 9
asuiu / streamerate
Iterable Java8 style Streams for Python
java-streams map-reduce mapreduce python python-iterables python-itertools python-mapreduce python-multiprocessing python-multithreading python-streaming python3 streaming
Language:Python 8
natelalor / AI_report_generator
A tool that converts long audio files into a thorough, summarized report. Leverages OpenAI and its API (ChatGPT backend), Langchain for text processing, and Pinecone for vector database facilitation.
artificial-intelligence chatbot embedding-models langchain map-reduce object-oriented-programming openai openai-api pinecone python vector-database
Language:Python 8

map-reduce

chrislusf / gleam

numaproj / numaflow

Qihoo360 / poseidon

JuliaFolds / Transducers.jl

tirthajyoti / Spark-with-Python

tkf / ThreadsX.jl

commoncrawl / cc-mrjob

phelps-sg / python-bigdata

xarray-contrib / flox

asavinov / prosto

daleroberts / pypar

JuliaFolds / FoldsCUDA.jl

rvantonder / hack_parallel

dragonly / pingcap_interview

JuliaFolds / data-parallelism

RedisGears / redisgears-py

Assifar-Karim / apollo

Cheng-Lin-Li / Spark

nglthu / infoRetrieval

CaptainCodeman / datastore-mapper

AsadiAhmad / Word-Counter

ihor / Phadoop

loveyacper / raft_for_dummies

shellyln / open-soql

imehrdadmahdavi / map-reduce-inverted-index

kalmyk / fox-wamp

pscosta / go-strm

fangvv / EdgeLD

gwr3n / jsdp

futureverse / future.mapreduce

lorenzo-stacchio / Big_Data_Course_Rimini_2021

mesqueeb / map-anything

123vivekr / distributed-map-reduce

codelibra / Time-series-analysis-nyc-taxi

asuiu / streamerate

natelalor / AI_report_generator