Jiasen's repositories
docker-hadoop-alluxio
Quick start Hadoop, Alluxio on Dcoker container cluster
docker-hadoop-spark-zeppelin
快速简易搭建2节点hadoop+spark集群,使用zeppelin交互式开发
alluxio
Alluxio, data orchestration for analytics and machine learning in the cloud
alluxio-cacheTest
aws, ssh, alluxio, cache eviction policy
bigdata-docker-compose
Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.
cloud-morph
Decentralize, Self-host Cloud Gaming/Application
distributed-crawl
分布式爬虫实战教学
Dragonfly2
Dragonfly is an open source P2P-based file distribution and image acceleration system. It is hosted by the Cloud Native Computing Foundation (CNCF) as an Incubating Level Project.
FPDA-Alluxio
File Probability Distribution Analysis Tool, Used for identifying AI scenarios and big data analysis scenarios.
github-slideshow
A robot powered training repository :robot:
go-fuse
FUSE bindings for Go
HelloGitHub
:octocat: Find pearls on open-source seashore 分享 GitHub 上有趣、入门级的开源项目
iceberg
Apache Iceberg
impala
Apache Impala
jasondrogba.github.io
✨ Welcome to Jiasen website
juicefs
JuiceFS is a distributed POSIX file system built on top of Redis and S3.
libfuse
The reference implementation of the Linux FUSE (Filesystem in Userspace) interface
Machine-Learning-in-Action-Python3
《机器学习实战》的python3源码
multi-client-cacheTest
one master and multi workers to run a cacheTest
Python-100-Days
Python - 100天从新手到大师
quicktest-k8s
install operator and csi
spark
Apache Spark - A unified analytics engine for large-scale data processing
trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)