John1Tang's repositories
data-misc-tools
This project hosts several tools to help with development using Hive and Spark
algorithm-practice
LeetCode practice record
canal
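Alibaba's incremental subscription and consumption component for MySQL binlog. Alibaba Cloud DRDS ( https://www.aliyun.com/product/drds ), Alibaba TDDL secondary indexes, and small-table replication are powered by canal.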
CloudShuffleService
Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.
DB-GPT
Revolutionizing Database Interactions with Private LLM Technology
delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
dgl
Python package built to ease deep learning on graph, on top of existing DL frameworks.
john1tang.github.io
My personal blog for random notes
solr-data-import-handler
Repository for DIH (Data Import Handler)
spark-on-k8s-operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
go-and-cloud-native
Practice for Go programming and cloud-native development
hadoop
Mirror of Apache Hadoop
hive
Apache Hive
incubator-kyuubi
Apache Kyuubi is a distributed multi-tenant JDBC server for large-scale data processing and analytics, built on top of Apache Spark
jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
SparrowRecSys
A Deep Learning Recommender System
the-algorithm
Source code for Twitter's Recommendation Algorithm
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.