Semyon's repositories
feature-generation-benchmark
A database-like benchmark of feature generation from time-series data
spark-connect-example
An example of a Spark Connect extension
chispa
PySpark test helper methods with beautiful error messages
eren
PySpark Hive helper methods
farsante
Fake Pandas / PySpark DataFrame creator
GraphAr
An open-source, standard data file format for graph data storage and retrieval
jgrapht
Master repository for the JGraphT project
jungrapht-visualization
Visualization and sample code from Java Universal Network Graph, ported to use JGraphT models and algorithms
NetKetTests
Some tests of the NetKet library
spark
Apache Spark - A unified analytics engine for large-scale data processing
zingg
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
datafusion-comet
Apache DataFusion Comet Spark Accelerator
gex
Git Explorer: cross-platform git workflow improvement tool inspired by Magit
incubator-hugegraph
A graph database that supports 100+ billion data items, with high performance and scalability (includes OLTP engine, REST API, and backends)
mack
Delta Lake helper methods in PySpark
OTUS_DE_Homeworks
OTUS Data Engineering Course Homework
pyspark-ai
English SDK for Apache Spark
qmlcourse
Quantum Machine Learning Community Course
ssinchenko
Personal Blog. Powered by Hugo.
tinkerpop
Apache TinkerPop - a graph computing framework
unitycatalog
Open, Multi-modal Catalog for Data & AI
VariationalEigenSolver
VES with TensorFlow Quantum