Vadim Patsalo's repositories
Alignment-Algorithm
Code Accompaniment for 'Visualization, Quantification and Alignment of Spectral Drift in Population Scale Untargeted Metabolomics Data'
alphapept
A modular, python-based framework for mass spectrometry. Powered by nbdev.
chispa
PySpark test helper methods with beautiful error messages
data_engineering_project_1
My first attempt at a rough ETL pipeline; technologies include spark, GCS, prefect orchestration, and terraform
databricks-nutter-repos-demo
Demo of using the Nutter for testing of Databricks notebooks in the CI/CD pipeline
dbx-migrate
Scripts to help customers with one-off migrations between Databricks workspaces.
glow
An open-source toolkit for large-scale genomic analysis
hyperopt
Distributed Asynchronous Hyperparameter Optimization in Python
lakefs-dais-challenge
Data + AI Summit 2022 lakeFS challenge
logue-sdk
This repository contains all the files and tools needed to build custom oscillators and effects for the prologue synthesizer.
mlflow-workshop-part-1
Partly lecture and partly a hands-on tutorial and workshop, this is a three part series on how to get started with MLflow. In this three part series, we will cover MLflow Tracking, Projects, Models, and Model Registry.
spark-playground
Playing with different packages of the Spark...
spark-tfrecord
Read and write Tensorflow TFRecord data from Apache Spark.
terraform-aws-github-runner
Terraform module for scalable GitHub action runners on AWS
unity-catalog-setup
Notebooks, terraform, tools to enable setting up Unity Catalog