Vik's repositories
databricks-nutter-repos-demo
Demo of using the Nutter for testing of Databricks notebooks in the CI/CD pipeline
databricks-tpc-di
Databricks Implementation of the TPC-DI Specification using Traditional Notebooks and/or Delta Live Tables
neo4j_spark_template
neo4j_spark_template
azure-dbx-ncc-tf
Azure Databricks NCC Example
calculator
A node.js demo application
chispa
PySpark test helper methods with beautiful error messages
databricks-lakehouse
This repo contains live examples to build Databricks' Lakehouse and recommended best practices from the field.
dataestate-benchmarks
This repo is the public repository which hosts the TPC benchmarks we use to gauge system performance.
DemoContent
Version control for demo content to easily show demos in different clouds.
dlt-files-in-repos-demo
Demonstration of using Files in Repos with Databricks Delta Live Tables
hls-interop-workshop-jan23
Assets used in interoperability workshop made publicly available
HuggingFace-on-Azure-Databricks
Sample notebooks for optimized training and inference of Hugging Face models on Azure Databricks
ide-best-practices
Best practices for working with Databricks from an IDE
ml-in-production
Machine Learning in Production
openai-cookbook
Examples and guides for using the OpenAI API
pixels
Facilitates simple large scale processing of HLS Medical images, documents, zip files. Previously at https://github.com/dmoore247/pixels
security-analysis-tool
Security Analysis Tool (SAT) analyzes customer's Databricks account and workspace security configurations and provides recommendations that help them follow Databrick's security best practices. When a customer runs SAT, it will compare their workspace configurations against a set of security best practices and delivers a report.
spark-nlp-workshop
Public runnable examples of using John Snow Labs' NLP for Apache Spark.
system-tables-augmentation
Files for bootstrapping system tables
tech-talks
This repository contains the notebooks and presentations we use for our Databricks Tech Talks
text-to-insights
A repo that shows end-to-end instructions for using large language models, vector databases and resource efficient fine-tuning techniques to go from text to sql to insights
The-Kaggle-Workbook
Code Repository for The Kaggle Workbook, Published by Packt