Vishwajeet Dabholkar's repositories
Pyspark-read-data-from-AWS-S3
Simple PySpark code to connect to AWS and read a CSV file from an S3 bucket
real-time-anomaly-detection
Real-Time Anomaly Detection in IoT Sensor Data. This notebook showcases real-time data ingestion using SingleStore's Pipeline function. It uses Python in SingleStore's Notebook to generate vector embeddings, and SQL support for vector processing (the dot_product function) to flag anomalies.
my-python-learning
My Jupyter notebook covering all the Python concepts I have learned
PySpark-joins
This notebook contains PySpark join examples
sparkChallenge
Given the JSON: {"labels":[{"id":1,"name":"abs"},{"id":2,"name":"ups"}]}
Get the output as:
+-----------+
|label_names|
+-----------+
| [abs, ups]|
+-----------+
SparkLearning
This repository contains sample Spark code that I use to practice and learn more about Spark.
csv-to-tsv-python-code
Simple Python code to convert a comma-separated file to a tab-separated file
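As a rough illustration of the kind of conversion this repository describes (not the repository's actual code), a minimal sketch using only the standard csv module might look like the following. The function name csv_to_tsv and the sample data are hypothetical.

```python
import csv
import io


def csv_to_tsv(src, dst):
    """Copy rows from a comma-separated stream to a tab-separated stream."""
    reader = csv.reader(src)
    # lineterminator="\n" avoids the csv module's default "\r\n" endings.
    writer = csv.writer(dst, delimiter="\t", lineterminator="\n")
    for row in reader:
        writer.writerow(row)


# Hypothetical sample input; the quoted field contains a comma.
sample = io.StringIO('id,name\n1,"Doe, Jane"\n')
out = io.StringIO()
csv_to_tsv(sample, out)
tsv_text = out.getvalue()
```

Using csv.reader rather than a plain str.replace keeps commas inside quoted fields intact. For real files, open both with newline="" as the csv module documentation recommends.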
Data-Profiling-in-PySpark-A-Practical-Guide
Data Profiling in PySpark: A Practical Guide by Vishwajeet Dabholkar
spaces-notebooks
Collection of notebooks for use with SingleStoreDB
file-upload-to-s3-using-boto3
Simple Python code using boto3 to upload a file to an S3 bucket
get-ec2-insatnce-details
Simple Python code using the boto3 library to get AWS EC2 instance details for a specific region
json_to_tsv
Python code to read JSON and convert it into a tab-delimited text file using only Python's native libraries
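A minimal sketch of this idea, assuming the input is a JSON array of flat objects that all share the same keys (the repository's actual input shape is not stated here). The function name json_records_to_tsv and the sample records are hypothetical.

```python
import csv
import io
import json


def json_records_to_tsv(json_text, dst):
    """Write a JSON array of flat objects to dst as tab-delimited text."""
    records = json.loads(json_text)
    # Assumption: every record has the same keys as the first one.
    fieldnames = list(records[0].keys())
    writer = csv.DictWriter(
        dst, fieldnames=fieldnames, delimiter="\t", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(records)


# Hypothetical sample data.
sample_json = '[{"id": 1, "name": "abs"}, {"id": 2, "name": "ups"}]'
buffer = io.StringIO()
json_records_to_tsv(sample_json, buffer)
tsv_output = buffer.getvalue()
```

Nested objects or records with differing keys would need flattening or a merged header first; this sketch does not handle them.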
memsql-installtion-on-Ubuntu-VM
Step-by-step guide for installing a MemSQL cluster on a single machine with one master node, one aggregator node, and one leaf node
memsql-pipeline
Different MemSQL pipelines to load data from a source into a MemSQL target table
phidata
Build AI Assistants using function calling
Printing-Big-letters-with-Figlet
Simple Python code to print big letters using the Figlet library
PySpark-read-from-mysql-db
Simple PySpark code to read from a MySQL table and store the result in a DataFrame.
python_query_aws_rds
Python code to connect to AWS RDS using psycopg2 and query it
query_athena_get-result_in_dict
Sample code to query Athena from Python and get the results as a list, plus simple logic to convert those results to a pandas DataFrame and then to a dict
quicksort_using_pyton
Simple quicksort implementation in Python
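For readers unfamiliar with the algorithm, here is one common way to write quicksort in Python. It is a sketch for illustration, not necessarily the version in this repository: it builds new lists rather than sorting in place, and it picks the middle element as the pivot.

```python
def quicksort(items):
    """Return a new sorted list using a simple, non-in-place quicksort."""
    if len(items) <= 1:
        return list(items)
    pivot = items[len(items) // 2]
    # Partition into three groups so duplicates of the pivot are kept.
    smaller = [x for x in items if x < pivot]
    equal = [x for x in items if x == pivot]
    larger = [x for x in items if x > pivot]
    return quicksort(smaller) + equal + quicksort(larger)


sorted_numbers = quicksort([5, 3, 8, 3, 1])
```

This form trades extra memory for readability; an in-place partition scheme avoids the intermediate lists.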
semantic-search-with-hugging-face
This notebook performs an AI-powered semantic search against a movie dataset in SingleStore. To follow along with this demo, download this notebook.
Simple-QR-code-generatory
Simple QR code generator for a link or text that you provide as input
Spark-ML-model-example
Spark’s library for machine learning is called MLlib (Machine Learning library). It’s heavily based on Scikit-learn’s ideas on pipelines.
TechGig-Code-Gladiators-2023
Problems and solutions for TechGig Code Gladiators 2023