Bharath Ballamudi's repositories
100DaysOfCode
PyBites #100DaysOfCode
amundsenfrontendlibrary
Front-end service library for Amundsen
awesome
😎 Awesome lists about all kinds of interesting topics
aws-toolbox
A collection of DevOps tools including shell & python scripts that automate the boring stuff in AWS.
BERT-for-RRC-ABSA
code for our NAACL 2019 paper: "BERT Post-Training for Review Reading Comprehension and Aspect-based Sentiment Analysis"
cards-pytest
Project task tracking / todo list
cs-video-courses
List of Computer Science courses with video lectures.
data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
datacatalog-tag-manager
Python package to manage Google Cloud Data Catalog tags, loading metadata from external sources -- currently supports the CSV file format
DataProfiler
What's in your data? Extract schema, statistics and entities from datasets
Edator
A python package that performs exploratory data analysis for users. Additionally, it generates 3 output files that comprise of a cleaned CSV, plots and a text report.
entity_resolution
Example entity resolution workflow using PySpark
incubator-superset
Apache Superset is a Data Visualization and Data Exploration Platform
marquez-airflow
Airflow support for Marquez
marquez-python
Python client for Marquez
medium-search-app
A simple search engine to search medium stories built with streamlit and elasticsearch.
mobydq
:whale: Tool to automate data quality checks on data pipelines
multi-data-lineage-capture-py
IBM Multi-Lineage Data System
python-deequ
Python API for Deequ
text_similarity
A nlp library for text similarity based on Transformer models
zero-administration-inference-with-aws-lambda-for-hugging-face
spacy-ner-aws-lambda 🤗