Sundaram Surampudi's repositories
awesome-healthcare
Curated list of awesome open source healthcare software, libraries, tools and resources.
Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
awesome-ontology
A curated list of ontology things
awesome-privacy
Awesome Privacy - A curated list of services and alternatives that respect your privacy because PRIVACY MATTERS.
azure-docs
Open source documentation of Microsoft Azure
best-of-ml-python
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
deepchecks
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and models from research to production.
duckdb-extension-radar
This repo contains information about DuckDB extensions found on GitHub. Refreshed daily
fibo
The Financial Industry Business Ontology (FIBO) defines the sets of things that are of interest in financial business applications and the ways that those things can relate to one another. In this way, FIBO can give meaning to any data (e.g., spreadsheets, relational databases, XML documents) that describe the business of finance.
hoppscotch
👽 Open source API development ecosystem - https://hoppscotch.io
lakehouse-tacklebox
This repo is a collection of tools to deploy, manage and operate a Databricks based Lakehouse.
metricflow
MetricFlow allows you to define, build, and maintain metrics in code.
operator-lifecycle-manager
A management framework for extending Kubernetes with Operators
PolarsVsPySpark
can Polars crunch 27GBs of data faster than Pyspark?
Stirling-PDF
locally hosted web application that allows you to perform various operations on PDF files
swirl-search
Swirl queries anything with an API then uses Large Language Models to re-rank the unified results without copying any data! Includes zero-code configs for Apache Solr, ChatGPT, Elastic Search, AWS OpenSearch, PostgreSQL, Google BigQuery, Generic HTTP/S, Google PSE, NLResearch.com, Miro, Microsoft 365, HubSpot, Atlassian, YouTrack, GitHub & more!
ydata-profiling
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.