chethanuk's repositories
chethanuk
Config files for my GitHub profile.
airflow-1
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
apache-spark-docker
Apache Spark Docker for Jupyter notebook or local development
dbt-tidb
A dbt adapter for TiDB
deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
docs
Dapr user documentation, used to build docs.dapr.io
easy-amazon-sagemaker-deployments
SageMaker custom deployments made easy
external-dns
Configure external DNS servers (AWS Route53, Google CloudDNS and others) for Kubernetes Ingresses and Services
flink-cluster-template
Flink K8s Cluster Template
flink-kubernetes-operator
Apache Flink Kubernetes Operator
gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
incubator-pinot
Apache Pinot (Incubating) - A realtime distributed OLAP datastore
litellm
Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)
mage-ai
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.
mlflow
Mlflow Docker Image
phidata
Build AI Assistants with memory, knowledge and tools.
python-deequ
Python API for Deequ
sdkman-db-migrations
Database migrations for the sdkman API
spark
Apache Spark - A unified analytics engine for large-scale data processing
sql_questions
Basic SQL questions to practice
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
zenml
ZenML 🙏: Build portable, production-ready MLOps pipelines. https://zenml.io.