gurunath's starred repositories
spark-flight-connector
A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL
flight-sql-server-example
An example Flight SQL Server implementation - with DuckDB and SQLite back-ends.
devika
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.
data-engineering-zoomcamp
Free Data Engineering course!
airflow-dags-test-action
GitHub Action to Test Airflow Dags
lakehouse-engine
The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data Products.
OpenBBTerminal
Investment Research for Everyone, Everywhere.
pykoi-rlhf-finetuned-transformers
pykoi: Active learning in one unified interface
spark-expectations
A Python Library to support running data quality rules while the spark job is running⚡
risingwave
SQL stream processing, analytics, and management. We decouple storage and compute to offer instant failover, dynamic scaling, speedy bootstrapping, and efficient joins.
gpt-migrate
Easily migrate your codebase from one framework or language to another.
jsonformer
A Bulletproof Way to Generate Structured JSON from Language Models