Karthik's starred repositories

gpt4all

gpt4all: run open-source LLMs anywhere

continue

⏩ Open-source VS Code and JetBrains extensions that enable you to easily create your own modular AI software development system

Language:TypeScriptLicense:Apache-2.0Stargazers:11680Issues:61Issues:946

gorilla

Gorilla: An API store for LLMs

Language:PythonLicense:Apache-2.0Stargazers:10330Issues:98Issues:152
Language:Jupyter NotebookLicense:MITStargazers:8137Issues:73Issues:29

weaver

Programming framework for writing and deploying cloud applications.

Language:GoLicense:Apache-2.0Stargazers:4567Issues:65Issues:122

llm-attacks

Universal and Transferable Attacks on Aligned Language Models

Language:PythonLicense:MITStargazers:2929Issues:29Issues:82

databerry

The no-code platform for building custom LLM Agents

Language:TypeScriptLicense:AGPL-3.0Stargazers:2890Issues:30Issues:212

blazingmq

A modern high-performance open source message queuing system

Language:C++License:Apache-2.0Stargazers:2479Issues:27Issues:55

peerdb

Fast, Simple and a cost effective tool to replicate data from Postgres to Data Warehouses, Queues and Storage

Language:GoLicense:NOASSERTIONStargazers:1849Issues:13Issues:289

dlt

data load tool (dlt) is an open source Python library that makes data loading easy 🛠️

Language:PythonLicense:Apache-2.0Stargazers:1805Issues:18Issues:375

ByConity

ByConity is an open source cloud data warehouse

Language:C++License:Apache-2.0Stargazers:1611Issues:36Issues:469

tableflow

The open-source CSV importer

Language:TypeScriptLicense:NOASSERTIONStargazers:1534Issues:17Issues:23

dozer

Dozer is a real-time data movement tool that leverages CDC from various sources and moves data into various sinks.

Language:RustLicense:AGPL-3.0Stargazers:1456Issues:13Issues:537

refact

WebUI for Fine-Tuning and Self-hosting of Open-Source Large Language Models for Coding

Language:JavaScriptLicense:BSD-3-ClauseStargazers:1447Issues:20Issues:132

bed

Binary editor written in Go

Language:GoLicense:MITStargazers:1204Issues:20Issues:12

awesome-dbt

A curated list of awesome dbt resources

kuwala

Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data science models and products with a focus on geospatial data. Currently, the following data connectors are available worldwide: a) High-resolution demographics data b) Point of Interests from Open Street Map c) Google Popular Times

Language:JavaScriptLicense:Apache-2.0Stargazers:769Issues:13Issues:72

kratix

Kratix is an open-source framework for building platforms

Language:GoLicense:Apache-2.0Stargazers:406Issues:7Issues:25

datadm

DataDM is your private data assistant. Slide into your data's DMs

Language:PythonLicense:MITStargazers:372Issues:8Issues:7

recap

Work with your web service, database, and streaming schemas in a single format.

Language:PythonLicense:MITStargazers:305Issues:10Issues:133
Language:PythonLicense:NOASSERTIONStargazers:270Issues:2Issues:23

piicatcher

Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub

Language:PythonLicense:Apache-2.0Stargazers:257Issues:13Issues:99

dbt-datamocktool

A dbt package for unit testing your SQL analytics models

Language:ShellLicense:Apache-2.0Stargazers:159Issues:7Issues:47

valmi-activation

⚡ valmi.io reverse ETL (data activation) is the open source ( OSS ) data activation platform to load data from warehouses into Webhooks and SaaS tools like Klaviyo, Facebook Ads, Salesforce, Braze etc. Valmi.io Customer Data Platform (CDP) helps track and ingest user activity events from websites, shopify, serverside events. https://cloud.valmi.io

Language:PythonLicense:NOASSERTIONStargazers:130Issues:5Issues:50

scaffolder

CLI tool to instantly generate skeleton project structure with boilerplate code, that's taken from configurable YAML file, to quickly kick-start your project

Language:GoLicense:MITStargazers:104Issues:5Issues:4

timeMachine

A distributed fault tolerant scheduler that is horizontally scalable 🔥

Language:GoLicense:MITStargazers:90Issues:7Issues:0

prism

Prism is the easiest way to develop, orchestrate, and execute data pipelines in Python.

Language:PythonLicense:Apache-2.0Stargazers:79Issues:3Issues:6

dbcat

Data Catalog for Databases and Data Warehouses

Language:PythonLicense:MITStargazers:29Issues:6Issues:8

heimdall

Dashboard for operating Flink jobs and deployments.

Language:SvelteLicense:Apache-2.0Stargazers:24Issues:0Issues:0

kolle

Business model representation automation

Language:ShellLicense:Apache-2.0Stargazers:9Issues:3Issues:0