Josh Wills (jwills)

Company:N/A

Location:San Francisco, CA

Home Page:http://twitter.com/josh_wills

Josh Wills's starred repositories

lancedb

Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!

Language:Rust | License:Apache-2.0 | Stargazers:4248 | Issues:28 | Issues:712

harlequin

The SQL IDE for Your Terminal.

Language:Python | License:MIT | Stargazers:3634 | Issues:24 | Issues:175

autolabel

Label, clean and enrich text datasets with LLMs.

Language:Python | License:MIT | Stargazers:2030 | Issues:20 | Issues:250

awesome-duckdb

🦆 A curated list of awesome DuckDB resources

unlimiformer

Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"

Language:Python | License:MIT | Stargazers:1051 | Issues:23 | Issues:60

dbt-duckdb

dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)

Language:Python | License:Apache-2.0 | Stargazers:881 | Issues:20 | Issues:143

datacomp

DataComp: In search of the next generation of multimodal datasets

Language:Python | License:NOASSERTION | Stargazers:644 | Issues:17 | Issues:63

piperider

Code review for data in dbt

Language:Python | License:Apache-2.0 | Stargazers:480 | Issues:14 | Issues:75

vscode-dbt-power-user

This extension makes vscode seamlessly work with dbt™: Auto-complete, preview, column lineage, AI docs generation, health checks, cost estimation etc

Language:JavaScript | License:MIT | Stargazers:452 | Issues:8 | Issues:413

titan

Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for the Snowflake data warehouse.

Language:Python | License:Apache-2.0 | Stargazers:390 | Issues:16 | Issues:45

pgpq

Stream Arrow data into Postgres

Language:Rust | License:MIT | Stargazers:240 | Issues:6 | Issues:22

dbt-date

Date-related macros for dbt

Language:Shell | License:Apache-2.0 | Stargazers:217 | Issues:4 | Issues:48

lea

🏃‍♀️ Minimalist alternative to dbt

Language:Python | License:Apache-2.0 | Stargazers:206 | Issues:3 | Issues:23

yato

The smallest DuckDB SQL orchestrator on Earth.

Language:Python | License:MIT | Stargazers:160 | Issues:5 | Issues:3

sqloxide

Python bindings for sqlparser-rs

Language:Rust | License:MIT | Stargazers:153 | Issues:3 | Issues:14

dbt-superset-lineage

Make dbt docs and Apache Superset talk to one another

Language:Python | License:MIT | Stargazers:133 | Issues:5 | Issues:9

pypi-duck-flow

End-to-end data engineering project to get insights from PyPI using Python, DuckDB, MotherDuck & Evidence

Language:Python | Stargazers:127 | Issues:5 | Issues:0

duckdb_delta

DuckDB extension for Delta Lake

Language:C++ | License:MIT | Stargazers:123 | Issues:5 | Issues:48

fakesnow

Fake Snowflake Connector for Python. Run, mock and test Snowflake DB locally.

Language:Python | License:Apache-2.0 | Stargazers:100 | Issues:3 | Issues:39

ghost

Ghost 👻 is an experimental CLI that uses AI to generate GitHub Actions workflows, using OpenAI

Language:Go | License:MIT | Stargazers:92 | Issues:4 | Issues:10

saneql

Prototype compiler from SaneQL to SQL

Language:C++ | License:BSD-3-Clause | Stargazers:68 | Issues:4 | Issues:0

octocatalog

Nicely modeled data built on the GitHub Archive.

dbt-ibis

Write your dbt models using Ibis

Language:Python | License:Apache-2.0 | Stargazers:47 | Issues:3 | Issues:10

syft

Analytics event modeling framework in Typescript

Language:TypeScript | License:Apache-2.0 | Stargazers:47 | Issues:2 | Issues:1

fst

fst: flow state tool | smooth where you want it, friction where you need it when data engineering

Language:Python | License:Apache-2.0 | Stargazers:31 | Issues:4 | Issues:15

duckdb_mysql_scanner

DuckDB extension for MySQL

Language:C++ | License:GPL-3.0 | Stargazers:15 | Issues:1 | Issues:0

Language:Jupyter Notebook | License:NOASSERTION | Stargazers:14 | Issues:0 | Issues:0

dbt-duckdb-utils

Utility functions for dbt projects running on DuckDB

License:MIT | Stargazers:5 | Issues:1 | Issues:0

pytest-dbt-postgres

Unit test dbt Postgres projects

Language:Python | Stargazers:3 | Issues:1 | Issues:0