Nick's repositories
arrow-datafusion
Apache Arrow DataFusion SQL Query Engine
arrow-datafusion-python
Apache Arrow DataFusion Python Bindings
Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
aws-custom-credential-provider
A custom AWS credential provider that allows your Hadoop or Spark application to access the S3 file system by assuming a role
cria
Tiny inference-only implementation of LLaMA
delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive, and APIs for Scala, Java, Rust, Ruby, and Python.
delta-examples
Delta Lake examples
delta-rs
A native Rust library for Delta Lake, with bindings into Python
spark
Apache Spark - A unified analytics engine for large-scale data processing
staged-recipes
A place to submit conda recipes before they become fully fledged conda-forge feedstocks
Dataset
News: the 4k dataset is ready for download.
dspy
Stanford DSPy: The framework for programming with foundation models
Flowise
Drag & drop UI to build your customized LLM flow using LangchainJS
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
lst-bench
LST-Bench is a framework that allows users to run benchmarks specifically designed for evaluating Log-Structured Tables (LSTs) such as Delta Lake, Apache Hudi, and Apache Iceberg.
moondream
Tiny vision language model
NeMo-Guardrails
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
openplayground
An LLM playground you can run on your laptop
privateGPT
Interact with your documents using the power of GPT, 100% privately, with no data leaks
pyicloud
A Python + iCloud wrapper to access iPhone and Calendar data.
rawdog
Generate and auto-execute Python scripts in the CLI
super-json-mode
Low latency JSON generation using LLMs ⚡️
tidb
TiDB is an open-source, cloud-native, distributed, MySQL-compatible database for elastic scale and real-time analytics. Try AI-powered Chat2Query free at: https://tidbcloud.com/free-trial
tigerbeetle
The distributed financial transactions database designed for mission critical safety and performance.
webgpu-torch
Tensor computation with WebGPU acceleration
WebODM
User-friendly, commercial-grade software for processing aerial imagery. 🛩