anirudhacharya

Anirudh's repositories

aeppl

Tools for an Aesara-based PPL.

Language: Python | License: MIT | 010

aesara

Theano-PyMC is a fork of the Theano library maintained by the PyMC developers

Language: Python | License: NOASSERTION | 010

airflow

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Language: Python | License: Apache-2.0 | 010

ann-benchmarks

Benchmarks of approximate nearest neighbor libraries in Python

Language: Python | License: MIT | 010

arrow

Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.

Language: C++ | License: Apache-2.0 | 010

arrow-datafusion-comet

Apache Arrow DataFusion Comet Spark Accelerator

Language: Rust | License: Apache-2.0 | 000

arrow-rs

Official Rust implementation of Apache Arrow

Language: Rust | License: Apache-2.0 | 010

datafusion

Apache DataFusion SQL Query Engine

Language: Rust | License: Apache-2.0 | 000

datasketch

MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble

Language: Python | License: MIT | 010

datasketches-cpp

Core C++ Sketch Library

Language: C++ | License: Apache-2.0 | 010

datasketches-java

Core Sketch Library.

Language: Java | License: Apache-2.0 | 020

datafusion-ballista

Apache Arrow Ballista Distributed Query Engine

License: Apache-2.0 | 000

datasketches-python

Apache DataSketches

License: Apache-2.0 | 000

dowhy

DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.

Language: Python | License: MIT | 010

incubator-horaedb

HoraeDB is a high-performance, distributed, cloud native time-series database.

Language: Rust | License: Apache-2.0 | 000

instructor

Structured outputs for LLMs

Language: Python | License: MIT | 000

LightGBM

A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.

Language: C++ | License: MIT | 010

luminal

Deep learning at the speed of light.

License: Apache-2.0 | 000

manticoresearch

Easy to use open source fast database for search | Good alternative to Elasticsearch now | Drop-in replacement for E in the ELK soon

Language: C++ | License: GPL-2.0 | 000

outlines

Structured Text Generation

Language: Python | License: Apache-2.0 | 000

oxide-enzyme

Enzyme integration into Rust. Experimental, do not use.

Language: Rust | License: Apache-2.0 | 010

pinot

Apache Pinot (Incubating) - A realtime distributed OLAP datastore

Language: Java | License: Apache-2.0 | 010

pyo3

Rust bindings for the Python interpreter

Language: Rust | License: NOASSERTION | 010

PySyft

data science on data without acquiring a copy

Language: Python | License: Apache-2.0 | 010

qdrant

Qdrant - Vector Search Engine and Database for the next generation of AI applications. Also available in the cloud https://qdrant.to/cloud

Language: Rust | License: Apache-2.0 | 010

ruff

An extremely fast Python linter and code formatter, written in Rust.

Language: Rust | License: MIT | 010

toydb

Distributed SQL database in Rust, written as a learning project

Language: Rust | License: Apache-2.0 | 010

vowpal_wabbit

Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.

Language: C++ | License: NOASSERTION | 010

xgboost

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

Language: C++ | License: Apache-2.0 | 010

zed

Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.

Language: Rust | License: NOASSERTION | 000