Soumojit Ghosh's repositories
airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
argilla
✨ Open-source tool for data-centric NLP. Argilla helps domain experts and data teams to build better NLP datasets in less time.
Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
mage-ai
🧙 Mage is an open-source tool for building and running data pipelines that transform your data.
metaflow
:rocket: Build and manage real-life data science projects with ease!
mlflow
Open source platform for the machine learning lifecycle
posthog
🦔 PostHog provides open-source product analytics, session recording, feature flagging and a/b testing that you can self-host.
prefect
The easiest way to coordinate your dataflow
awesome-rust
A curated list of Rust code and resources.
beanie
Asynchronous Python ODM for MongoDB
casbin
An authorization library that supports access control models like ACL, RBAC, ABAC in Golang: https://discord.gg/S5UjpzGZjN
cd1898-Observing-Cloud-Resources
Project for cd1898, Course 1 of nd087
click
Python composable command line interface toolkit
dgs-framework
GraphQL for Java with Spring Boot made easy.
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
gluonts
Probabilistic time series modeling in Python
lucene
Apache Lucene open-source search software
metricflow
MetricFlow allows you to define, build, and maintain metrics in code.
modin
Modin: Scale your Pandas workflows by changing a single line of code
OpenSearch
🔎 Open source distributed and RESTful search engine.
ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.
rustlings
:crab: Small exercises to get you used to reading and writing Rust code!
skypilot
SkyPilot is a framework for easily running machine learning workloads on any cloud through a unified interface.
streamlit
Streamlit — The fastest way to build data apps in Python
the-algorithm
Source code for Twitter's Recommendation Algorithm
tigerbeetle
The distributed financial accounting database designed for mission critical safety and performance.
trio
Trio – a friendly Python library for async concurrency and I/O
twitter-server
Twitter-Server defines a template from which services at Twitter are built
typer
Typer, build great CLIs. Easy to code. Based on Python type hints.