Bastien Boutonnet (bastienboutonnet)

Company: @sodadata

Location: Amsterdam, The Netherlands

Home Page: http://www.bastienboutonnet.com

Twitter: @B_superhero


Organizations
bitpicky

Bastien Boutonnet's starred repositories

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language: Python | License: MIT | Stargazers: 8612 | Issues: 0

Verba

Retrieval Augmented Generation (RAG) chatbot powered by Weaviate

Language: Python | License: BSD-3-Clause | Stargazers: 4736 | Issues: 0

JsonGenius

Get structured JSON data from any page.

Language: Go | License: Apache-2.0 | Stargazers: 163 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 11 | Issues: 0

fern

Input OpenAPI. Output SDKs and Docs.

Language: TypeScript | License: MIT | Stargazers: 2435 | Issues: 0

kaguya

A ChatGPT plugin that allows you to load and edit your local files in a controlled way, as well as run any Python, JavaScript, and bash script.

Language: JavaScript | License: MIT | Stargazers: 1198 | Issues: 0

mlops-python-package

Kickstart your MLOps initiative with a flexible, robust, and productive Python package.

Language: Jupyter Notebook | License: CC-BY-4.0 | Stargazers: 372 | Issues: 0

spacy-streamlit

👑 spaCy building blocks and visualizers for Streamlit apps

Language: Python | License: MIT | Stargazers: 776 | Issues: 0

sqlmesh

Efficient data transformation and modeling framework that is backwards compatible with dbt.

Language: Python | License: Apache-2.0 | Stargazers: 1448 | Issues: 0

sqlglot

Python SQL Parser and Transpiler

Language: Python | License: MIT | Stargazers: 5921 | Issues: 0

knowledge_gpt

Accurate answers and instant citations for your documents.

Language: Python | License: MIT | Stargazers: 1546 | Issues: 0

alpa

Training and serving large-scale neural networks with auto parallelization.

Language: Python | License: Apache-2.0 | Stargazers: 3005 | Issues: 0

private-gpt

Interact with your documents using the power of GPT, 100% privately, no data leaks

Language: Python | License: Apache-2.0 | Stargazers: 52761 | Issues: 0

the-algorithm

Source code for Twitter's Recommendation Algorithm

Language: Scala | License: AGPL-3.0 | Stargazers: 61628 | Issues: 0

koboldcpp

A simple one-file way to run various GGML and GGUF models with KoboldAI's UI

Language: C++ | License: AGPL-3.0 | Stargazers: 4273 | Issues: 0

typst

A new markup-based typesetting system that is powerful and easy to learn.

Language: Rust | License: Apache-2.0 | Stargazers: 29629 | Issues: 0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language: Python | License: Apache-2.0 | Stargazers: 29064 | Issues: 0

yobulkdev

🔥 🔥 🔥 Open Source & AI-driven Data Onboarding Platform: free flatfile.com alternative

Language: JavaScript | License: AGPL-3.0 | Stargazers: 860 | Issues: 0

duke-data-science

All of my work and lecture notes from my ongoing time spent studying Data Science and Political Science at Duke that I can share without violating academic terms.

Language: TeX | Stargazers: 4 | Issues: 0

cloudquery

The open source high performance ELT framework powered by Apache Arrow

Language: Go | License: MPL-2.0 | Stargazers: 5673 | Issues: 0

pygwalker

PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis

Language: Python | License: Apache-2.0 | Stargazers: 10582 | Issues: 0

LazyVim

Neovim config for the lazy

Language: Lua | License: Apache-2.0 | Stargazers: 15073 | Issues: 0

dbt-codegen

Macros that generate dbt code

Language: Makefile | License: Apache-2.0 | Stargazers: 442 | Issues: 0

manage-fastapi

🚀 CLI tool for FastAPI. Generating new FastAPI projects & boilerplates made easy.

Language: Python | License: MIT | Stargazers: 1621 | Issues: 0

data-diff

Compare tables within or across databases

Language: Python | License: MIT | Stargazers: 2908 | Issues: 0

dss-plugin-model-drift

Model drift detection

Language: Python | Stargazers: 11 | Issues: 0

pyemd

Fast EMD for Python: a wrapper for Pele and Werman's C++ implementation of the Earth Mover's Distance metric

Language: C++ | License: MIT | Stargazers: 475 | Issues: 0

elementary

The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

Language: HTML | License: Apache-2.0 | Stargazers: 1789 | Issues: 0

minos-python

🐍 Minos is a framework that helps you create reactive microservices in Python

Language: Python | License: MIT | Stargazers: 463 | Issues: 0

midarr-server

🔥 Midarr, the minimal lightweight media server.

Language: Elixir | License: MIT | Stargazers: 1201 | Issues: 0