Michel SEBAG's repositories
starrocks
StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries. InfoWorld’s 2023 BOSSIE Award for best open source software.
entity-resolution
Entity resolution, also known as Data Matching or Record linkage
autogen
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
prompt2model
prompt2model - Generate Deployable Models from Natural Language Instructions
trubrics-sdk
The first user analytics platform for AI models
soda-core
:zap: Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
qdrant
Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
spotlight
Interactively explore unstructured datasets from your dataframe.
genai-stack
Langchain + Docker + Neo4j
autolabel
Label, clean and enrich text datasets with LLMs.
openvino
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
LLMs-as-Zero-Shot-Conversational-RecSys
Evaluation data, LLMs query code and results for "Large Language Models as Zero-Shot Conversational Recommenders" on CIKM 2023.
statsforecast
Lightning ⚡️ fast forecasting with statistical and econometric models.
diagrams
:art: Diagram as Code for prototyping cloud system architectures
elevenlabs-python
The official Python API for ElevenLabs text-to-speech.
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
neovim
Vim-fork focused on extensibility and usability
pandera
A light-weight, flexible, and expressive statistical data testing library
pydata-book
Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media
alpa
Training and serving large-scale neural networks
memgraph
Open-source graph database, built for real-time streaming data, compatible with Neo4j.
doris
Apache Doris is an easy-to-use, high performance and unified analytics database.
pygwalker
PyGWalker: Turn your pandas dataframe into a Tableau-style User Interface for visual analysis
pandas-ai
Pandas AI is a Python library that integrates generative artificial intelligence capabilities into Pandas, making dataframes conversational
phoenix
ML Observability in a Notebook - Uncover Insights, Surface Problems, Monitor, and Fine Tune your Generative LLM, CV and Tabular Models
mlops-zoomcamp
Free MLOps course from DataTalks.Club
python_secrets
Python CLI for managing secrets and eliminating default passwords in FOSS
pycaret-book
Repository for the book Simplifying Machine Learning with PyCaret.