Luca Pugliese's starred repositories
chatgpt-retrieval-plugin
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
haystack
LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
ydata-profiling
One-line-of-code data quality profiling and exploratory data analysis for Pandas and Spark DataFrames.
auto-sklearn
Automated Machine Learning with scikit-learn
pre-commit-hooks
Some out-of-the-box hooks for pre-commit
yellowbrick
Visual analysis and diagnostic tools to facilitate machine learning model selection.
Tools-to-Design-or-Visualize-Architecture-of-Neural-Network
Tools to design or visualize the architecture of neural networks
h2o-llmstudio
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://h2oai.github.io/h2o-llmstudio/
safetensors
Simple, safe way to store and distribute tensors
awesome-ai-devtools
Curated list of AI-powered developer tools.
tinyvector
A tiny nearest-neighbor embedding database built with SQLite and PyTorch. (In development!)
fpp3-python-readalong
Python-centered read-along of Forecasting: Principles and Practice
cartography
Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics
analyze_github_feed
Create a local dashboard to visualize and filter your GitHub feed
distributed-task-queue
Distributed task queue using Celery