Luca Pugliese (lucapug)

Location: Salerno, Italy

Luca Pugliese's starred repositories

ruff

An extremely fast Python linter and code formatter, written in Rust.

Language: Rust | License: MIT | Stargazers: 27513 | Issues: 74 | Issues: 4429

chatgpt-retrieval-plugin

The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

Language: Python | License: MIT | Stargazers: 20909 | Issues: 326 | Issues: 221

prophet

Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.

Language: Python | License: MIT | Stargazers: 17886 | Issues: 438 | Issues: 2121

haystack

LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

Language: Python | License: Apache-2.0 | Stargazers: 14066 | Issues: 126 | Issues: 3241

chroma

the AI-native open-source embedding database

Language: Rust | License: Apache-2.0 | Stargazers: 12794 | Issues: 78 | Issues: 985

ydata-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

Language: Python | License: MIT | Stargazers: 12125 | Issues: 151 | Issues: 779

OpenLLM

Run any open-source LLM, such as Llama 2 or Mistral, as an OpenAI-compatible API endpoint in the cloud.

Language: Python | License: Apache-2.0 | Stargazers: 9017 | Issues: 52 | Issues: 249

LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Language: Python | License: Apache-2.0 | Stargazers: 8057 | Issues: 72 | Issues: 388

pypdf

A pure-Python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files.

Language: Python | License: NOASSERTION | Stargazers: 7576 | Issues: 147 | Issues: 1073

auto-sklearn

Automated Machine Learning with scikit-learn

Language: Python | License: BSD-3-Clause | Stargazers: 7432 | Issues: 215 | Issues: 1016

kestra

Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.

Language: Java | License: Apache-2.0 | Stargazers: 6797 | Issues: 61 | Issues: 1687

pipreqs

pipreqs - Generate pip requirements.txt file based on imports of any project. Looking for maintainers to move this project forward.

Language: Python | License: Apache-2.0 | Stargazers: 5887 | Issues: 57 | Issues: 262

pre-commit-hooks

Some out-of-the-box hooks for pre-commit

Language: Python | License: MIT | Stargazers: 4938 | Issues: 45 | Issues: 468

yellowbrick

Visual analysis and diagnostic tools to facilitate machine learning model selection.

Language: Python | License: Apache-2.0 | Stargazers: 4216 | Issues: 105 | Issues: 693

Language: Jupyter Notebook | License: MIT | Stargazers: 4180 | Issues: 71 | Issues: 17

h2o-llmstudio

H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://h2oai.github.io/h2o-llmstudio/

Language: Python | License: Apache-2.0 | Stargazers: 3689 | Issues: 44 | Issues: 341

geemap

A Python package for interactive geospatial analysis and visualization with Google Earth Engine.

Language: Python | License: MIT | Stargazers: 3251 | Issues: 113 | Issues: 597

safetensors

Simple, safe way to store and distribute tensors

Language: Python | License: Apache-2.0 | Stargazers: 2505 | Issues: 40 | Issues: 157

cody

AI that knows your entire codebase

Language: TypeScript | License: Apache-2.0 | Stargazers: 2056 | Issues: 49 | Issues: 1492

deepdiff

DeepDiff: Deep Difference and search of any Python object/data. DeepHash: Hash of any object based on its contents. Delta: Use deltas to reconstruct objects by adding deltas together.

Language: Python | License: NOASSERTION | Stargazers: 1921 | Issues: 26 | Issues: 287

awesome-ai-devtools

Curated list of AI-powered developer tools.

tinyvector

A tiny nearest-neighbor embedding database built with SQLite and PyTorch. (In development!)

Language: Python | License: MIT | Stargazers: 770 | Issues: 10 | Issues: 10

fpp3-python-readalong

Python-centered read-along of Forecasting: Principles and Practice

Language: Jupyter Notebook | Stargazers: 428 | Issues: 7 | Issues: 3

cartography

Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 187 | Issues: 7 | Issues: 9

TSDB

Time Series Data Beans: a Python toolbox that loads 169 public time-series datasets for machine learning and deep learning with a single line of code.

Language: Python | License: BSD-3-Clause | Stargazers: 122 | Issues: 6 | Issues: 16

Language: HTML | License: Apache-2.0 | Stargazers: 63 | Issues: 0 | Issues: 0

analyze_github_feed

Create a local dashboard to visualize and filter your GitHub feed

Language: Python | Stargazers: 29 | Issues: 4 | Issues: 0

distributed-task-queue

Distributed task queue using Celery

Language: Python | License: MIT | Stargazers: 2 | Issues: 0 | Issues: 0