Luis Maria Morales Alonso's starred repositories

quokka

Making data lake work for time series

Language:PythonLicense:Apache-2.0Stargazers:1101Issues:0Issues:0

sidewinder

Python (asyncio) Distributed Database!

Language:PythonLicense:MITStargazers:31Issues:0Issues:0

ray-sql

Distributed SQL Query Engine in Python using Ray

Language:RustLicense:Apache-2.0Stargazers:217Issues:0Issues:0

puffin

Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg

License:MITStargazers:292Issues:0Issues:0

malloy

Malloy is an experimental language for describing data relationships and transformations.

Language:TypeScriptLicense:MITStargazers:1914Issues:0Issues:0

aws-sdk-pandas

pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).

Language:PythonLicense:Apache-2.0Stargazers:3844Issues:0Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:128567Issues:0Issues:0
Language:HTMLStargazers:1Issues:0Issues:0

petastorm

Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

SAnD

[Implementation example] Attend and Diagnose: Clinical Time Series Analysis Using Attention Models

License:MITStargazers:1Issues:0Issues:0

netty

Netty project - an event-driven asynchronous network application framework

Language:JavaLicense:Apache-2.0Stargazers:33056Issues:0Issues:0

storm

Distributed and fault-tolerant realtime computation: stream processing, continuous computation, distributed RPC, and more

Language:JavaLicense:Apache-2.0Stargazers:8847Issues:0Issues:0