Joan Fontanals's repositories
elasticsearch
Free and Open Source, Distributed, RESTful Search Engine
beta9
Run serverless GPU workloads with fast cold starts on bare-metal servers, anywhere in the world
bm25s
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
bustub
The BusTub Relational Database Management System (Educational)
candle
Minimalist ML framework for Rust
CLIP_benchmark
CLIP-like model evaluation
cog
Containers for machine learning
ColBERT
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
docs-1
Documentation for Redis, Redis Cloud, and Redis Enterprise
elasticsearch-labs
Notebooks & Example Apps for Search & AI Applications with Elasticsearch
examples-1
Examples for beam.cloud
GCL
Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained contrastive learning framework.
haystack-core-integrations
Additional packages (components, document stores and the likes) to extend the capabilities of Haystack version 2.0 and onwards
limbo
Limbo is a work-in-progress, in-process OLTP database management system, compatible with SQLite.
llama.cpp
LLM inference in C/C++
llama_index
LlamaIndex (formerly GPT Index) is a data framework for your LLM applications
milvus-model
The embedding/reranking model zoo help user to convert their unstructured data into embeedings
mteb
MTEB: Massive Text Embedding Benchmark
ollama
Get up and running with Llama 3, Mistral, Gemma, and other large language models.
redis
Redis is an in-memory database that persists on disk. The data model is key-value, but many different kind of values are supported: Strings, Lists, Sets, Sorted Sets, Hashes, Streams, HyperLogLogs, Bitmaps.
RediSearch
A query and indexing engine for Redis, providing secondary indexing, full-text search, vector similarity search and aggregations.
RedisJSON
RedisJSON - a JSON data type for Redis
sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
tokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
weaviate
Weaviate is an open source vector database that stores both objects and vectors, allowing for combining vector search with structured filtering with the fault-tolerance and scalability of a cloud-native database, all accessible through GraphQL, REST, and various language clients.