Silvestre Losada's repositories
DL_fundamentals
Videos from my YouTube channel about Deep Learning | Videos de mi canal de YouTube acerca de Fundamentos de Deep Learning
cursos-python
Cursos completos de IA dictados por Humai
sentence_similarity_semantic_search
sentence_similarity_semantic_search
BERTSimilar
Get Similar Words and Embeddings using BERT Models
fastrank
My most frequently used learning-to-rank algorithms ported to rust for efficiency. Try it: "pip install fastrank".
DAE_RNN_News_Recommendation
Refer to paper "Embedding-based News Recommendation for Millions of Users" & "Article De-duplication Using Distributed Representations" published by Yahoo Japan
k-NN
🆕 A machine learning plugin which supports an approximate k-NN search algorithm for Open Distro for Elasticsearch
solr-vector-scoring
Vector Plugin for Solr: calculate dot product / cosine similarity on documents
solr-ocrhighlighting
Highlighting various OCR formats directly in Solr
solr-ocrpayload-plugin
Efficient indexing and retrieval of OCR bounding boxes in Solr
wicked-charts
Beautiful and interactive javascript charts for Java-based web applications.
nlp-datasets
A list of datasets/corpora for NLP tasks, in reverse chronological order.
box-java-sdk
The Box SDK for Java.
CoordinateAscent
Python implementation of the Coordinate Ascent algorithm
siren-join
SIREn Plugin to add relational join capabilities to Elasticsearch
awesome-awesomeness
A curated list of awesome awesomeness
awesome-public-datasets
An awesome list of high-quality open datasets in public domains (on-going). By everyone, for everyone!
embedded-elasticsearch
Tool that ease up creation of integration tests with Elasticsearch
wpsolr-search-engine
bakup wpsolr
elasticsearch
Open Source, Distributed, RESTful Search Engine
lapdftext
LA-PDFText is a system for extracting accurate text from PDF-based research articles (and an interface to be able to improve performance where needed). The system is open-source and provides a simple baseline function for extracting text from primary research articles using rules that developers can customize. This means that the system works quite well for most applications (and might occasionally make mistakes and extract the wrong text), but it is always possible to 'hack' your own rules and improve performance.
lapdftext-original
Automatically exported from code.google.com/p/lapdftext
book
Taming Text Book Source Code
uimafit-spring-experiments
Attempt to marry two object-lifecycle containers to bring benefits for all
yodaqa
A Question Answering system built on top of the Apache UIMA framework.