MLE's repositories
amundsen
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
CLIP
Contrastive Language-Image Pretraining
comdb2
Bloomberg's distributed RDBMS
coral
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
dagster
A data orchestrator for machine learning, analytics, and ETL.
DALI
A library containing both highly optimized building blocks and an execution engine for data pre-processing in deep learning applications
elasticsearch
Open Source, Distributed, RESTful Search Engine
elasticsearch-learning-to-rank
Plugin to integrate Learning to Rank (aka machine learning for better relevance) with Elasticsearch
flashlight
A C++ standalone library for machine learning
hello-nlp
A natural language search microservice
hover
Human-Oriented Visual ExploRation
jieba
结巴中文分词
keops
KErnel OPerationS, on CPUs and GPUs, with autodiff and without memory overflows
knowledge-repo
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
locust
Scalable user load testing tool written in Python
MK-SQuIT
Synthesizing Questions using Iterative Template-Filling
nmslib
Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-metric spaces.
open-llms
A list of open LLMs available for commercial use
prophet
Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
pynndescent
A Python nearest neighbor descent for approximate nearest neighbors
pytextrank
Python implementation of TextRank for phrase extraction and summarization of text documents
pytrends
Pseudo API for Google Trends
seldon-core
An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models
stylegan2-pytorch
Simplest working implementation of Stylegan2, state of the art generative adversarial network, in Pytorch. Enabling everyone to experience disentanglement
tech-talks
This repository contains the notebooks and presentations we use for our Databricks Tech Talks
wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit
wave
Realtime Web Apps and Dashboards for Python
wikiextractor
A tool for extracting plain text from Wikipedia dumps