ehud baumatz's starred repositories

jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Language:PythonLicense:Apache-2.0Stargazers:27875Issues:321Issues:5098

shap

A game theoretic approach to explain the output of any machine learning model.

Language:Jupyter NotebookLicense:MITStargazers:21580Issues:241Issues:2436

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonLicense:MITStargazers:18291Issues:293Issues:1276

scalene

Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals

Language:PythonLicense:Apache-2.0Stargazers:11144Issues:90Issues:448

pycaret

An open-source, low-code machine learning library in Python

Language:Jupyter NotebookLicense:MITStargazers:8394Issues:131Issues:2267

imbalanced-learn

A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning

Language:PythonLicense:MITStargazers:6688Issues:141Issues:579

x-transformers

A simple but complete full-attention transformer with a set of promising experimental features from various papers

Language:PythonLicense:MITStargazers:4113Issues:51Issues:191

deepchecks

Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and models from research to production.

Language:PythonLicense:NOASSERTIONStargazers:3340Issues:18Issues:972

SimCSE

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Language:PythonLicense:MITStargazers:3236Issues:27Issues:262

texthero

Text preprocessing, representation and visualization from zero to hero.

Language:PythonLicense:MITStargazers:2865Issues:42Issues:119

mimic-code

MIMIC Code Repository: Code shared by the research community for the MIMIC family of databases

Language:Jupyter NotebookLicense:MITStargazers:2310Issues:134Issues:1164

flow-forecast

Deep learning PyTorch library for time series forecasting, classification, and anomaly detection (originally for flood forecasting).

Language:PythonLicense:GPL-3.0Stargazers:1881Issues:29Issues:195

tigramite

Tigramite is a python package for causal inference with a focus on time series data. The Tigramite documentation is at

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:1181Issues:39Issues:189

pytorch-ts

PyTorch based Probabilistic Time Series forecasting framework based on GluonTS backend

Language:PythonLicense:MITStargazers:1156Issues:24Issues:131

arxiv-sanity-lite

arxiv-sanity lite: tag arxiv papers of interest get recommendations of similar papers in a nice UI using SVMs over tfidf feature vectors based on paper abstracts.

Language:PythonLicense:MITStargazers:1081Issues:22Issues:7

data-centric-ai

Resources for Data Centric AI

Language:TeXLicense:Apache-2.0Stargazers:1056Issues:67Issues:6

PURE

[NAACL 2021] A Frustratingly Easy Approach for Entity and Relation Extraction https://arxiv.org/abs/2010.12812

Language:PythonLicense:MITStargazers:764Issues:13Issues:63

Multimodal-Transformer

[ACL'19] [PyTorch] Multimodal Transformer

Language:PythonLicense:MITStargazers:751Issues:14Issues:48

Multimodal-Toolkit

Multimodal model for text and tabular data with HuggingFace transformers as building block for text data

Language:PythonLicense:Apache-2.0Stargazers:554Issues:26Issues:50

pecos

PECOS - Prediction for Enormous and Correlated Spaces

Language:PythonLicense:Apache-2.0Stargazers:489Issues:20Issues:79

rebel

REBEL is a seq2seq model that simplifies Relation Extraction (EMNLP 2021).

sematch

semantic similarity framework for knowledge graph

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:421Issues:71Issues:34

Satellite-Imagery-Datasets-Containing-Ships

A list of radar and optical satellite datasets for ship detection, classification, semantic segmentation and instance segmentation tasks.

Video-to-Retail-Platform

An intelligent multimodal-learning based system for video, product and ads analysis. Based on the system, people can build a lot of downstream applications such as product recommendation, video retrieval, etc.

Language:PythonLicense:Apache-2.0Stargazers:140Issues:15Issues:9
Language:Jupyter NotebookLicense:CC-BY-SA-4.0Stargazers:117Issues:19Issues:2

Self-Tuning

Code release for "Self-Tuning for Data-Efficient Deep Learning" (ICML 2021)

X-BERT

X-BERT: eXtreme Multi-label Text Classification with BERT

Language:C++License:BSD-3-ClauseStargazers:51Issues:0Issues:0

table-transformer

CVPR 2022: Table Structure Recognition

Language:PythonLicense:MITStargazers:41Issues:3Issues:0

ChatGPT-API

Implements ChatGPT API via request package.

Language:PythonLicense:MITStargazers:32Issues:1Issues:5