Jordan T Bates (jtbates)

jtbates

Geek Repo

Company:@liftoffio

Location:Washington

Github PK Tool:Github PK Tool


Organizations
dssg

Jordan T Bates's starred repositories

lemmy

🐀 A link aggregator and forum for the fediverse

Language:RustLicense:AGPL-3.0Stargazers:12995Issues:114Issues:2745

prql

PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement

Language:RustLicense:Apache-2.0Stargazers:9664Issues:45Issues:977

vowpal_wabbit

Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.

Language:C++License:NOASSERTIONStargazers:8451Issues:353Issues:1267

delta

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Language:ScalaLicense:Apache-2.0Stargazers:7313Issues:219Issues:1449

iceberg

Apache Iceberg

Language:JavaLicense:Apache-2.0Stargazers:5962Issues:156Issues:3278

statsforecast

Lightning ⚡️ fast forecasting with statistical and econometric models.

Language:PythonLicense:Apache-2.0Stargazers:3763Issues:38Issues:317

SentEval

A python tool for evaluating the quality of sentence embeddings.

Language:PythonLicense:NOASSERTIONStargazers:2071Issues:47Issues:58

Cream

This is a collection of our NAS and Vision Transformer work.

Language:PythonLicense:MITStargazers:1626Issues:36Issues:156

solo-learn

solo-learn: a library of self-supervised methods for visual representation learning powered by Pytorch Lightning

Language:PythonLicense:MITStargazers:1392Issues:11Issues:159

Transformers4Rec

Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation and works with PyTorch.

Language:PythonLicense:Apache-2.0Stargazers:1067Issues:26Issues:408

rank_bm25

A Collection of BM25 Algorithms in Python

Language:PythonLicense:Apache-2.0Stargazers:930Issues:10Issues:31

gnn-model-explainer

gnn explainer

Language:PythonLicense:Apache-2.0Stargazers:835Issues:22Issues:30

python-deequ

Python API for Deequ

Language:PythonLicense:Apache-2.0Stargazers:677Issues:18Issues:157

vicreg

VICReg official code base

Language:PythonLicense:MITStargazers:506Issues:8Issues:23

self_supervised

A Pytorch-Lightning implementation of self-supervised algorithms

Language:PythonLicense:MITStargazers:502Issues:12Issues:13

tevatron

Tevatron - A flexible toolkit for neural retrieval research and development.

Language:PythonLicense:Apache-2.0Stargazers:443Issues:10Issues:87

decagon

Graph convolutional neural network for multirelational link prediction

Language:Jupyter NotebookLicense:MITStargazers:442Issues:24Issues:16

ANCE

A novel embedding training algorithm leveraging ANN search and achieved SOTA retrieval on Trec DL 2019 and OpenQA benchmarks

Language:PythonLicense:MITStargazers:354Issues:11Issues:15

gpl

Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577

Language:PythonLicense:Apache-2.0Stargazers:315Issues:6Issues:31

MPNet

MPNet: Masked and Permuted Pre-training for Language Understanding https://arxiv.org/pdf/2004.09297.pdf

Language:PythonLicense:MITStargazers:286Issues:13Issues:20

ditto

Code for the paper "Deep Entity Matching with Pre-trained Language Models"

Language:PythonLicense:Apache-2.0Stargazers:250Issues:6Issues:24

InfoGraph

Official code for "InfoGraph: Unsupervised and Semi-supervised Graph-Level Representation Learning via Mutual Information Maximization" (ICLR 2020, spotlight)

GRAND

Source code and dataset of the NeurIPS 2020 paper "Graph Random Neural Network for Semi-Supervised Learning on Graphs"

Language:PythonLicense:MITStargazers:202Issues:5Issues:13

dviz-course

Data visualization course material

Language:TeXLicense:MITStargazers:135Issues:11Issues:7

Neural-Attentive-Session-Based-Recommendation-PyTorch

A PyTorch implementation of Neural Attentive Session Based Recommendation (NARM)

Language:PythonLicense:GPL-3.0Stargazers:103Issues:6Issues:1

oli-torus

Next Generation OLI Authoring and Delivery Platform

Language:ElixirLicense:MITStargazers:79Issues:16Issues:1548

evalRS-CIKM-2022

Official Repository for EvalRS @ CIKM 2022: a Rounded Evaluation of Recommender Systems

Language:Jupyter NotebookLicense:MITStargazers:67Issues:7Issues:4

DatosMex

Proyecto de Python para interactuar con las bases de datos abiertas en México

Language:PythonStargazers:39Issues:3Issues:0

CEDS-Data-Warehouse-Parquet

The Common Education Data Standards (CEDS) Data Warehouse Parquet (DW Parquet) standard is designed for data engineering and data science needs in the cloud. The DW Parquet Models mirror the SQL-based CEDS Data Warehouse. Parquet files are designed for rapid and distributed reporting across multiple technology stacks, data processing and BI tools, and are cloud vendor agnostic. This standard is ideal for stakeholders implementing reporting structures in a data lake environment.

Language:PythonLicense:Apache-2.0Stargazers:10Issues:7Issues:0

GPT-GNN

Code for KDD'20 "Generative Pre-Training of Graph Neural Networks"

Language:PythonLicense:MITStargazers:1Issues:0Issues:0