Myle Ott (myleott)

Location: New York, NY

Home Page: http://myleott.com

Myle Ott's starred repositories

infinibatch

Efficient, check-pointed data loading for deep learning with massive data sets.

Language: Python | License: MIT | Stargazers: 201 | Issues: 0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python | License: Apache-2.0 | Stargazers: 34034 | Issues: 0

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Language: Python | License: Apache-2.0 | Stargazers: 32251 | Issues: 0

evaluation-of-nmt-bt

This repository contains additional reference translations for the WMT'14 En-De (newstest2014) and WMT'19 En-Ru (newstest2019) test sets as described in the paper: "On The Evaluation of Machine Translation Systems Trained With Back-Translation" https://arxiv.org/abs/1908.05204

Language: Ruby | License: NOASSERTION | Stargazers: 14 | Issues: 0

gpt-3

GPT-3: Language Models are Few-Shot Learners

Stargazers: 15646 | Issues: 0

stochastic_gradient_push

Stochastic Gradient Push for Distributed Deep Learning

Language: Python | License: NOASSERTION | Stargazers: 157 | Issues: 0

datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Language: Python | License: Apache-2.0 | Stargazers: 18807 | Issues: 0

torchgpipe

A GPipe implementation in PyTorch

Language: Python | License: BSD-3-Clause | Stargazers: 790 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 744 | Issues: 0

pubmed_parser

📋 A Python Parser for PubMed Open-Access XML Subset and MEDLINE XML Dataset

Language: Python | License: MIT | Stargazers: 566 | Issues: 0

BLUE_Benchmark

BLUE benchmark consists of five different biomedicine text-mining tasks with ten corpora.

Language: Python | License: NOASSERTION | Stargazers: 283 | Issues: 0

electra

ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators

Language: Python | License: Apache-2.0 | Stargazers: 2314 | Issues: 0

pytorch-OpCounter

Count the MACs / FLOPs of your PyTorch model.

Language: Python | License: MIT | Stargazers: 4795 | Issues: 0

faiss

A library for efficient similarity search and clustering of dense vectors.

Language: C++ | License: MIT | Stargazers: 29708 | Issues: 0

flipy

A Python linear programming interface library

Language: Python | License: Apache-2.0 | Stargazers: 25 | Issues: 0

xla

Enabling PyTorch on XLA Devices (e.g. Google TPU)

Language: C++ | License: NOASSERTION | Stargazers: 2391 | Issues: 0

espresso

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

Language: Python | License: NOASSERTION | Stargazers: 943 | Issues: 0

umberto

UmBERTo: an Italian Language Model trained with Whole Word Masking.

Language: Python | License: MIT | Stargazers: 100 | Issues: 0

tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Language: Rust | License: Apache-2.0 | Stargazers: 8746 | Issues: 0

inflect

Correctly generate plurals, ordinals, indefinite articles; convert numbers to words

Language: Python | License: MIT | Stargazers: 940 | Issues: 0

JuICe

Code for generating the JuICe dataset.

Language: Python | Stargazers: 37 | Issues: 0

vizseq

An Analysis Toolkit for Natural Language Generation (Translation, Captioning, Summarization, etc.)

Language: Python | License: MIT | Stargazers: 437 | Issues: 0

ClassyVision

An end-to-end PyTorch framework for image and video classification

Language: Python | License: MIT | Stargazers: 1589 | Issues: 0

hydra

Hydra is a framework for elegantly configuring complex applications

Language: Python | License: MIT | Stargazers: 8451 | Issues: 0

cc_net

Tools to download and cleanup Common Crawl data

Language: Python | License: MIT | Stargazers: 944 | Issues: 0

MASS

MASS: Masked Sequence to Sequence Pre-training for Language Generation

Language: Python | License: NOASSERTION | Stargazers: 1114 | Issues: 0

RAdam

On the Variance of the Adaptive Learning Rate and Beyond

Language: Python | License: Apache-2.0 | Stargazers: 2532 | Issues: 0

fastBPE

Fast BPE

Language: C++ | License: MIT | Stargazers: 652 | Issues: 0

ELI5

Scripts and links to recreate the ELI5 dataset.

Language: Python | License: NOASSERTION | Stargazers: 316 | Issues: 0

pytorch-lightning

Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.

Language: Python | License: Apache-2.0 | Stargazers: 27597 | Issues: 0