sdtblck's repositories

youtube_subtitle_dataset

YT_subtitles - extracts subtitles from YouTube videos to raw text for Language Model training

Opensubtitles_dataset

downloads and parses subtitle dataset from opensubtitles.org

Language:PythonStargazers:14Issues:2Issues:0

stylegan2

StyleGAN2 - Official TensorFlow Implementation

Language:PythonLicense:NOASSERTIONStargazers:12Issues:2Issues:0

PDFextract

Extracting pdfs using pdfminer.six and pyPDF2

lm_dataloader

Dataloader tools for language modelling

Language:PythonLicense:MITStargazers:5Issues:3Issues:0

image-dl

A fast and simple image downloader in python

Language:PythonLicense:MITStargazers:1Issues:2Issues:0

pbar-pool

A straightforward, dependency free way to update multiple progress bars with python's multiprocessing library.

Language:PythonLicense:MITStargazers:1Issues:2Issues:0

tputils

Utilities for TPUs

Language:PythonLicense:MITStargazers:1Issues:2Issues:0

benchmarking

Tools for benchmarking clusters

Language:PythonStargazers:0Issues:2Issues:0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

example-mkdocs-basic

A basic MkDocs project for Read the Docs

Language:PythonStargazers:0Issues:0Issues:0

example-sphinx-basic

A basic Sphinx project for Read the Docs

Language:PythonStargazers:0Issues:0Issues:0

fish

An independent replication of `Training Neural Networks with Fixed Sparse Masks` by Sung et al.

Language:PythonStargazers:0Issues:2Issues:2

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

guesslang

Detect the programming language of a source code

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Language:PythonLicense:MITStargazers:0Issues:2Issues:0

Megatron-LM

Ongoing research training transformer models at scale

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

mesh

Mesh TensorFlow: Model Parallelism Made Easier

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

mesh-transformer-jax

Model parallel transformers in JAX and Haiku

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

mojo

The Mojo Programming Language

Stargazers:0Issues:0Issues:0

mup

maximal update parametrization (µP)

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0
Language:DockerfileStargazers:0Issues:0Issues:0

RealFakeAugment

Image augmentation functions for GAN training

Language:PythonStargazers:0Issues:2Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

transformers-bloom-inference

Fast Inference Solutions for BLOOM

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Yandex-Image-Scraper

some tools for scraping images from yandex image search

Language:PythonStargazers:0Issues:2Issues:0