Marianna's starred repositories

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language: Python | License: Apache-2.0 | Stargazers: 7507 | Issues: 0

gpt-fast

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Language: Python | License: BSD-3-Clause | Stargazers: 5456 | Issues: 0

bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Language: Python | License: MIT | Stargazers: 5918 | Issues: 0

cc-pyspark

Process Common Crawl data with Python and Spark

Language: Python | License: MIT | Stargazers: 400 | Issues: 0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Python | License: MIT | Stargazers: 66143 | Issues: 0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language: Python | License: Apache-2.0 | Stargazers: 24676 | Issues: 0

ESC-50

ESC-50: Dataset for Environmental Sound Classification

Language: Python | License: NOASSERTION | Stargazers: 1330 | Issues: 0

thinc

🔮 A refreshing functional take on deep learning, compatible with your favorite libraries

Language: Python | License: MIT | Stargazers: 2812 | Issues: 0

pytorch-lightning

Pretrain, finetune and deploy AI models on multiple GPUs and TPUs with zero code changes.

Language: Python | License: Apache-2.0 | Stargazers: 27779 | Issues: 0

openfold

Trainable, memory-efficient, and GPU-friendly PyTorch reproduction of AlphaFold 2

Language: Python | License: Apache-2.0 | Stargazers: 2678 | Issues: 0

torchdistx

Torch Distributed Experimental

Language: Python | License: BSD-3-Clause | Stargazers: 115 | Issues: 0

imagenette

A smaller subset of 10 easily classified classes from Imagenet, and a little more French

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 940 | Issues: 0

PhiFlow

A differentiable PDE solving framework for machine learning

Language: Python | License: MIT | Stargazers: 1391 | Issues: 0

Azimuth

Machine Learning-Based Predictive Modelling of CRISPR/Cas9 guide efficiency

Language: Python | License: BSD-3-Clause | Stargazers: 223 | Issues: 0

biomedical

Tools for curating biomedical training data for large-scale language modeling

Language: Python | Stargazers: 445 | Issues: 0

pyprobml

Python code for the "Probabilistic Machine Learning" book by Kevin Murphy

Language: Jupyter Notebook | License: MIT | Stargazers: 6417 | Issues: 0

twint

An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.

Language: Python | License: MIT | Stargazers: 15682 | Issues: 0

lux

Automatically visualize your pandas dataframe via a single print! 📊 💡

Language: Python | License: Apache-2.0 | Stargazers: 5126 | Issues: 0

lecun1989-repro

Reproducing the Yann LeCun 1989 paper "Backpropagation Applied to Handwritten Zip Code Recognition", to my knowledge the earliest real-world application of a neural net trained with backpropagation.

Language: Jupyter Notebook | License: MIT | Stargazers: 593 | Issues: 0

Auto-eD

A web-based tool for visualizing the forward and reverse modes of automatic differentiation

Language: Jupyter Notebook | License: GPL-3.0 | Stargazers: 16 | Issues: 0

mit-deep-learning-book-pdf

MIT Deep Learning Book in PDF format (complete and parts) by Ian Goodfellow, Yoshua Bengio and Aaron Courville

Language: Java | Stargazers: 12635 | Issues: 0

tensorflow

An Open Source Machine Learning Framework for Everyone

Language: C++ | License: Apache-2.0 | Stargazers: 184766 | Issues: 0

awesome-react

A collection of awesome things regarding the React ecosystem

Stargazers: 63921 | Issues: 0

traingenerator

🧙 A web app to generate template code for machine learning

Language: Python | License: MIT | Stargazers: 1357 | Issues: 0

awesome-machine-learning

A curated list of awesome Machine Learning frameworks, libraries and software.

Language: Python | License: NOASSERTION | Stargazers: 64826 | Issues: 0