Romain Beaumont (rom1504)

Company: @google

Location: Paris

Home Page: http://rom1504.fr/


Organizations
camomile-project
MephisTools
PrismarineJS
ProtoDef-io
SpockBotMC
webtorrent

Romain Beaumont's repositories

img2dataset

Easily turn large sets of image URLs into an image dataset. Can download, resize and package 100M URLs in 20h on one machine.

Language: Python, License: MIT, Stargazers: 3232, Issues: 30, Issues: 247

clip-retrieval

Easily compute CLIP embeddings and build a CLIP retrieval system with them

Language: Jupyter Notebook, License: MIT, Stargazers: 2117, Issues: 24, Issues: 220

cc2dataset

Easily convert Common Crawl to a dataset of captions and documents. Image/text, Audio/text, Video/text, ...

Language: Python, License: MIT, Stargazers: 292, Issues: 9, Issues: 33

laion-prepro

Get hundreds of millions of image+url pairs from the crawling at home dataset and preprocess them

image_embeddings

Using efficientnet to provide embeddings for retrieval

Language: Jupyter Notebook, License: MIT, Stargazers: 140, Issues: 3, Issues: 26

embedding-reader

Efficiently read embeddings in streaming fashion from any filesystem

Language: Python, License: MIT, Stargazers: 83, Issues: 4, Issues: 26

gpu-tester

gpu-tester detects broken and slow GPUs in a cluster

Language: Python, License: MIT, Stargazers: 61, Issues: 2, Issues: 4

any2dataset

Turn any collection of files into a dataset

Language: Python, License: MIT, Stargazers: 41, Issues: 3, Issues: 5

CLIP

Contrastive Language-Image Pretraining

Language: Jupyter Notebook, License: MIT, Stargazers: 38, Issues: 2, Issues: 2

python-template

Simple python template

Language: Python, License: MIT, Stargazers: 37, Issues: 2, Issues: 0

audio2dataset

Easily turn large sets of audio urls to an audio dataset.

Language: Python, License: MIT, Stargazers: 19, Issues: 3, Issues: 6

slurm-tracking-bot

Simple slurm tracking bot to check usage

Language: Python, License: MIT, Stargazers: 9, Issues: 2, Issues: 0

word_knn

Quickly find closest words using an efficient knn and word embeddings

Language: Python, License: MIT, Stargazers: 6, Issues: 2, Issues: 5

open_clip

An open source implementation of CLIP.

Language: Python, License: NOASSERTION, Stargazers: 3, Issues: 1, Issues: 0

accelerate

🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision

Language: Python, License: Apache-2.0, Stargazers: 2, Issues: 1, Issues: 0

distributed-shuffle

A simple implementation of distributed shuffle, intended for learning

Language: Python, License: MIT, Stargazers: 2, Issues: 2, Issues: 4

k-diffusion

Karras et al. (2022) diffusion models for PyTorch

Language: Python, License: MIT, Stargazers: 2, Issues: 1, Issues: 0

video2numpy

Optimized library for large-scale extraction of frames and audio from video.

Language: Python, License: MIT, Stargazers: 2, Issues: 1, Issues: 0

Language: Python, License: Apache-2.0, Stargazers: 1, Issues: 1, Issues: 0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Jupyter Notebook, License: MIT, Stargazers: 1, Issues: 1, Issues: 0

aria2

aria2 is a lightweight multi-protocol and multi-source, cross-platform download utility operated from the command line. It supports HTTP/HTTPS, FTP, SFTP, BitTorrent and Metalink.

Language: C++, License: GPL-2.0, Stargazers: 0, Issues: 1, Issues: 0

embedbase

The native Software 3.0 stack

Language: TypeScript, License: MIT, Stargazers: 0, Issues: 1, Issues: 0

EnMicroMsg.db-Password-Cracker

Crack the password of EnMicroMsg.db with a brute-force attack.

Language: Python, License: GPL-3.0, Stargazers: 0, Issues: 1, Issues: 0

prismarine-web-client

mineflayer, running in your browser

Language: JavaScript, License: MIT, Stargazers: 0, Issues: 1, Issues: 0

v-diffusion-pytorch

v objective diffusion inference code for PyTorch.

Language: Python, License: MIT, Stargazers: 0, Issues: 1, Issues: 0

wechat-dump

Dump WeChat messages from Android

Language: Python, License: GPL-3.0, Stargazers: 0, Issues: 2, Issues: 0