Max Bain (m-bain)

m-bain

Geek Repo

Company:VGG, University of Oxford

Home Page:maxbain.com

Twitter:@maxhbain

Github PK Tool:Github PK Tool


Organizations
reka-ai

Max Bain's repositories

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language:PythonLicense:BSD-4-ClauseStargazers:8878Issues:118Issues:601

webvid

Large-scale text-video dataset. 10 million captioned short videos.

frozen-in-time

Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]

Language:PythonLicense:MITStargazers:330Issues:11Issues:45

CondensedMovies

Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]

video-transformers

Implementations of Transformers for Video

Language:PythonLicense:Apache-2.0Stargazers:23Issues:4Issues:0

CondensedMovies-chall

Condensed Movies Challenge 2021

clip-hitchhiker

A Clip-Hitchiker's Guide to Long Video Retrieval [Arxiv 2022]

Language:PythonStargazers:4Issues:3Issues:0

pytorch-multi-label-classifier

A pytorch implemented classifier for Multiple-Label classification

Language:PythonStargazers:3Issues:2Issues:0
Language:PythonStargazers:2Issues:2Issues:0

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:2Issues:1Issues:0

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

Language:PythonLicense:Apache-2.0Stargazers:2Issues:1Issues:0

SimpleDiarization

Simple Diarization model

Language:PythonLicense:MITStargazers:2Issues:1Issues:0

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language:Jupyter NotebookLicense:MITStargazers:1Issues:1Issues:0

collaborative-experts

Video embeddings for retrieval - code for the paper "Use What You Have: Video retrieval using representations from collaborative experts"

Language:PythonStargazers:1Issues:2Issues:0

conceptual-12m

Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.

License:NOASSERTIONStargazers:1Issues:1Issues:0

primate-behaviour-recognition

Automated Audiovisual Behaviour Recognition in Wild Primates

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

pytorch-image-models

PyTorch image models, scripts, pretrained weights -- (SE)ResNet/ResNeXT, DPN, EfficientNet, MixNet, MobileNet-V3/V2, MNASNet, Single-Path NAS, FBNet, and more

Language:PythonLicense:Apache-2.0Stargazers:1Issues:1Issues:0

slurm_gpustat

A simple command line tool to show GPU usage on a SLURM cluster

Language:PythonStargazers:1Issues:1Issues:0

torchvggish

Pytorch port of Google Research's VGGish model used for extracting audio features.

Language:PythonLicense:Apache-2.0Stargazers:1Issues:2Issues:0

video2dataset

Easily create large video dataset from video urls

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

bert-as-service

Mapping a variable-length sentence to a fixed-length vector using BERT model

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

hydra

Hydra is a framework for elegantly configuring complex applications

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

video_features

Extract video features from raw videos using multiple GPUs. We support RAFT and PWC flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, ResNet features.

Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:2Issues:0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0

whisper-asr-webservice

OpenAI Whisper ASR Webservice API

Language:PythonLicense:MITStargazers:0Issues:1Issues:0