thechargedneutron

UT Austin

Austin, TX

thechargedneutron.github.io

@chargedneutron_

Kumar Ashutosh's starred repositories

DROID-SLAM

Language: Python · BSD-3-Clause · 168300

fiftyone

The open-source tool for building high-quality datasets and computer vision models

Language: Python · Apache-2.0 · 791500

WHAM

Language: Python · MIT · 59700

IDM-VTON

IDM-VTON: Improving Diffusion Models for Authentic Virtual Try-on in the Wild

Language: Python · 313800

TaskGraph

Official code repository for "Video-Mined Task Graphs for Keystep Recognition in Instructional Videos" arXiv, 2023

Language: Python · NOASSERTION · 900

AStar

A 2D A Star (A*) pathfinding implementation in C# focused on ease of use

Language: C# · MIT · 12600

basic-pitch

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

Language: Python · Apache-2.0 · 315600

Multimodal-Graph-Script-Learning

Non-Sequential Graph Script Induction via Multimedia Grounding (ACL 2023)

Language: Python · MIT · 1100

arxiv-latex-cleaner

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Language: Python · Apache-2.0 · 503700

VAST

Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

Language: Jupyter Notebook · MIT · 22100

VALOR

Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset

Language: Python · MIT · 24800

HierVL

[CVPR 2023] HierVL: Learning Hierarchical Video-Language Embeddings

Language: Python · NOASSERTION · 4200

CA-SUM

A PyTorch Implementation of CA-SUM from "Summarizing Videos using Concentrated Attention and Considering the Uniqueness and Diversity of the Video Frames", Proc. ACM ICMR 2022

Language: Python · NOASSERTION · 2600

videojs-annotation-comments

A plugin for video.js to add support for timeline moment/range comments and annotations

Language: JavaScript · NOASSERTION · 16700

untrunc

Restore a truncated mp4/mov. Improved version of ponchio/untrunc

Language: C++ · GPL-2.0 · 190700

AGQA_baselines_code

Language: Python · MIT · 1800

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language: Jupyter Notebook · MIT · 2373300

PreSumm

Code for the EMNLP 2019 paper "Text Summarization with Pretrained Encoders"

Language: Python · MIT · 127700

BRIO

ACL 2022: BRIO: Bringing Order to Abstractive Summarization

Language: Python · 32500

TransformerSum

Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive summarization datasets to the extractive task.

Language: Python · GPL-3.0 · 42400

MatchSum

Code for ACL 2020 paper: "Extractive Summarization as Text Matching"

Language: Python · 51900

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Language: Python · Apache-2.0 · 12954400

SimCSE

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Language: Python · MIT · 332000

clip-as-service

🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP

Language: Python · NOASSERTION · 1230100

EgoVLP

[NeurIPS2022] Egocentric Video-Language Pretraining

Language: Python · 22000

pyskl

A toolbox for skeleton-based action recognition.

Language: Python · Apache-2.0 · 91000

openpose

OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation

Language: C++ · NOASSERTION · 3048300

audioset_tagging_cnn

Language: Python · MIT · 128600

UniVL

An official implementation for "UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"

Language: Python · MIT · 33200

PhraseCutDataset

Dataset API for "PhraseCut: Language-based Image Segmentation in the Wild"

Language: Jupyter Notebook · 9900