secutron's repositories

TesTime

collection of sample colab files

Language:Jupyter NotebookStargazers:3Issues:1Issues:0

bvh-python

Python module for parsing BVH (Biovision hierarchical data) mocap files

Language:PythonLicense:MITStargazers:2Issues:1Issues:0

ACTOR

Official Pytorch implementation of the paper "Action-Conditioned 3D Human Motion Synthesis with Transformer VAE", ICCV 2021

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

Emote-hack

using chatgpt (now Claude 3) to reverse engineer code from Emote white paper. WIP

Language:PythonStargazers:1Issues:0Issues:0

MachineLearning-AI

This repository contains all the work that I regularly did and studied from Medium blogs, several research papers, and other Repos (related/unrelated to the research papers).

Stargazers:1Issues:0Issues:0
Language:Jupyter NotebookStargazers:1Issues:0Issues:0
Stargazers:1Issues:0Issues:0

stable-diffusion-webui

Stable Diffusion web UI

Language:PythonLicense:AGPL-3.0Stargazers:1Issues:0Issues:0
Stargazers:0Issues:0Issues:0

DiffFace

DiffFace: Diffusion-based Face Swapping with Facial Guidance

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

Easy-Wav2Lip

Colab for making Wav2Lip high quality and easy to use

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

football_analysis

This repository contains a comprehensive computer vision/machine learning football project that uses YOLO for object detection, Kmeans for pixel segmentation, optical flow for motion tracking, and perspective transformation to analyze player movements in football videos

Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

hallo-for-windows

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

License:MITStargazers:0Issues:0Issues:0

ImageBind

ImageBind One Embedding Space to Bind Them All

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

jepa

PyTorch code and models for V-JEPA self-supervised learning from video.

License:NOASSERTIONStargazers:0Issues:0Issues:0

KoProgressiveTransformersSLP

Source code for "Progressive Transformers for End-to-End Sign Language Production" (ECCV 2020)

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

ListenDenoiseAction

Code to reproduce the results for our SIGGRAPH 2023 paper "Listen Denoise Action"

License:NOASSERTIONStargazers:0Issues:0Issues:0

LivePortrait-Advanced-Portrait-Animation-System

LivePortrait is an advanced deep learning-based system for animating portrait images. It uses a two-stage training process to create realistic and controllable animations from static portrait images.

Language:PythonStargazers:0Issues:0Issues:0

MARLIN

[CVPR] MARLIN: Masked Autoencoder for facial video Representation LearnINg

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

mediapipe_pose_compare

Joint angle comparison of mediapipe prediction results bvh conversion with ground truth bvh

Language:Jupyter NotebookStargazers:0Issues:1Issues:0

Meteor

Official PyTorch implementation code for realizing the technical part of Mamba-based traversal of rationale (Meteor) to improve performance of numerous vision language performances for diverse capabilities. (Under Review)

License:MITStargazers:0Issues:0Issues:0

minimal-diffusion

A minimal yet resourceful implementation of diffusion models (along with pretrained models + synthetic images for nine datasets)

Stargazers:0Issues:0Issues:0

MultiTalk

[INTERSPEECH'24] Official repository for "MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Dataset"

Language:PythonStargazers:0Issues:0Issues:0

nitec

NITEC: Versatile Hand-Annotated Eye Contact Dataset for Ego-Vision Interaction (Accepted at WACV24)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
License:GPL-3.0Stargazers:0Issues:0Issues:0

SHOW

This is the codebase for SHOW.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

Speech-driven-expressions

Speech-Driven Expression Blendshape Based on Single-Layer Self-attention Network (AIWIN 2022)

Stargazers:0Issues:0Issues:0

videollm-online

VideoLLM-online: Online Video Large Language Model for Streaming Video (CVPR 2024)

License:Apache-2.0Stargazers:0Issues:0Issues:0