Chuang YU (chuangyu-robotics)

chuangyu-robotics

Geek Repo

Company:University of Manchester

Location:UK

Home Page:https://twitter.com/Alexchauncy

Twitter:@Alexchauncy

Github PK Tool:Github PK Tool

Chuang YU's starred repositories

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonLicense:NOASSERTIONStargazers:51539Issues:936Issues:1073

clip-as-service

🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP

Language:PythonLicense:NOASSERTIONStargazers:12284Issues:221Issues:606
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:9734Issues:98Issues:202

OpenFace

OpenFace – a state-of-the art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.

Language:MATLABLicense:NOASSERTIONStargazers:6728Issues:282Issues:1012

ESL-CN

The Elements of Statistical Learning (ESL)的中文翻译、代码实现及其习题解答。

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:2391Issues:70Issues:238

imitation

Clean PyTorch implementations of imitation and reward learning algorithms

Language:PythonLicense:MITStargazers:1203Issues:18Issues:337

Hands-On-Meta-Learning-With-Python

Learning to Learn using One-Shot Learning, MAML, Reptile, Meta-SGD and more with Tensorflow

Language:Jupyter NotebookStargazers:1152Issues:40Issues:5

fairo

A modular embodied agent architecture and platform for building embodied agents

Language:Jupyter NotebookLicense:MITStargazers:838Issues:40Issues:395

MocapNET

We present MocapNET, a real-time method that estimates the 3D human pose directly in the popular Bio Vision Hierarchy (BVH) format, given estimations of the 2D body joints originating from monocular color images. Our contributions include: (a) A novel and compact 2D pose NSRM representation. (b) A human body orientation classifier and an ensemble of orientation-tuned neural networks that regress the 3D human pose by also allowing for the decomposition of the body to an upper and lower kinematic hierarchy. This permits the recovery of the human pose even in the case of significant occlusions. (c) An efficient Inverse Kinematics solver that refines the neural-network-based solution providing 3D human pose estimations that are consistent with the limb sizes of a target person (if known). All the above yield a 33% accuracy improvement on the Human 3.6 Million (H3.6M) dataset compared to the baseline method (MocapNET) while maintaining real-time performance

Language:C++License:NOASSERTIONStargazers:822Issues:34Issues:121

FaceFormer

[CVPR 2022] FaceFormer: Speech-Driven 3D Facial Animation with Transformers

Language:PythonLicense:MITStargazers:756Issues:15Issues:101

rebel

An algorithm that generalizes the paradigm of self-play reinforcement learning and search to imperfect-information games.

Language:C++License:Apache-2.0Stargazers:635Issues:27Issues:33

awesome-rl-nlp

Curated Reinforcement Learning Resources for Natural Language Processing

License:GPL-3.0Stargazers:392Issues:23Issues:0

probing-vits

Probing the representations of Vision Transformers.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:305Issues:10Issues:7

ns-vqa

Neural-symbolic visual question answering

OSSO

From a body shape, infer the anatomic skeleton.

Language:PythonLicense:NOASSERTIONStargazers:202Issues:13Issues:14

Voice-synthesis

This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time. SV2TTS is a three-stage deep learning framework that allows to create a numerical representation of a voice from a few seconds of audio, and to use it to condition a text-to-speech model trained to generalize to new voices.

Language:PythonStargazers:162Issues:5Issues:0

NPYViewer

Load and view .npy files containing 2D and 1D NumPy arrays.

Language:PythonLicense:MITStargazers:151Issues:7Issues:9

Text-Independent-Speaker-Verification

Text Independent Speaker Verification Using GE2E Loss

CICERO

The purpose of this repository is to introduce new dialogue-level commonsense inference datasets and tasks. We chose dialogues as the data source because dialogues are known to be complex and rich in commonsense.

Language:PythonLicense:MITStargazers:62Issues:5Issues:4

awesome-multi-agent

A curated list of awesome multi-agent learning papers

License:MITStargazers:46Issues:3Issues:0
Language:JavaScriptLicense:MITStargazers:42Issues:0Issues:0

REGRAD

the code for generating REGRAD dataset

human-rl

Code for human intervention reinforcement learning

Language:PythonLicense:MITStargazers:32Issues:3Issues:0

EthicsShaping

[AAAI 2018] Implementation of the Ethics Shaping approach proposed in "A low-cost ethics shaping approach for designing reinforcement learning agents"

Language:PythonStargazers:10Issues:2Issues:0

TICC-MCP

An online solver for Trust-Intent-Capability-Calibration POMDP (TICC-POMDP)

mirror

Differentiable Deep Social Projection for AssistiveHuman-Robot Communication (RSS 2022)

Language:PythonLicense:MITStargazers:5Issues:2Issues:0

MUMBAI

Multi-Person, Multimodal Board Game Affect and Interaction Analysis Dataset

Voice-Cloning

This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time.

Language:PythonLicense:NOASSERTIONStargazers:2Issues:0Issues:0