owuQQQ's repositories

pymarl_transformers

Official repository of the paper TransfQMix: Transformers for Leveraging the Graph Structure of Multi-Agent Reinforcement Learning Problems (AAMAS 2023)

Language:PythonStargazers:3Issues:0Issues:0

FSDA

Flexible Statistics and Data Analysis (FSDA) extends MATLAB for a robust analysis of data sets affected by different sources of heterogeneity. It is open source software licensed under the European Union Public Licence (EUPL). FSDA is a joint project by the University of Parma and the Joint Research Centre of the European Commission.

Language:MATLABLicense:NOASSERTIONStargazers:1Issues:0Issues:0

Life-lessons

A dataset of first-person monologue videos/transcript/annotations about "life lessons" in various domains. The main purpose is for multi-modal language analysis and modeling.

Language:PythonStargazers:1Issues:0Issues:0

TalkSHOW

This is the official repository for TalkSHOW: Generating Holistic 3D Human Motion from Speech [CVPR2023].

Language:PythonStargazers:1Issues:0Issues:0

word-discovery

Word Discovery in Visually Grounded, Self-Supervised Speech Models

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:1Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

cs-video-courses

List of Computer Science courses with video lectures.

Stargazers:0Issues:0Issues:0

deel-learning-course

code for deep learning courses

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

EGG

EGG: Emergence of lanGuage in Games

License:MITStargazers:0Issues:0Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

License:MITStargazers:0Issues:0Issues:0

GreedyCAS

code and data for EMNLP paper "Unsupervised Scientific Abstract Segmentation with Mutual Information"

Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

HMAD

Head Movements (and facial movements) Automatic Detection

Language:RLicense:GPL-3.0Stargazers:0Issues:0Issues:0

LLMs_interview_notes

该仓库主要记录 大模型(LLMs) 算法工程师相关的面试题

License:Apache-2.0Stargazers:0Issues:0Issues:0

Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

License:MITStargazers:0Issues:0Issues:0

Multimodal-Aphasia-Type-Detection_EMNLP_2023

This codebase contains the python scripts for the model for the "Learning Co-Speech Gesture for Multimodal Aphasia Type Detection (EMNLP 2023) ".

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

nova

NOVA is a tool for annotating and analyzing behaviours in social interactions. It supports Annotators using Machine Learning already during the coding process. Further it features both, discrete labels and continuous scores and a visuzalization of streams recorded with the SSI Framework.

Language:C#License:GPL-3.0Stargazers:0Issues:0Issues:0

object-aware-gaze-target-detection

Official repo of the paper "Object-aware Gaze Target Detection" (ICCV 2023)

Stargazers:0Issues:0Issues:0

Only-Noisy-Training

A self-supervised speech denoising strategy named Only-Noisy Training (ONT), which solves the speech denoising problem with only noisy audio signals in audio space for the first time.

Language:PythonStargazers:0Issues:0Issues:0

OptML_course

EPFL Course - Optimization for Machine Learning - CS-439

Stargazers:0Issues:0Issues:0

potato

potato: portable text annotation tool

License:NOASSERTIONStargazers:0Issues:0Issues:0

pwesuite

Suite for phonetic word embeddings, especially their evaluation and baseline models.

Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

speech2properties2gestures

We propose a new framework for gesture generation, aiming to allow data-driven approaches to produce more semantically rich gestures.

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

License:MITStargazers:0Issues:0Issues:0

vqvib_neurips2022

Codebase for VQ-VIB implementation and color experiments based on "Trading off Utility, Informativeness, and Complexity in Emergent Communication" NeurIPS 2022

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

WhisperSeg

Positive Transfer of the Whisper Speech Transformer to Human and Animal Voice Activity Detection

Stargazers:0Issues:0Issues:0