robertanto's repositories
Real-Time-Sound-Event-Detection
This repository contains the Python implementation of a Sound Event Detection system working in real time.
libqi-python-nvidia-jetson
Compiled qi library for interfacing with Pepper and Nao robots using Python on Jetson Nano, TX2, and Xavier.
DEGramNet-torch
This repository contains the PyTorch code of the method presented in "DEGramNet: Effective audio analysis based on a fully learnable time-frequency representation".
Local-LLM-UI
This repository contains the code to deploy a Mistral-based chatbot using Docker Compose and the Hugging Face Inference API.
academicpages.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
antagonist
AnTagOnIst is a tool that supports the visual analysis and tagging of anomalies in time-series data
Open-Set-One-Shot-Face-Recognition
This repository contains the Python implementation of a Face Recognition system that works with just ONE image for each face to recognize. The system works in an open-set configuration, meaning it can reject unknown people.
ChatBot
A task-oriented chatbot based on the MultiWOZ dataset and implemented using the Rasa framework
chirpycardinal
Stanford's Alexa Prize socialbot
conformer
PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
Emotional-Support-Conversation
Data and code for the ACL 2021 paper: Towards Emotional Support Dialog Systems
Gradient-Free-Optimizers
Simple and reliable optimization with local, global, population-based and sequential techniques in numerical discrete search spaces.
jetson-voice
ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT
jetson_nano
This repository is a collection of scripts/programs I use to set up the software development environment on my Jetson Nano, TX2, and Xavier NX.
JointBERT
PyTorch implementation of JointBERT: "BERT for Joint Intent Classification and Slot Filling"
json-vector-database
This is a simple API for vector similarity search built with FastAPI. It uses a JSONVectorSearch database utility for storing and querying vector embeddings.
models
A collection of pre-trained, state-of-the-art models in the ONNX format
MTL-Speaker-Embeddings
Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" presented at Interspeech 2021
PARTNER
Repository containing code for the WWW 2021 paper on empathic rewriting
personal-emotional-dialogue-system
Paper list for dialogue systems
secure-flask-container-template
A template repo showing how to serve an API over HTTPS conveniently with Let's Encrypt certificates, using Certbot, Nginx, and - exemplarily - Flask, each running in a Docker container spun up through Docker Compose.
voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
voxceleb_enrichment_age_gender
Code and data repository for the paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021