jakubbortlik/accent_rating

Accent Rating

This is a collection of scripts and data I used when working on my dissertation.

recording_procedure.praat - this is the script I used for collecting the audio data (it's a mess like most Praat scripts I've written, but it was good enough to do what I wanted)

survey.sur - PsyToolkit survey for collecting metadata and running the experiment
rating_experiment.psy - The experiment itself
scale_l_to_r.txt - Code that draws a sliding scale in rating_experiment.psy with left-to-right orientation
scale_r_to_l.txt - Code that draws a sliding scale in rating_experiment.psy with right-to-left orientation
sounds.txt - List of audio files used in the experiment (audio files are not included due to license limitations)
bitmaps/ - Collection of pictures used in rating_experiment.psy
tables/ - Collection of sounds used in rating_experiment.psy. Each table has three columns: variable name, file name, duration

measure_EER.py - script that draws DET curves and calculates EER values from the sid_results_table_individual_files_filtered.csv results table. Simply run python measure_EER.py. The script creates a folder named EER_measurements which contains temporary files to verify that same and different speaker labels are correct.
sid_results_table_individual_files_filtered.csv - different-sex recordings have been filtered out

phonexia_lid_wrapper.sh - wrapper for the Phonexia LID technology (this is mainly here to show which subset of languages was used in the analysis)
speechbrain_lid_wrapper.py - wrapper for the SpeechBrain LID technology it subclasses the EncoderClassifier to extract scores for English and Czech apart from the best-matching language (the audio files are not included due to license limitations so the script does not really produce the results!)

sid4_wrapper.sh - wrapper for the Phonexia SID4 technology. A simple helper script which extracts voiceprints from all wave files in one directory and then compares all voiceprints with each other. It also saves the amount of "net speech" in each voiceprint to a file.
speechbrain_sid_wrapper.py - a wrapper for SpeechBrain's SpeakerRecognition. It has options for computing embeddings from audio files or loading previously created embeddings. It subclasses SpeakerRecognition in order to be able to perform verification on embeddings instead of on audio files. The script creates embeddings from all audio files in a directory and then performs speaker verification either on all combinations or all permutations of the embeddings set.