Simon Gonzalez's starred repositories
countries-states-cities-database
๐ Discover our global repository of countries, states, and cities! ๐๏ธ Get comprehensive data in JSON, SQL, PSQL, XML, YAML, and CSV formats. Access ISO2, ISO3 codes, country code, capital, native language, timezones (for countries), and more. #countries #states #cities
common-voice
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
voice_datasets
๐ A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
awesome-ggplot2
A curated list of awesome ggplot2 tutorials, packages etc.
visNetwork
R package, using vis.js library for network visualization
language-list
List of all languages with names and ISO 639-1 codes in all languages and all data formats.
wordcloud2
R interface to wordcloud for data visualization.
ggplot2-exts.github.io
A list of ggplot2 extensions
SimpleWordlists
Word lists from the web.
verb.forms.dictionary
Verb forms dictionary
humannames
๐ฆ A list, huge one (~200K) of human male/female first/last names.
sv_score_calibration
Score calibration for speaker verification
occupations
A list of occupations
machine_readable_wordlists
A collection of word lists in machine readable, web-native (.yml and .json) format
multilingual_speech_valence_classification_datasets
Multilingual datasets with raw audio for speech emotion recognition
English-word-lists-parts-of-speech-approximate
Word lists categorized approximately by parts of speech. Parsed from open source lists as shown in details and sources. WARNING: Not suitable for language teaching purposes.
english-verbs
A database of phonologically transcribed English verbs, organized by inflection.
endangered-languages
A list of resources for conservation, preservation, development, and documentation of endangered, minority, and low or under resourced human languages.
awesome-ggplot2
A curated list of awesome ggplot2 tutorials, packages etc.