Simon Gonzalez's starred repositories

countries-states-cities-database

๐ŸŒ Discover our global repository of countries, states, and cities! ๐Ÿ™๏ธ Get comprehensive data in JSON, SQL, PSQL, XML, YAML, and CSV formats. Access ISO2, ISO3 codes, country code, capital, native language, timezones (for countries), and more. #countries #states #cities

Language:PHPLicense:ODbL-1.0Stargazers:6972Issues:118Issues:426

corpora

A collection of small corpuses of interesting data for the creation of bots and similar stuff.

common-voice

Common Voice is part of Mozilla's initiative to help teach machines how real people speak.

Language:TypeScriptLicense:MPL-2.0Stargazers:3275Issues:133Issues:2232

whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Language:Jupyter NotebookLicense:BSD-2-ClauseStargazers:2544Issues:46Issues:154

voice_datasets

๐Ÿ”Š A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

awesome-ggplot2

A curated list of awesome ggplot2 tutorials, packages etc.

tmap

R package for thematic maps

Language:RLicense:GPL-3.0Stargazers:852Issues:29Issues:808

visNetwork

R package, using vis.js library for network visualization

Language:JavaScriptLicense:NOASSERTIONStargazers:539Issues:32Issues:456

language-list

List of all languages with names and ISO 639-1 codes in all languages and all data formats.

Language:HTMLLicense:MITStargazers:507Issues:21Issues:14

wordcloud2

R interface to wordcloud for data visualization.

mapdeck

R interface to Deck.gl and Mapbox

rbokeh

R interface to Bokeh http://hafen.github.io/rbokeh/

Language:RLicense:NOASSERTIONStargazers:313Issues:35Issues:172

ggplot2-exts.github.io

A list of ggplot2 extensions

elpis

๐Ÿ™Š software for creating speech recognition models.

Language:PythonLicense:Apache-2.0Stargazers:151Issues:15Issues:175

SimpleWordlists

Word lists from the web.

License:MITStargazers:78Issues:3Issues:0

arincli

Ruby commands for ARIN's Reg-RWS and Whois-RWS

Language:RubyLicense:Apache-2.0Stargazers:45Issues:17Issues:14

humannames

๐Ÿ“ฆ A list, huge one (~200K) of human male/female first/last names.

Language:JavaScriptLicense:MITStargazers:35Issues:7Issues:0

speakr

speakr: A Wrapper for the Phonetic Software Praat

Language:RLicense:NOASSERTIONStargazers:24Issues:3Issues:10

sv_score_calibration

Score calibration for speaker verification

Language:PythonLicense:Apache-2.0Stargazers:23Issues:5Issues:0

PYLLR

Python toolkit for likelihood-ratio calibration of binary classifiers

Language:PythonLicense:MITStargazers:23Issues:5Issues:3

occupations

A list of occupations

machine_readable_wordlists

A collection of word lists in machine readable, web-native (.yml and .json) format

License:CC0-1.0Stargazers:18Issues:0Issues:0

multilingual_speech_valence_classification_datasets

Multilingual datasets with raw audio for speech emotion recognition

Language:PythonStargazers:18Issues:2Issues:0

kazdet

NLA-NU Kazakh Dependency Treebank

Language:PythonStargazers:8Issues:0Issues:0

English-word-lists-parts-of-speech-approximate

Word lists categorized approximately by parts of speech. Parsed from open source lists as shown in details and sources. WARNING: Not suitable for language teaching purposes.

License:UnlicenseStargazers:6Issues:2Issues:0

english-verbs

A database of phonologically transcribed English verbs, organized by inflection.

Language:Jupyter NotebookStargazers:3Issues:1Issues:0

endangered-languages

A list of resources for conservation, preservation, development, and documentation of endangered, minority, and low or under resourced human languages.

ordinal

Convert numbers into words.

Language:HaskellLicense:BSD-3-ClauseStargazers:1Issues:2Issues:0

awesome-ggplot2

A curated list of awesome ggplot2 tutorials, packages etc.

Stargazers:1Issues:0Issues:0