miguelsantos / dictionaries

Hunspell dictionaries in UTF-8

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

dictionaries

Collection of normalized and easily installable hunspell dictionaries. Useful with nodehun, nspell, and others.

Install

See each of the below packages for install guidelines.

Dictionaries

Note that normal, canonical, and preferred BCP-47 codes are used. To illustrate, as American English and Brazilian Portuguese are the most common types of English and Portuguese respectively, they get the codes en and pt.

Important: this project itself is MIT, but each index.dic and index.aff file still has its original license!

In total 91 dictionaries are provided.

Name Description License
dictionary-bg Bulgarian (GPL-2.0 OR LGPL-2.1 OR MPL-1.1)
dictionary-br Breton (GPL-2.0 OR LGPL-2.1 OR MPL-1.1)
dictionary-ca Catalan (GPL-2.0 OR LGPL-2.1)
dictionary-ca-valencia Catalan (Valencian) (GPL-2.0 OR LGPL-2.1)
dictionary-cs Czech GPL-2.0
dictionary-da Danish (GPL-2.0 OR LGPL-2.1 OR MPL-1.1)
dictionary-de German (GPL-2.0 OR GPL-3.0)
dictionary-de-at German (Austria) (GPL-2.0 OR GPL-3.0)
dictionary-de-ch German (Switzerland) (GPL-2.0 OR GPL-3.0)
dictionary-el Modern Greek (GPL-2.0 OR LGPL-2.1 OR MPL-1.1)
dictionary-el-polyton Modern Greek (Polytonic Greek) GPL-3.0
dictionary-en English (MIT AND BSD)
dictionary-en-au English (Australia) (MIT AND BSD)
dictionary-en-ca English (Canada) (MIT AND BSD)
dictionary-en-gb English (United Kingdom) (MIT AND BSD)
dictionary-en-za English (South Africa) LGPL-2.1
dictionary-eo Esperanto GPL-2.0
dictionary-es Spanish (or Castilian) (GPL-3.0 OR LGPL-3.0 OR MPL-1.1)
dictionary-es-ar Spanish (or Castilian; Argentina) (GPL-3.0 OR LGPL-3.0 OR MPL-1.1)
dictionary-es-bo Spanish (or Castilian; Bolivia) (GPL-3.0 OR LGPL-3.0 OR MPL-1.1)
dictionary-es-cl Spanish (or Castilian; Chile) (GPL-3.0 OR LGPL-3.0 OR MPL-1.1)
dictionary-es-co Spanish (or Castilian; Colombia) (GPL-3.0 OR LGPL-3.0 OR MPL-1.1)
dictionary-es-cr Spanish (or Castilian; Costa Rica) (GPL-3.0 OR LGPL-3.0 OR MPL-1.1)
dictionary-es-cu Spanish (or Castilian; Cuba) (GPL-3.0 OR LGPL-3.0 OR MPL-1.1)
dictionary-es-do Spanish (or Castilian; Dominican Republic) (GPL-3.0 OR LGPL-3.0 OR MPL-1.1)
dictionary-es-ec Spanish (or Castilian; Ecuador) (GPL-3.0 OR LGPL-3.0 OR MPL-1.1)
dictionary-es-gt Spanish (or Castilian; Guatemala) (GPL-3.0 OR LGPL-3.0 OR MPL-1.1)
dictionary-es-hn Spanish (or Castilian; Honduras) (GPL-3.0 OR LGPL-3.0 OR MPL-1.1)
dictionary-es-mx Spanish (or Castilian; Mexico) (GPL-3.0 OR LGPL-3.0 OR MPL-1.1)
dictionary-es-ni Spanish (or Castilian; Nicaragua) (GPL-3.0 OR LGPL-3.0 OR MPL-1.1)
dictionary-es-pa Spanish (or Castilian; Panama) (GPL-3.0 OR LGPL-3.0 OR MPL-1.1)
dictionary-es-pe Spanish (or Castilian; Peru) (GPL-3.0 OR LGPL-3.0 OR MPL-1.1)
dictionary-es-ph Spanish (or Castilian; Philippines) (GPL-3.0 OR LGPL-3.0 OR MPL-1.1)
dictionary-es-pr Spanish (or Castilian; Puerto Rico) (GPL-3.0 OR LGPL-3.0 OR MPL-1.1)
dictionary-es-py Spanish (or Castilian; Paraguay) (GPL-3.0 OR LGPL-3.0 OR MPL-1.1)
dictionary-es-sv Spanish (or Castilian; El Salvador) (GPL-3.0 OR LGPL-3.0 OR MPL-1.1)
dictionary-es-us Spanish (or Castilian; United States) (GPL-3.0 OR LGPL-3.0 OR MPL-1.1)
dictionary-es-uy Spanish (or Castilian; Uruguay) (GPL-3.0 OR LGPL-3.0 OR MPL-1.1)
dictionary-es-ve Spanish (or Castilian; Venezuela) (GPL-3.0 OR LGPL-3.0 OR MPL-1.1)
dictionary-et Estonian LGPL-2.1
dictionary-eu Basque GPL-2.0
dictionary-fa Persian Apache-2.0
dictionary-fo Faroese (GPL-2.0 OR LGPL-2.1 OR MPL-1.1)
dictionary-fr French MPL-2.0
dictionary-fur Friulian GPL-2.0
dictionary-fy Western Frisian GPL-3.0
dictionary-ga Irish GPL-2.0
dictionary-gd Scottish Gaelic (or Gaelic) GPL-3.0
dictionary-gl Galician GPL-3.0
dictionary-he Hebrew AGPL-3.0
dictionary-hr Croatian (LGPL-2.1 OR SISSL)
dictionary-hu Hungarian (GPL-2.0 OR LGPL-2.1 OR MPL-1.1)
dictionary-hy Armenian (GPL-2.0 OR LGPL-2.1 OR MPL-1.1)
dictionary-hyw Western Armenian (GPL-2.0 OR LGPL-2.1 OR MPL-1.1)
dictionary-ia Interlingua GPL-3.0
dictionary-ie Interlingue (or Occidental) Apache-2.0
dictionary-is Icelandic CC-BY-SA-3.0
dictionary-it Italian GPL-3.0
dictionary-ka Georgian MIT
dictionary-ko Korean (GPL-2.0 OR LGPL-2.1 OR MPL-1.1)
dictionary-la Latin GPL-2.0
dictionary-lb Luxembourgish (or Letzeburgesch) EUPL-1.1
dictionary-lt Lithuanian BSD-3-Clause
dictionary-ltg Latgalian LGPL-2.1
dictionary-lv Latvian LGPL-2.1
dictionary-mk Macedonian GPL-3.0
dictionary-mn Mongolian GPL-2.0
dictionary-nb Norwegian Bokmål GPL-2.0
dictionary-nds Low German (or Low Saxon) GPL-3.0
dictionary-ne Nepali LGPL-2.1
dictionary-nl Dutch (or Flemish) (BSD-3-Clause OR CC-BY-3.0)
dictionary-nn Norwegian Nynorsk GPL-2.0
dictionary-oc Occitan (post 1500) GPL-2.0
dictionary-pl Polish (GPL-3.0 OR LGPL-3.0 OR MPL-2.0)
dictionary-pt Portuguese (LGPL-3.0 OR MPL-2.0)
dictionary-pt-pt Portuguese (Portugal) (GPL-2.0 OR LGPL-2.1 OR MPL-1.1)
dictionary-ro Romanian (or Moldavian; or Moldovan) (GPL-2.0 OR LGPL-2.1 OR MPL-1.1)
dictionary-ru Russian LGPL-3.0
dictionary-rw Kinyarwanda GPL-3.0
dictionary-sk Slovak (GPL-2.0 OR LGPL-2.1 OR MPL-1.1)
dictionary-sl Slovenian (GPL-3.0 OR LGPL-2.1)
dictionary-sr Serbian (GPL-2.0 OR LGPL-2.1 OR MPL-1.1 OR CC-BY-SA-3.0)
dictionary-sr-latn Serbian (Latin script) (GPL-2.0 OR LGPL-2.1 OR MPL-1.1 OR CC-BY-SA-3.0)
dictionary-sv Swedish LGPL-3.0
dictionary-sv-fi Swedish (Finland) LGPL-3.0
dictionary-tk Turkmen Apache-2.0
dictionary-tlh Klingon (or tlhIngan Hol) Apache-2.0
dictionary-tlh-latn Klingon (or tlhIngan Hol; Latin script) Apache-2.0
dictionary-tr Turkish MIT
dictionary-uk Ukrainian GPL-3.0
dictionary-vi Vietnamese GPL-2.0

macOS

Each dictionary can be installed on OS X by following this StackExchange answer.

Build

I’ve only tested this on macOS, but there you at least need to install:

  • wget: brew install wget (crawling)
  • hunspell: brew install hunspell (many dictionaries)
  • coreutils: brew install coreutils (many dictionaries)
  • ispell: brew install ispell (German)

Note that the GNU replacements should be setup in PATH to overwrite macOS defaults.

Contributing

Dictionaries can be added if they:

  • have a significant affix file (not just a .dic file)
  • have an open source license
  • are convertible to UTF-8 with iconv(1)

The crawling and building is done in script/crawl.sh. Add code there, similar to the existing ones, to include new dictionaries.

License

MIT © Titus Wormer

See license files in each dictionary for the licensing of index.dic and index.aff files.

About

Hunspell dictionaries in UTF-8

License:MIT License


Languages

Language:JavaScript 61.8%Language:Shell 38.2%