rajeshodayanchal / English-Dictionary-Open-Source

Open-Source Queryable Formatted English Dictionary, in multiple formats based on The Online Plain Text English Dictionary (OPTED) dictionary

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Open-Source English Dictionary

An open source English language dictionary with 176,023 definitions.

This is based on the Source Forge Project: MySQL English Dictionary , which in turn in based on the The Online Plain Text English Dictionary (OPTED) dictionary.

OPTED is a public domain English word list dictionary, based on the public domain portion of "The Project Gutenberg e-text of Webster's Unabridged Dictionary" which is in turn based on the 1913 US Webster's Unabridged Dictionary. See Project Gutenberg.

Since the dictionary is based on 1913 edition, it does not include any modern words (Yet!, but in a future release will include the database Princeton University's WordNet.

Getting Started

You can clone the repository by running:

git clone https://github.com/UberPython/English-Dictionary-Open-Source

Find the usage instructions here.

Data Structure

The dictionary has 3 fields

  1. word: In lowercase
  2. word type: Abbreviations describe the type, e.g. verb, noun, etc.
  3. definition: Definition of the word in sentence case

Database Schema

The table is named 'entries' and has the following schema

Column      Type             Schema
------      --------------   -------------------------------
word        varchar(25)      "word" varchar(25) NOT NULL
wordtype    varchar(20)      "wordtype" varchar(20) NOT NULL  
definition  text             "definition" text NOT NULL

Dictionary Format & Repository Structure

There are 3 formats available as of now

Format & Link Description
CSV A single file with all the words in standard CSV format
SQLITE3 A single file formatted as a SQLITE3 database
MYSQL MySQL dump that can be imported directly inside MySQL / MariaDB

Usage Instructions

Language Instructions are available for
Python CSV, SQLITE3, MYSQL

Credits

  1. Project Gutenberg: For providing the original 1913 US Webster's Unabridged Dictionary. Make sure you read the Project Gutenberg's README for license and other details if you care considering using this for commercial purposes.

  2. x16bkkamz6rkb78: For compiling the MySQL dump and releasing on Source Forge.

  3. dumblob: For providing the extraordinarily elegant mysql2sqlite tool for converting the MySQL dump to SQLite3.

About

Open-Source Queryable Formatted English Dictionary, in multiple formats based on The Online Plain Text English Dictionary (OPTED) dictionary

License:MIT License