ansegura7 / DSL_Analysis

Descriptive text analysis of the words contained in the Dictionary of the Spanish language (DSL).

Home Page:https://ansegura7.github.io/DSL_Analysis/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Descriptive Text Analysis of the DSL

Descriptive text analysis of the words contained in the Dictionary of the Spanish language (DSL).

Text Analysis

  1. Approximate number of words in the DSL
  2. Number of words with acute accent in Spanish language
  3. Frequency of words per size
  4. Top 15 bigger words
  5. Frequency of letters in DSL words
  6. Vowel and consonant ratio
  7. Frequency of words per letter of the alphabet
  8. Most frequent n-grams

Analysis

You can see the analysis here.

Acknowledgment

I would like to express my gratitude to Giusseppe Domínguez for compiling in plain text the vast majority of words from the dictionary of the Spanish language (DSL).

Contributing and Feedback

Any kind of feedback/suggestions would be greatly appreciated (algorithm design, documentation, improvement ideas, spelling mistakes, etc...). If you want to make a contribution to the course you can do it through a PR.

Author

  • Created by Andrés Segura Tinoco
  • Created on Aug 20, 2020
  • Updated on Aug 2, 2021

License

This project is licensed under the terms of the MIT license.

About

Descriptive text analysis of the words contained in the Dictionary of the Spanish language (DSL).

https://ansegura7.github.io/DSL_Analysis/

License:MIT License


Languages

Language:HTML 81.3%Language:Jupyter Notebook 18.7%