This is a Python module that converts a paragraph of spoken English to written English.
For example, "two dollars" should be converted to $2. Abbreviations spoken as "C M" or "Triple A" should be written as "CM" and "AAA" respectively.
- We can define a structure for rules, save them in a file, and retrieve them when needed. New rules can be added as we discover them.
- By pipelining these rules, we can apply them to a paragraph one by one.
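The rule structure and pipeline described above could look roughly like the sketch below. It is only an illustration under stated assumptions, not the module's actual code: the `Rule` class, the JSON file layout, and the names `save_rules`, `load_rules`, `convert`, and `EXAMPLE_RULES` are all hypothetical, and each rule is assumed to be a plain regular-expression substitution.

```python
import json
import re
from dataclasses import asdict, dataclass


@dataclass
class Rule:
    """Hypothetical rule: a regex pattern and the text that replaces it."""
    name: str
    pattern: str
    replacement: str
    ignore_case: bool = False

    def apply(self, text: str) -> str:
        flags = re.IGNORECASE if self.ignore_case else 0
        return re.sub(self.pattern, self.replacement, text, flags=flags)


def save_rules(rules, path):
    """Store the rules as a JSON list so new ones can be added later."""
    with open(path, "w", encoding="utf-8") as fh:
        json.dump([asdict(rule) for rule in rules], fh, indent=2)


def load_rules(path):
    """Read rules back from the JSON file written by save_rules."""
    with open(path, encoding="utf-8") as fh:
        return [Rule(**item) for item in json.load(fh)]


def convert(text, rules):
    """Pipeline: apply every rule to the paragraph, one after another."""
    for rule in rules:
        text = rule.apply(text)
    return text


# Illustrative rules covering only the examples mentioned in this README.
EXAMPLE_RULES = [
    Rule("two_dollars", r"\btwo dollars\b", "$2", ignore_case=True),
    Rule("triple_letter", r"\b[Tt]riple ([A-Z])\b", r"\1\1\1"),
    Rule("spelled_pair", r"\b([A-Z]) ([A-Z])\b", r"\1\2"),
]
```

With these rules, `convert("I paid two dollars to the C M and Triple A", EXAMPLE_RULES)` returns `"I paid $2 to the CM and AAA"`. Rule order matters in a pipeline like this, and the `spelled_pair` rule joins only two letters at a time, so longer spelled-out abbreviations would need a more general rule.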
Here is some functionality that could be covered in future versions of the module:
- If the paragraph contains a monetary amount, e.g. "two million three thousand nine hundred and eighty-four", we may convert it to digits as 2003984.
- Handling of both the American and the Indian number systems, e.g. million and lakh.
- Handling of dates, e.g. "Today's date is twenty-eight October two thousand twenty" becomes "Today's date is 28-10-2020" or "Today's date is 2020-10-28".
- Handling of punctuation.
- Handling of proper spacing after each sentence.
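As a rough illustration of the number-related items above, the following sketch turns a spelled-out number into an integer and accepts both American scale words (million, billion) and Indian ones (lakh, crore). The function name `words_to_number` and the word tables are assumptions made for this example. It does not validate that the words form a well-ordered number, and it does not handle currency symbols or dates.

```python
UNITS = {
    "zero": 0, "one": 1, "two": 2, "three": 3, "four": 4, "five": 5,
    "six": 6, "seven": 7, "eight": 8, "nine": 9, "ten": 10,
    "eleven": 11, "twelve": 12, "thirteen": 13, "fourteen": 14,
    "fifteen": 15, "sixteen": 16, "seventeen": 17, "eighteen": 18,
    "nineteen": 19,
}
TENS = {
    "twenty": 20, "thirty": 30, "forty": 40, "fifty": 50,
    "sixty": 60, "seventy": 70, "eighty": 80, "ninety": 90,
}
# Scale words from both number systems (hypothetical, minimal table).
SCALES = {
    "thousand": 1_000,
    "lakh": 100_000, "lakhs": 100_000,
    "million": 1_000_000,
    "crore": 10_000_000, "crores": 10_000_000,
    "billion": 1_000_000_000,
}


def words_to_number(phrase):
    """Convert a spelled-out number such as 'two thousand twenty' to an int."""
    total = 0    # sum of the groups already closed by a scale word
    current = 0  # value of the group being read (always below 1000)
    for word in phrase.lower().replace("-", " ").split():
        if word == "and":
            continue
        if word in UNITS:
            current += UNITS[word]
        elif word in TENS:
            current += TENS[word]
        elif word == "hundred":
            current = max(current, 1) * 100
        elif word in SCALES:
            total += max(current, 1) * SCALES[word]
            current = 0
        else:
            raise ValueError(f"not a number word: {word!r}")
    return total + current
```

For the example in the list, `words_to_number("two million three thousand nine hundred and eighty-four")` returns `2003984`, and `words_to_number("five lakh twenty thousand")` returns `520000`. The same helper could feed a date rule, since `words_to_number("two thousand twenty")` returns `2020`.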