CypressA / GreekLex-2

Python scripts for the development of GreekLex 2, a psycholinguistic database for Greek language

Home Page:http://www.psychology.nottingham.ac.uk/GreekLex/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

GreekLex-2

Python scripts for the development of GreekLex 2, a psycholinguistic database for Greek language

##Examples for running the scripts GreekSyllabicParser.py

>>> parser('κανονικός')

['κα', 'νο', 'νι', 'κός']

>>> parser('άνθρωπος')

['άν', 'θρω', 'πος']



Alternatively, an entire database can be parsed but it needs to have the following structure:

word    var1    var2

αρχή      ..        ..

ζωή        ..        ..

>>> syllabifyLexicon('database.txt', False)

The output will look like:

Word    syll       syll num

αρχή    αρ-χή    2

ζωή     ζω-ή      2



GPconverter.py

>>> convert('φτώχεια','φτώ-χεια')

'ftόXa'

>>> convert('αδειανός','α-δεια-νός')

'aDjanόs'

The second argument needs to be the syllabified orthographic form. The previous script can be used for this.



Whole databases can be processed here as well. Example:

Word    syll        syll num

αρχή     αρ-χή    2

ζωή      ζω-ή      2

>>> convertLexicon('database.txt')

The output will look like:

Word    syll        phones    phonSyl

αρχή    αρ-χή     arXί         ar-Xί

ζωή     ζω-ή       zoί           zo-ί

Note that the syllabified output will entail the orthographic syllables, not the phonological ones.



GreekPhonSyllabicParser.py

>>> parser('aDjanόs')

'a-Dja-nόs'

>>> parser('άnTropos')

'άn-Tro-pos'



Database example:

phonTranscriptions      var1     var2

eksoraizmόs                  ...        ...

eksostrefίs                    ...         ...

>>> syllabifyLexicon('test.txt')

The output will look like:

Word              syll                     syll num

eksoraizmόs   e-kso-ra-i-zmόs  5

eksostrefίs      e-kso-stre-fίs      4



For any questions or bug reports, contact Antonios Kyparissiadis at:

antonios.kyparissiadis@nottingham.ac.uk or kyparissiadis@gmail.com



If you use these scripts, please cite by using this reference.

Kyparissiadis, A., van Heuven, W.J.B., Pitchford, N.J., & Ledgeway, T. (submitted). GreekLex 2: A comprehensive database with syllabic, phonological and part-of-speech information.



GreekLex 2 database is available here.

The reference list provides sources that the development of the scripts was based on.

References

Babiniotis, G. (2008). Orthografiko Lexiko tis Neas Elinikis Glossas (Orthographic Dictionary of Modern Greek Language). Athens, Greece: Kentro Lexikologias.

Botinis, A. (2011). Fonitiki tis Ellinikis (Greek phonetics). Athens, Greece: ISEL Editions.

Tzakosta, M. (2010). The importance of being voiced: Cluster formation in dialectal variants of Greek. In A. Ralli, B. D. Joseph, M. Janse, and A. Karasimos, (Eds), Electronic Proceedings of the 4th International Conference of modern Greek Dialects and Linguistic Theory (pp. 213-223). Patras, Greece: University of Patras.

Tzakosta, M. (2011). Consonantal interactions in dialectal variants of Greek: a typological approach of three-member consonant clusters. In C. Basea-Bezadakou, I. Manolessou, A. Afroudakis, G. Katsouda & S. Beis (Eds.), Modern Greek Dialectology vol. 6. (pp. 463-483). Athens, Greece: Academy of Athens – Research Center for Modern Greek Dialects.

Tzakosta, M., & Karra, A. (2011). A typological and comparative account of CL and CC clusters in Greek dialects. In Μ. Janse, B. Joseph, P. Pavlou, A. Ralli & S. Armosti (Eds.), Studies in Modern Greek Dialects and Linguistic Theory I (pp. 95–105). Nicosia, Cyprus: Kykkos Cultural Research Centre.

About

Python scripts for the development of GreekLex 2, a psycholinguistic database for Greek language

http://www.psychology.nottingham.ac.uk/GreekLex/

License:GNU General Public License v3.0


Languages

Language:Python 100.0%