anacastrosalgado / DLPC

Dicionário da Língua Portuguesa Contemporânea (DLPC, 2001)

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

DLPC – Dicionário da Língua Portuguesa Contemporânea

This is a repository of the Dicionário da Língua Portuguesa Contemporânea (DLPC) published by Academia das Ciências de Lisboa (ACL).

The DLPC is a monolingual Portuguese dictionary that was coordinated by João Malaca Casteleiro. It contains around 70,000 entries and was published in 2001 in two volumes, totalling 3880 pages.

The PDF version of the printed edition was converted into XML using a customised version of the P5 schema of the TEI, while a custom-built dictionary writing system using TEI as a data model in the backend was developed to serve as an editing environment for the new and improved online edition of the dictionary.

TEI Lex-0 in action

The DLPC is currently being converted to the TEI Lex-0 format, a baseline TEI schema for encoding dictionaries, for data interoperability purposes. I'm working on making this major dictionary TEI Lex–0 native with the supervision of Toma Tasovac.

This directory collects samples of dictionary entries (xml and image) to illustrate TEI Lex-0 application.

Publications

Tasovac, T., Salgado, A., Costa, R. (2020, in press). Encoding Polylexical Units with TEI Lex-0. Slovenšcina 2.0: empirical, applied and interdisciplinary research.

Salgado, A. Costa, R., Tasovac, T. (2020, conference abstract accepted). Mapping domain labels of dictionaries. XIX EURALEX International Congress: Lexicography for Inclusion, Alexandroupolis, Greece.

Salgado, A, Costa, R., Tasovac, T. (2019). Improving the consistency of usage labelling in dictionaries with TEI Lex- 0. Lexicography, Journal of ASIALEX, pp. 133-156. Berlin: Springer Verlag. DOI: 10.1007/s40607-019-00061-x.

Salgado, A., Costa, R., Tasovac, T., Simões, A. (2019). TEI Lex-0 In Action: Improving the Encoding of the Dictionary of the Academia das Ciências de Lisboa. In I. Kosem et al. (eds.), Electronic lexicography in the 21st century. Proceedings of the eLex 2019 conference, pp. 417-433, 1-3 October 2019, Sintra, Portugal. Brno: Lexical Computing CZ, s.r.o.

Salgado, A. Costa, R. & Tasovac, T. (2019, conference abstract). TEI Lex-0: a good fit for the encoding of the Portuguese Academy Dictionary? PowerPoint slides presented at TEI Conference 2019, What is text, really? TEI and beyond, 16–20 September, University of Graz, Austria.

Other publications

Costa, R., Carvalho, S., Salgado, A., Simões, A. Tasovac, T. (2020, forthcoming). Ontologie des marques de domaines appliquée aux dictionnaires de langue générale. La lexicographie en tant que méthodologie de recherche en linguistique. Revue de Philologie Française et Romane – Langue(s) & Parole, n. 5.

Salgado, A., Sina, A., Simões, A., Costa, R., McCrae, J. (2020, in press). Challenges of Word Sense Alignment: Portuguese Language Resources. In 7th Workshop on Linked Data in Linguistics: Building tools and infrastructure, LREC 2020: LREC 2020 Workshop, Proceedings of 7th Workshop on Linked Data in Linguistics: Building Tools and Infrastructure, Marseille, France.

Ahmadi, S., McCrae, J., Nimb, S., Khan, F., Monachini, M., Pedersen, B., Declerck, T., Wissik, T., Bellandi, A., Pisani, I., Troelsgård, T., Olsen, S., Krek, S., Lipp, V., Váradi T., Simon, L., Gyorffy, A., Tiberius, C., Schoonheim, T., Ben Moshe, Y., Rudich, M., Abu Ahmad, R., Lonke, D., Kovalenko, K., Langemets, M., Kallas, J., Dereza, O., Fransen, T., Cillessen, D., Lindemann, D., Alonso, M., Salgado, A., Luis Sancho, J., Ureña-Ruiz, R.J., Porta Zamorano, J., Simov, K., Osenova, P., Kancheva, Z., Radev, I., Stanković, R., Perdih, A., & Gabrovsek, D. (2020). A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment. In Proceedings of The 12th Language Resources and Evaluation Conference (pp. 3225–3235), May Marseille, France. European Language Resources Association.

Toma Tasovac, Laurent Romary, Piotr Banski, Jack Bowers, Jesse de Does, Katrien Depuydt, Tomaž Erjavec, Alexander Geyken, Axel Herold, Vera Hildenbrandt, Mohamed Khemakhem, Snežana Petrović, Ana Salgado and Andreas Witt (2018). TEI Lex-0: A baseline encoding for lexicographic data. Version 0.8.5. DARIAH Working Group on Lexical Resources. https://dariah-eric.github.io/lexicalresources/pages/TEILex0/TEILex0.html.

Related work

Dicionário da Língua Portuguesa Contemporânnea

OntoDomLab-Med

About

Dicionário da Língua Portuguesa Contemporânea (DLPC, 2001)