SoftwareImpacts / SIMPAC-2023-531

Software for Network-Based Textual Cohesion Analysis

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

CohesionNet

Software for Network-Based Textual Cohesion Analysis

Analyze indices of textual cohesion of an English text using semantic networks of cliques with a Shiny GUI. The indices used are those proposed in Oliveira et al. (2024).

The app was based on the script available at https://doi.org/10.24433/CO.0568085.v1, which was developed and used for the study reported in the article “Indices of Textual Cohesion by Lexical Repetition Based on Semantic Networks of Cliques” (Oliveira et al. 2024).

This app is an improvement over the script on which it was based because it can be used with any English text, provided it was manually cleaned, and uses a Shiny graphical user interface to facilitate the process of calculating the cohesion indices. Manual cleaning is necessary to ensure that the file to be processed contains a single text and that the caracters [.:?!…] are used exclusively as sentence delimiters. The process of manual cleaning must replace any other uses of these characters with different ones; for instance replacing decimal separators like in "1.5" with underscores, making it "1_5". Additionally, the app uses a different udpipe model to parse the text, with improved performance in universal part of speech tagging and lemmatization.

How to cite

Oliveira, D. A., Senna, V., & Pereira, H. B. B. (2024). Indices of textual cohesion by lexical repetition based on semantic networks of cliques. Expert Systems with Applications, 237, 121580. https://doi.org/10.1016/j.eswa.2023.121580

License

Copyright 2023 Davi Alves Oliveira

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

About

Software for Network-Based Textual Cohesion Analysis

License:Apache License 2.0


Languages

Language:Tcl 22.9%Language:R 15.2%Language:SCSS 13.8%Language:JavaScript 13.4%Language:C 11.2%Language:HTML 10.9%Language:CSS 4.0%Language:TeX 3.5%Language:C++ 2.7%Language:Makefile 0.6%Language:Python 0.4%Language:Awk 0.3%Language:Shell 0.3%Language:Lua 0.3%Language:Roff 0.2%Language:q 0.1%Language:Less 0.0%Language:Perl 0.0%Language:AppleScript 0.0%Language:Raku 0.0%Language:TypeScript 0.0%