unfoldingWord / translationCore

Repository for the desktop application translationCore

Home Page:https://www.translationcore.com

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

translationCore does not correctly split words in language `my` in wA tool

PhotoNomad0 opened this issue · comments

Story Explanation

User Story

translationCore does not correctly split words in language my in wA tool. See in this example USFM file: 57-TIT_myanmar_judson_1835_utf8.usfm.txt. It looks like the first word in 2:1 is split into two words and a letter is dropped. Notice the two words in the word list compared to one word in the source text:

Screenshot 2022-11-05 at 3.04.55 PM.png

Notes:

  • This appears to be a tokenizer error.
  • the bug is also present in a two year old issue: #6788

Features / Specifications

  • [ ]
  • [ ]
  • [ ]

Definition of Done

  • [ ]
  • [ ]
  • [ ]

Additional Context

Mockups