giellalt / lang-rus

Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Russian language

Home Page: https://giellalt.uit.no

Stress on multi-word expressions

reynoldsnlp opened this issue

Taken from reynoldsnlp/udar#19

The underlying lexical form needs a persistent stress mark that survives the two-level rule that reduces stresses to the right-most one. For example:

красно-жёлтых
так как
так что
то есть

Search through an fst2strings version of a stressed transducer for any words with stresses on both sides of spaces and hyphens. Something like this: egrep ":.*[ё́̀].*(% |-).*[ё́̀]"
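The same search can be sketched in Python, which makes the combining characters explicit. This is only a rough equivalent of the egrep pattern above. It assumes each line has the form `upper:lower`, that spaces appear escaped as `% `, and that ё is the precomposed character U+0451. The name `has_stress_on_both_sides` is made up for illustration.

```python
import re

ACUTE = "\u0301"  # combining acute accent (primary stress)
GRAVE = "\u0300"  # combining grave accent (secondary stress)
YO = "\u0451"     # precomposed Cyrillic small letter io

# Any one of the three "stress-bearing" characters from the egrep class.
STRESS = f"[{YO}{ACUTE}{GRAVE}]"

# A colon, a stress mark, an escaped space or hyphen, then another stress mark.
MULTI_STRESS = re.compile(rf":.*{STRESS}.*(?:% |-).*{STRESS}")


def has_stress_on_both_sides(line: str) -> bool:
    """Return True if the line has stress marks on both sides of a separator."""
    return MULTI_STRESS.search(line) is not None
```

For example, iterating over the lines of a saved fst2strings dump and printing those for which `has_stress_on_both_sides(line)` is true should give roughly the same candidates as the egrep command.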

See giellatekno subversion revision 181941. Most of these have been fixed by changing preceding stress marks that should persist into secondary stress marks ('\u0300').
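As a rough illustration of that kind of change (not the actual edit made in the revision), the following Python sketch turns every acute accent except the right-most one into a grave accent. The function name `demote_preceding_stress` is hypothetical, and the sketch assumes that every preceding stress should persist, which the source says holds only for some entries.

```python
ACUTE = "\u0301"  # combining acute accent (primary stress)
GRAVE = "\u0300"  # combining grave accent (secondary stress)


def demote_preceding_stress(form: str) -> str:
    """Replace every acute accent before the right-most one with a grave."""
    last = form.rfind(ACUTE)
    if last == -1:
        # No primary stress marked; leave the form unchanged.
        return form
    # Only the part left of the final acute is rewritten.
    return form[:last].replace(ACUTE, GRAVE) + form[last:]
```

Applied to a form of "так как" with an acute on each word, this would leave a grave on the first word and the acute on the second.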

Entries like красно-жёлтых came from the RNC addendum to Zalizniak, so they don't have stress marked at all. They already had a TODO comment to work with a native speaker to place stress. I added a comment that this should be done using secondary stress marks.