jenojp / negspacy

spaCy pipeline object for negating concepts in text

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Support for Spanish language

Siva227 opened this issue · comments

I used the Negex algorithm to deal with Spanish text. With the help of a Spanish Language expert, a list of negex terms based on the lists provided in the original paper was created. I'd like to contribute those to this repository. To that effect, I've created a fork of this repository. I opened this issue so there could be a discussion as to how this extension can happen.

Hey, awesome idea! Happy to collaborate on it and make it happen.

@Siva227 - I pushed a change to the develop branch that should make it cleaner/easier to handle multi language support.
commit

Perhaps you'd like to update your fork and work from here? I believe you already have located Spanish termsets. You can update negspacy/termsets.py with them and then we can add a few test cases.

Great! I'll add the termsets to negspacy/termsets.py. Just so there's no confusion, these are the tags I used and the corresponding mappings to the tags you're using.

[PREN] - Prenegation rule tag - A term which negates the medical finding that follows it
[POST] - Postnegation rule tag - A term which negates the medical finding that precedes it
[PSEU] - Pseudo negation tag - Not a negation even though it contains a negation term
[CONJ] - Conjunction tag

PREN - Preceeding negations
POST - Following negations
PSEU - Pseudo negations
CONJ - Termination

I'll proceed with these and add necessary test cases.

Yup I believe that looks right!

Closing this issue due to inactivity

Hello! Is there any work on this lately?

Not that I’m aware of

Taken care of by #51 . Thanks @j6e !