titipata / affiliation_parser

Simple python parser for MEDLINE, Pubmed OA affiliation string

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

odd and incorrect substitutions in clean_text() function

simonatdrg opened this issue · comments

Why are the lines
affil_text = re.sub('2 ', ' ', affil_text)
affil_text = re.sub('2. ', ' ', affil_text)
present ?

They create incorrect zip code results with an afiiliation string such as
'Department of Audiology, Speech-Language Pathology & Deaf Studies, Towson University, Towson, MD 21252, USA. ' as the zipcode is incorrectly modified

Oh! Thanks for pointing out @simonatdrg. I was concerned about affiliation string such as 2. Department of .... I'll make changes on that!