GMMDMDIDEMS / abbreviation-extractor

An abbreviation extractor written in Rust

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

abbreviation-extractor

The abbreviation-extractor is a tool designed to identify and extract abbreviations from PDF documents. This Rust implementation is inspired by the Schwartz-Hearst1 algorithm and is intended to be useful for researchers, scholars and people dealing with academic PDF content.

References

Footnotes

  1. Schwartz, Ariel & Hearst, Marti. (2003). A Simple Algorithm For Identifying Abbreviation Definitions in Biomedical Text. Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing. 4. 451-62. 10.1142/9789812776303_0042.

About

An abbreviation extractor written in Rust