Nic-Nic / paper_parser

Script to parse files for scientific article URLs

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

paper_parser

The following script parses all files in all subfolders of the specified main directory (e.g. Slack archive files) and extracts any URL related/pointing to a scientific article.

DISCLAIMER: Performance of the extraction is highly dependent on the provided excluder and includer keywords below

Usage

Usage is pretty straightforward, modify the input variables at the beginning of the script as you see fit and run it.

License

Copyright (C) 2020 NicolĂ s Palacio
Contact: nicolas.palacio@bioquant.uni-heidelberg.de

GNU-GLPv3:
This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation. This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details. A full copy of the GNU General Public License can be found on http://www.gnu.org/licenses/.

About

Script to parse files for scientific article URLs

License:GNU General Public License v3.0


Languages

Language:Python 100.0%