impresso / impresso-passim

This repository contains code and sample data related to running the impresso corpus through the text reuse detection software passim.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

impresso-passim

Description

This repository contains code and sample data related to running the impresso corpus through the text reuse detection software passim.

TODO: link to Programming Historian lesson Detecting Text Reuse with Passim as soon as it will be published.

License

The 'impresso - Media Monitoring of the Past' project is funded by the Swiss National Science Foundation (SNSF) under grant number CRSII5_173719 (Sinergia program). The project aims at developing tools to process and explore large-scale collections of historical newspapers, and at studying the impact of this new tooling on historical research practices. More information at https://impresso-project.ch.

Copyright (C) 2020 The impresso team (contributors to this program: Matteo Romanello).

This program is free software: you can redistribute it and/or modify it under the terms of the GNU Affero General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of merchantability or fitness for a particular purpose. See the GNU Affero General Public License for more details.

About

This repository contains code and sample data related to running the impresso corpus through the text reuse detection software passim.

License:GNU Affero General Public License v3.0


Languages

Language:Jupyter Notebook 100.0%Language:Python 0.0%