jacobsenj / ext-tika

A TYPO3 CMS extension that provides Apache Tika functionality

Apache Tika for TYPO3

A TYPO3 CMS extension that provides Apache Tika functionality, including:

  • text extraction
  • metadata extraction
  • language detection (from strings or files)

Tika can be used in three ways: as the standalone Tika app (jar), as a Tika server, or via SolrCell, which is integrated into Apache Solr.
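
To make the Tika server mode more concrete, here is a minimal sketch that sends a file straight to a Tika server with curl. It does not go through the TYPO3 extension. It assumes a server on localhost:9998 and uses the `/tika`, `/meta` and `/language/stream` paths of the Tika server REST API. The function names `tika_endpoint` and `tika_request` are made up for this example.

```shell
# Sketch only: talks to a plain Tika server, not to the TYPO3 extension.
# Assumption: the server listens on localhost:9998 (override with TIKA_URL).
TIKA_URL="${TIKA_URL:-http://localhost:9998}"

# Map an operation name (hypothetical names chosen for this example)
# to the corresponding Tika server REST path.
tika_endpoint() {
  case "$1" in
    text)     echo "/tika" ;;
    meta)     echo "/meta" ;;
    language) echo "/language/stream" ;;
    *)        echo "unknown operation: $1" >&2; return 1 ;;
  esac
}

# Upload a file with HTTP PUT and print the server's response.
tika_request() {
  local op="$1" file="$2" path
  path="$(tika_endpoint "$op")" || return 1
  curl --silent --show-error --fail --upload-file "$file" "${TIKA_URL}${path}"
}
```

With a server running, `tika_request text document.pdf`, `tika_request meta document.pdf` and `tika_request language document.pdf` would cover the three features listed above. The response format depends on the Tika version and on the `Accept` header, which this sketch leaves at curl's default.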

We're open to contributions!

For further information about Apache Tika, please see the project's homepage.

Continuous Integration

We use GitHub Actions for continuous integration.

To run the test suite locally, please use our DDEV Docker environment: https://github.com/TYPO3-Solr/solr-ddev-site

Note: This requires a proper combination of branches:

  • solr-ddev-site on release-12.0.x branch
    • packages/ext-solr on release-12.0.x
    • packages/ext-tika on release-12.0.x
  • Please refer to the version matrix for the proper combination of branches

    ddev enable tika
    ddev tests-unit-tika
    ddev tests-integration-tika
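
The branch requirement is easy to get wrong, so a small pre-flight check can help. The sketch below assumes it is run from the root of a solr-ddev-site checkout and that `packages/ext-solr` and `packages/ext-tika` are separate Git checkouts. The helper names are hypothetical. The expected branch defaults to `release-12.0.x` as listed above.

```shell
# Assumption: run from the solr-ddev-site root; each package directory
# is its own Git checkout. Override EXPECTED_BRANCH for other releases.
EXPECTED_BRANCH="${EXPECTED_BRANCH:-release-12.0.x}"

# Print the branch currently checked out in the given directory.
current_branch() {
  git -C "$1" rev-parse --abbrev-ref HEAD
}

# Succeed only if the given branch name equals the expected one.
branch_matches() {
  [ "$1" = "$EXPECTED_BRANCH" ]
}

# Check all three checkouts and report every mismatch on stderr.
check_branches() {
  local dir branch status=0
  for dir in . packages/ext-solr packages/ext-tika; do
    branch="$(current_branch "$dir")" || { status=1; continue; }
    if ! branch_matches "$branch"; then
      echo "$dir is on '$branch', expected '$EXPECTED_BRANCH'" >&2
      status=1
    fi
  done
  return "$status"
}

# Run the documented test commands only if the branches line up.
run_tika_tests() {
  check_branches || return 1
  ddev enable tika &&
    ddev tests-unit-tika &&
    ddev tests-integration-tika
}
```

Calling `run_tika_tests` stops at the first failing step, so a failure in the unit tests does not get buried under the integration test output.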

Contributions

  1. Fork the repository
  2. Clone the repository
  3. Create a new branch
  4. Make your changes
  5. Commit your changes to your fork. In your commit message refer to the issue number if there is already one, e.g. [BUGFIX] short description of fix (resolves #4711)
  6. Submit a Pull Request (here are some hints on How to write the perfect pull request)
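
The steps above can be sketched on the command line as follows. The helper `commit_subject` is hypothetical and only reproduces the commit message pattern from step 5. The fork URL, branch name and issue number in the comments are placeholders, not real values.

```shell
# Build a commit subject in the "[TYPE] summary (resolves #N)" style
# shown in step 5. The issue number is optional.
commit_subject() {
  local type="$1" summary="$2" issue="${3:-}"
  if [ -n "$issue" ]; then
    printf '[%s] %s (resolves #%s)\n' "$type" "$summary" "$issue"
  else
    printf '[%s] %s\n' "$type" "$summary"
  fi
}

# Typical flow (placeholders in angle brackets are assumptions):
#   git clone https://github.com/<your-user>/ext-tika.git
#   cd ext-tika
#   git checkout -b <your-branch>
#   ...make your changes...
#   git commit -a -m "$(commit_subject BUGFIX 'short description of fix' 4711)"
#   git push origin <your-branch>
```

After the push, the pull request itself is opened in the GitHub web interface.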

Keep your fork in sync with the original repository

  1. git remote add upstream https://github.com/TYPO3-Solr/ext-tika.git
  2. git fetch upstream
  3. git checkout master
  4. git merge upstream/master
  5. git push origin master
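
For repeated use, the five commands above can be wrapped in one function. This is a sketch, not part of the repository. The `run` and `sync_fork` names and the `DRY_RUN` switch are assumptions made for this example. It only adds the `upstream` remote when it is not configured yet, so it can be run more than once.

```shell
UPSTREAM_URL="https://github.com/TYPO3-Solr/ext-tika.git"

# Run a command, or only print it when DRY_RUN=1.
run() {
  if [ "${DRY_RUN:-0}" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

# Sync the given branch (default: master) of the fork with upstream.
sync_fork() {
  local branch="${1:-master}"
  # Add the upstream remote only if it is missing. In dry-run mode the
  # command is always printed, because no repository is inspected.
  if [ "${DRY_RUN:-0}" = "1" ] || ! git remote get-url upstream >/dev/null 2>&1; then
    run git remote add upstream "$UPSTREAM_URL" || return 1
  fi
  run git fetch upstream &&
    run git checkout "$branch" &&
    run git merge "upstream/$branch" &&
    run git push origin "$branch"
}
```

Running `DRY_RUN=1 sync_fork master` prints the commands without touching the repository, which is a cheap way to review them before a real run.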

License: GNU General Public License v3.0