ihebski / ioc_extractor

IOC Extractor is a small web application that parses CTI (cyber threat intelligence) PDF reports and extracts possible IOCs (indicators of compromise) page by page. It reduces the time and effort CTI analysts spend reading through long documents.

IOC Extractor

  • Project in progress

Motivation/Goals

  • Personal learning journey into CTI work
  • Assist CTI analysts by providing shortcuts
  • Map IOCs to MITRE ATT&CK

The project is not yet complete, and there is room for enhancement and code improvement.

Future Notes:

  • Add a parser for DOCX files
  • Fix serialization issues with SQLAlchemy
  • Add multi-threading
  • Email results when a scan finishes
  • Add a MITRE ATT&CK map
  • Match extracted IOCs with threats
  • Add a dashboard
  • Improve the UI

Usage

  • Script only

    python3 ioc_extractor.py scan <PATH_PDF_REPORT>

  • Web application

    python3 webapp.py
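
To give a rough idea of what per-page IOC extraction can look like, here is a minimal sketch. It is not the project's actual code. It assumes the text of each PDF page has already been extracted into a list of strings, and the names `IOC_PATTERNS` and `extract_iocs_per_page` are hypothetical. The regular expressions are deliberately simple and do not handle defanged indicators such as `hxxp://` or `[.]`.

```python
import re

# Hypothetical, simplified patterns for illustration only.
IOC_PATTERNS = {
    "ipv4": re.compile(
        r"\b(?:(?:25[0-5]|2[0-4]\d|1?\d?\d)\.){3}(?:25[0-5]|2[0-4]\d|1?\d?\d)\b"
    ),
    "md5": re.compile(r"\b[a-fA-F0-9]{32}\b"),
    "sha256": re.compile(r"\b[a-fA-F0-9]{64}\b"),
    "url": re.compile(r"\bhttps?://[^\s<>]+"),
}


def extract_iocs_per_page(pages):
    """Return {page_number: {ioc_type: sorted unique matches}}.

    `pages` is a list of page texts; page numbers start at 1.
    Pages without any match are left out of the result.
    """
    results = {}
    for page_number, text in enumerate(pages, start=1):
        found = {}
        for ioc_type, pattern in IOC_PATTERNS.items():
            # De-duplicate matches and sort them for stable output.
            matches = sorted(set(pattern.findall(text)))
            if matches:
                found[ioc_type] = matches
        if found:
            results[page_number] = found
    return results


if __name__ == "__main__":
    # Made-up page texts standing in for extracted PDF pages.
    sample_pages = [
        "Executive summary with no indicators.",
        "The dropper contacted 198.51.100.23 and fetched https://example.com/payload.bin",
        "Sample hash: d41d8cd98f00b204e9800998ecf8427e",
    ]
    for page, iocs in extract_iocs_per_page(sample_pages).items():
        print(page, iocs)
```

Grouping results by page number lets an analyst jump straight to the pages of a long report that contain candidate indicators.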

License: MIT License


Languages

  • Python 71.5%
  • HTML 26.2%
  • Mako 2.1%
  • Procfile 0.2%