Overview

This repo contains the code for the presentation of our talk on how to use LLMs for CTI purposes.

Use-cases for LLMs in CTI

In general, there are a couple of use-cases for LLMs in CTI. The most important use cases are:

UC 1: Summarization of free text CTI
UC 2: NER (Name Entity Recognition)
UC 3: Q&A (Answering questions on CTI texts via RAG)
UC 4: TTP Tagging (extract the TTPs from the text)
UC 5: Graph relationship extraction: extract the graph of who did what with with tools against whom etc... (the "w" questions).

Please note that UC 5 can help the other use-cases. If you have the graph of the relationships in a texth, then answering questions (UC 3) becomes easier.

Each use-case has its own subdirectory, please go to the individual subdirs and check their README files.

Dataset attribution

The STIX reports are pulled from the following sources:

Palo Alto UNIT42 github
CISA advisories website

About

An LLM for CTI reports - to be presented at FIRST Fukuoka 2024

ai cti cybersecurity llms

Languages

Language:Jupyter Notebook 67.6%Language:HTML 15.8%Language:Python 15.4%Language:JavaScript 0.9%Language:CSS 0.3%Language:Dockerfile 0.0%Language:Makefile 0.0%