e-valente / malware-collector

The goal of this project is monitoring social networks to search malicious urls. The current version is working for Twitter: Based on trending topics tweets in a certain period of time, it searches for malicious urls inside of these tweets. The system parses urls efficiently and scalably. After parsing them, it verifies if they are linked with a malicious binary file (i.e., malware). If so, it inserts the malicious url in a database.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Malware-collector

Requirements

  • python 2.7
  • mongodb

How to use:

python producer.py -m stream|history -t <#trending_topic> | -l (list trends)

Example

The following command lists top 10 trending topics in Brazil

python producer.py -l
1 - #DiaMundialRBD
2 - Jack Gilinsky
3 - #EuNaVida
4 - #WhinderssonNoAltasHoras
5 - #UFC192
6 - #AHSRedCarpet
7 - Osório
8 - MARA NOSSO ORGULHO
9 - Cormier
10 - MEU PEITO TE CHAMA LS

Once we have top ten tending topics list, we choose one for store on DB (mongo). Below, I chose the number 5 (#UFC192):

python producer.py -m stream -t 5
Streaming mode for trending topic: #UFC192
Inserted: 1
Inserted: 2
Inserted: 3
Inserted: 4
Inserted: 5
Inserted: 6

At this point, tweets related to #UFC195 are stored on MongoDB. I suggest you use tools like robomongo for visualizing data.

About

The goal of this project is monitoring social networks to search malicious urls. The current version is working for Twitter: Based on trending topics tweets in a certain period of time, it searches for malicious urls inside of these tweets. The system parses urls efficiently and scalably. After parsing them, it verifies if they are linked with a malicious binary file (i.e., malware). If so, it inserts the malicious url in a database.


Languages

Language:Python 100.0%