musadac / DBLP-HADOOP

Parsing of DBLP XML USING PYTHON AND THEN EXTRACTING MEANINGFUL DATA USING HADOOP MAP-REDUCE AND MONGODB

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

DBLP-HADOOP

Parsing of DBLP XML USING PYTHON AND THEN EXTRACTING MEANINGFUL DATA USING HADOOP MAP-REDUCE AND MONGODB

SYSTEM_REQUIREMENTS

  • Hadoop
  • Python3
  • MongoDB

DATA AND SOURCE

ERA AND CORE DATA (Conference Data)

PYTHON LIBRARY

  • pymongo
  • pandas
  • numpy
  • pymongo[srv]
  • lxml

About

Parsing of DBLP XML USING PYTHON AND THEN EXTRACTING MEANINGFUL DATA USING HADOOP MAP-REDUCE AND MONGODB

License:Apache License 2.0


Languages

Language:Python 100.0%