metinsenturk / text-analysis

A text analysis of Hacker News dataset.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Text Analysis

This library is for us to explore natural language processing (NLP) techniques. We will be implementing some of the NLP models to the Hacker News Dataset.

Meetings

In order keep the project active, we will be meeting on a weekly basis in order discuss and determine tasks for past and future assignments.

Meeting Details

Date Meeting # Description
2018.12.04 Meeting 1 Project structure and definition is discussed.

Meetings will be updated regularly.

## To Do List

To Do List of Release 0.1.0 (Current)

  • Defining project introduction
  • Defining objectives, KPIs and goals
  • General exploration of NLP based on courses, tutorials, etc.

To Do List of Release 0.1.x (Future)

  • Achieving consensus for objectives
  • Achieving consensus for EDA
  • Achieving consensus for data cleaning
  • Achieving consensus for feature selection
  • Achieving consensus for data modeling
  • Achieving consensus for model evaluation
  • Achieving consensus for performance metrics

Planned Releases

The following versions will be decided for the given accomplishments, respectively.

  • 0.0.1 Completing Project Structure & Definition
  • 0.1.0 First Draft of Documentation
  • 0.2.0 Expoloratory Data Analysis
  • 1.0.0 Text summarization
  • 2.0.0 Text Classification

Footnote

Project based on the cookiecutter data science project template. #cookiecutterdatascience

Project branching based on the A Successful Git Branching Model. By Vincent Driessen.

About

A text analysis of Hacker News dataset.