dzianis-pirshtuk

Research confirms that social media provides good insight into what people think, feel, and are concerned about. Insights mined from Twitter data are expected to support better decision-making, especially in the public sector. Public-sector bodies want to understand local residents' views, so they need to make sure they are using conversations from residents. However, the ground truth shows that tweets come from a mix of residents and tourists. This study investigates the best automatic model for classifying tweets posted by residents and tourists in Nusa Tenggara Barat (NTB), Indonesia. To do so, several consecutive phases were conducted: pre-processing, data training, classification, data testing, accuracy comparison, and result visualization. First, a Twitter dataset of 700,000 tweets posted by approximately 26,000 users in Nusa Tenggara Barat, Indonesia, was prepared. The dataset was divided into two sets: tweets from 4,000 users for training and tweets from 22,000 users for testing. Then, three popular classification algorithms were applied to the datasets: Multinomial Naïve Bayes, Support Vector Machines, and Decision Tree. After that, seven features were created: Bag of Words, Normalizer location, Total Tweet, Total Day, Tweet per Day, Total Location, and Location per Day. Experiments show that Multinomial Naïve Bayes with the Bag of Words feature achieves 86% accuracy, while the remaining features give less than 65% accuracy. The results differ for Support Vector Machines and Decision Tree: these two algorithms produce better accuracy with the features other than Bag of Words. This implies that Support Vector Machines and Decision Tree are more powerful when processing numerical values. However, among all the classifiers, Multinomial Naïve Bayes remains the most accurate algorithm for the model.
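As a rough illustration of the best-performing combination described above (Multinomial Naïve Bayes over Bag of Words), the following is a minimal pure-Python sketch. It is not the study's code: the whitespace tokenizer, the Laplace smoothing value `alpha`, the class and function names, and the four toy training texts are all assumptions made up for this example.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase whitespace tokenizer (assumed; the study's pre-processing is not specified)."""
    return text.lower().split()


class MultinomialNB:
    """Multinomial Naive Bayes over bag-of-words counts with Laplace smoothing."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha  # smoothing strength (assumed value)

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)      # documents per class
        self.word_counts = defaultdict(Counter)  # per-class word frequencies
        self.vocab = set()
        for text, label in zip(texts, labels):
            tokens = tokenize(text)
            self.word_counts[label].update(tokens)
            self.vocab.update(tokens)
        self.total_words = {
            c: sum(self.word_counts[c].values()) for c in self.class_counts
        }
        return self

    def predict(self, text):
        n_docs = sum(self.class_counts.values())
        vocab_size = len(self.vocab)
        best_label, best_score = None, -math.inf
        for c in sorted(self.class_counts):
            # log prior + sum of smoothed log likelihoods of known words
            score = math.log(self.class_counts[c] / n_docs)
            denom = self.total_words[c] + self.alpha * vocab_size
            for tok in tokenize(text):
                if tok in self.vocab:  # words never seen in training are ignored
                    score += math.log((self.word_counts[c][tok] + self.alpha) / denom)
            if score > best_score:
                best_label, best_score = c, score
        return best_label


# Toy, made-up examples; real input would be the labelled tweets of the training users.
train_texts = [
    "macet lagi di jalan ke kantor pagi ini",
    "pasar dekat rumah ramai sekali hari ini",
    "holiday trip to the beach amazing sunset",
    "our hotel has a great view love this vacation",
]
train_labels = ["resident", "resident", "tourist", "tourist"]

model = MultinomialNB().fit(train_texts, train_labels)
print(model.predict("sunset at the beach near our hotel"))
print(model.predict("jalan ke pasar macet pagi ini"))
```

The numeric features (Total Tweet, Tweet per Day, and so on) would not fit this word-count model directly, which is consistent with the abstract's observation that they work better with Support Vector Machines and Decision Tree.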

Language: Python | MIT | 010

conda-tensorflow

Apache-2.0 | 010

conda_auto_activate

Automatically activate a conda environment when entering folders/project.

Language: Shell | MIT | 010

conference

A WebRTC signaling server with support of MQTT and WebSocket as transport protocols, token based authentication (JSON Web Token) and external policy based authorization.

Language: Rust | MIT | 000

Dato-Core

The open source core of the GraphLab ML library

010

deep-face-generation-and-editing-a-survey

Deep Face Generation and Editing: A Survey

000

dest

:panda_face: One Millisecond Deformable Shape Tracking Library (DEST)

Language: C++ | BSD-3-Clause | 010

display-advertising-challenge

Criteo/Kaggle Competition of CTR prediction

Language: Java | MIT | 000

Dynamic-Customer-Targeting-in-R

Language: R | 010

dzianis-pirshtuk.github.io

Language: HTML | 000

gender-detector

Library for guessing a person's gender by their first name.

Language: Python | GPL-2.0 | 010

geonamescache

geonamescache - a Python library for quick access to a subset of GeoNames data.

Language: Python | MIT | 000

gitlabhq

GitLab is version control for your server

Language: Ruby | MIT | 010

harvester

The Social Harvest server that exposes an API and harvests data from the web to be analyzed.

Language: Go | NOASSERTION | 010

kaggle-2014-criteo

Language: C++ | NOASSERTION | 000

kaggle_criteo

Software for the kaggle criteo challenge

Language: C# | MIT | 000

onnx

Open standard for machine learning interoperability

Language: C++ | Apache-2.0 | 000

polyglot

Multilingual text (NLP) processing toolkit

Language: Jupyter Notebook | NOASSERTION | 000

protobuf-to-dict

A small Python library for creating dicts from protocol buffers. Useful as an intermediate step before serialization (e.g. to JSON).

Language: Python | NOASSERTION | 000

python-instagram

Python Client for Instagram API

Language: Python | NOASSERTION | 000

realtime-analytics

Language: Java | NOASSERTION | 010

rep

Reproducible Experiment Platform is a collaborative software infrastructure for computational experiments on shared big datasets, which allows obtaining reproducible, repeatable results and consistent comparisons of the obtained results.

Language: Python | NOASSERTION | 010

tfjs

A WebGL accelerated JavaScript library for training and deploying ML models.

Language: TypeScript | Apache-2.0 | 000

tfjs-models

Pretrained models for TensorFlow.js

Apache-2.0 | 000