Pek Yun Ning's repositories

LDA2Word_Topic_Modelling

LDA2Word is a novel topic modelling technique created in June 2019, by yours truly, Pek Yun Ning. It hybridises the older LDA and Word2Vec topic modelling algorithms to form this all-new algorithm.

Language:PythonStargazers:2Issues:2Issues:0

autofit-excel-cell-widths-using-xlwings

Autofit excel cell widths (using xlwings). Xlwings is an extremely efficient, state-of-the-art python library to manage, edit, and manipulate excel files & data. Its documentation can be viewed at https://readthedocs.org/projects/xlwings/downloads/pdf/stable.

Language:PythonStargazers:1Issues:0Issues:0

Doc2X_Topic_Modelling

Doc2X is a novel topic modelling technique created in June 2019, by yours truly, Pek Yun Ning. It hybridises the older Doc2Vec and Corex topic modelling algorithms to form this all-new algorithm.

Language:PythonStargazers:1Issues:0Issues:0

LDA2Char_Topic_Modelling

LDA2Char is a novel topic modelling technique created in June 2019, by yours truly, Pek Yun Ning. It hybridises the older LDA and FastText topic modelling algorithms to form this all-new algorithm.

Language:PythonStargazers:1Issues:0Issues:0

Logistic_Regression_-_Confusion_Matrix

An application of Logistic Regression and presentation of its accuracy and precision using Confusion Matrix. This was applied to a consumer complaints dataset.

Language:PythonStargazers:1Issues:0Issues:0

NLP-NER

Natural Language Processing and Named Entity Recognition to automatically get specific structured entities from unstructured texts / data input.

Language:PythonStargazers:1Issues:0Issues:0

Optimisation-Model_Augmentation

An Optimisation + Model Augmentation project to derive best plant growth recipe. Includes concepts like linear regression, genetic algorithm, euclidean distance, uniform criterion, and cloud computing (integration with AWS DynamoDB).

Stargazers:1Issues:0Issues:0

add-symbol-currency-2dp-to-every-excel-cell

Add symbol, currency, and/or 2 d.p. (to every excel cell)

Language:PythonStargazers:0Issues:0Issues:0

amazon-textract-enhancer

This workshop demonstrates how to build a Document parser and query engine with Amazon Textract and other services, such as ElasticSearch and DynamoDB.

Language:PythonLicense:MIT-0Stargazers:0Issues:0Issues:0

bold-excel-cells-using-openpyxl

Bold Excel Cells (using openpyxl)

Language:PythonStargazers:0Issues:0Issues:0

combine-excel-sheets-using-openpyxl

Combine excel sheets from two workbooks (using openpyxl)

Language:PythonStargazers:0Issues:0Issues:0

Corex_Topic_Modelling

Correlation Explanation (Corex) is a topic modelling technique that is great at identifying 'hidden' topics, or low-frequency-worded but representative topics, very well. It was originally created by Greg Ver Steeg.

Language:PythonStargazers:0Issues:0Issues:0

Data-Anonymisation

Anonymising data in meeting notes.

Language:PythonStargazers:0Issues:0Issues:0

Doc2Vec_Topic_Modelling

Doc2vec method of topic modelling. It's document-level of Word2vec. Builds on the concept of word vector representations.

Language:PythonStargazers:0Issues:0Issues:0

docs

TensorFlow documentation

License:Apache-2.0Stargazers:0Issues:0Issues:0

excel--unique-values-in-columns-and-difference-between-two-columns

Excel - find unique values in columns, & difference between two columns

Language:PythonStargazers:0Issues:0Issues:0

FastText_Topic_Modelling

FastText is a topic modelling technique originally created by Facebook AI Research team. Its first stable version release was on December 2018. FastText is now available on Python's gensim and scikit-learn as well.

Language:PythonStargazers:0Issues:0Issues:0

LDA2X_Topic_Modelling

LDA2X is a novel topic modelling technique created in June 2019, by yours truly, Pek Yun Ning. It hybridises the older LDA and Corex topic modelling algorithms to form this all-new algorithm.

Language:PythonStargazers:0Issues:0Issues:0

LDA2XPand_Topic_Modelling

LDA2XPand is a novel topic modelling technique created in June 2019, by yours truly, Pek Yun Ning. It hybridises the older LDA, Corex, and Word2Vec topic modelling algorithms to form this all-new algorithm.

Language:PythonStargazers:0Issues:0Issues:0

LDA_Topic_Modelling

Latent Dirichlet Allocation (LDA) is a topic modelling technique that involves a three-layered probabilistic approach, taking into account words at words, documents, and corpus level. It is accompanied by its very own unique and powerful data visualisation tool, LDAvis (as part of this code in its Pythonic version, pyLDAvis), as well.

Language:PythonStargazers:0Issues:0Issues:0

Model_Augmentation

Model Augmentation by adding more sample data points to an existing (small) dataset in a mathematical manner. Carried out using mathematical concepts such as Euclidean Distance and Uniform Criterion (max-of-min concept). This adds exploration factor to the model, reduces prediction error, and improves global accuracy (in optimisation).

Stargazers:0Issues:0Issues:0

Named-Entity-Recognition-Spacy-NLTK

Identifies first names, last names, DOBs, and IDs on scanned documents and IDs.

Language:PythonStargazers:0Issues:0Issues:0

NLP-with-Python

Scikit-Learn, NLTK, Spacy, Gensim, Textblob and more

Stargazers:0Issues:0Issues:0

NMF_Topic_Modelling

Non-negative Matrix Factorisation (NMF) is a topic modelling technique that uses matrix factorisation concepts to identify topics and top words that describe each topic.

Language:PythonStargazers:0Issues:0Issues:0

OCR-Entities-from-JSON-AWS-Textract

Changing output of AWS Textract, from JSON to Python readable format.

Stargazers:0Issues:0Issues:0

Optimisation_for_Plant_Growth

Achieve optimal plant growth through derivation of best growth recipe (i.e. input growth parameters) using Genetic Algorithm and Modelling (e.g. Linear Regression) methods. Data sends to & retrieves from AWS' DynamoDB; the latter is a centralised system for IoT integration between controls & sensors, for an integrated smart plant growth system.

Stargazers:0Issues:0Issues:0

popup-box-to-select-file-using-tkinter

Pop-up dialog box to select file (using tkinter)

Language:PythonStargazers:0Issues:0Issues:0

retrieve-current-date-time

Retrieve current date & time

Language:PythonStargazers:0Issues:0Issues:0

sekritfiles

Test your national security credentials.

License:CC0-1.0Stargazers:0Issues:0Issues:0

Spacy-Language-Detector

Using 'spacy-langdetect' Python library. Able to identify 55 different languages.

Stargazers:0Issues:0Issues:0