kiangkiangkiang / Apartment-Recommended-system

House for you - Multiple Machine Learning Algorithm Combination, Statistical Evidence, and Web Visualization

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

House for you - Apartment Recommended System

We combine multiple machine learning algorithms, python crawling skills, Taiwan Open Datasets, Jeiba preprocessing, and 591 rent website to develop an Apartment Recommended System, which is called house for you. For the frontend visualization, we use HTML, JavaScript, CSS, and also provide statistical chart for data analysis. Through the web design and algorithm implement, the recommended system owns a high accuracy, which is evidenced by statistics called spearman correlation.

House for you provides the most suitable rented apartments according to the users' favor. Whenever you come to an alien environment, using "house for you" can solve what you are annoyed about living.

Datasets

We take advantage of multiple datasets in the Taiwan open datasets at https://data.gov.tw/dataset/6564, and combine with the same address, which will lead into a complete data table and benefit of analysis. In this way, we can facilitate the analysis of the environment effective factor of the apartment, where the user is favor of but has no understanding yet. Quantify the factor and include into the cost–performance ratio we design for the prediction.

On the other hand, crawled the 591 (https://www.591.com.tw/) apartment-rented system using python until 3 May 2019, and 5362 records have been collected. For the preprocessing, the Jeiba to segment Chinese words and regular expression was implemented. After the work, we will get the internal effective factor of the apartment that renter providing, like the image in website, furnitures, and discount, etc. Likewise, Quantify the factor and include into the cost–performance ratio we design for the prediction.

Workflow

First, take the two datasets mentioned above, use The World Geodetic System 1984 (WGS84) to convert the Latitude and Longitude, and complete the combination of data that we use for analysis.

Second, consider the distance between the apartment and working place, we designed a center location method to find the most suitable location for living.

Further, implement TFIDF and TFPDF in furnitures and Elo Rating System in the image aesthetic level for quantify into the cost–performance.

Final, combine the kmeans and scatter-gather algorithms to find the customized favor index for the cost–performance, and a complete apartment recommended list will come into the view.

Usage

(For center location method)

  1. Input working places or destinations (recommend a complete addredd)

  2. Input time limits (you can accept) respectively of the places

(For customized favor index)

  1. Choose the apartment you prefer for 5 times.

After these, the recommended list will come up.

About

House for you - Multiple Machine Learning Algorithm Combination, Statistical Evidence, and Web Visualization


Languages

Language:JavaScript 46.5%Language:HTML 40.3%Language:CSS 10.5%Language:PHP 2.7%