About | Features | Technologies | Requirements | Starting | License | Author
Describe your project It is Course Recommendation Engine built uisng similarity matrix techinique in python with help of pandas and numpy.
- Neat text β for cleaning of descriptions and titles
- Pandas β for data frame handling
- Sklearn CountVectorizer,TfidfVectorizer
- Sklearn cosine similarity,linear kernel
- First we will get all the data and use description and title for getting keywords. We will use neat text for getting rid of all pro propositions and other characters from the description.
- Building a new column of clean titles and descriptions from original columns to help convert them.
- now we will convert the cleaned titles into Vector using sklearn CountVectorizer,TfidfVectorizer.
- Now we will use cosine similarity to generate a similarity score between courses and other parameters
- Now we will sort the scores to get the top 10 scores when a keyword is entered.
Cosine similarity is the measure of similarity between two vectors, by computing the cosine of the angle between two vectors projected into multidimensional space. It can be applied to items available on a dataset to compute similarity to one another via keywords or other metrics. Similarity between two vectors (A and B) is calculated by taking the dot product of the two vectors and dividing it by the magnitude value as shown in the equation below. We can simply say that the CS score of two vectors increases as the angle between them decreases.
βοΈ Feature 1; βοΈ Feature 2; βοΈ Feature 3;
The following tools were used in this project:
Before starting π, you need to have Git and python installed.
# Clone this project
$ git clone https://github.com/{{YOUR_GITHUB_USERNAME}}/crs
# Access
$ cd crs
# Install dependencies
$ pip install -r requirements.txt
# Run the project
$ streamlit run app.py
# The server will initialize in the <http://localhost:3000>
This project is under license from MIT. For more details, see the LICENSE file.
Made with β€οΈ by Paritosh Tripathi