punnkam / ucsd-courses

Semantic search of UCSD courses built with BERT pre-trained models and Pinecone vector database.

Home Page: https://tritonspanner.streamlit.app/

Course Directory for UCSD

Semantic search of UCSD courses built with BERT pre-trained models and Pinecone vector database.

Triton Spanner Page
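
For readers new to semantic search, here is a minimal sketch of the ranking idea: each course description becomes a vector, the query becomes a vector, and the courses are ordered by cosine similarity to the query. This is not the code in this repo. The course ids and the 3-dimensional vectors below are made up for illustration; in the real app the vectors would come from a BERT model and the nearest-neighbour lookup would be handled by Pinecone.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat a zero vector as "no similarity"
    return dot / (norm_a * norm_b)


def top_k(query_vec, index, k=2):
    """Return the k course ids whose vectors are closest to query_vec."""
    scored = [(cosine_similarity(query_vec, vec), course)
              for course, vec in index.items()]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [course for _, course in scored[:k]]


# Hypothetical toy index: made-up ids and hand-written vectors,
# standing in for BERT embeddings stored in a vector database.
toy_index = {
    "EX 101": [0.9, 0.1, 0.0],
    "EX 102": [0.1, 0.9, 0.1],
    "EX 103": [0.8, 0.2, 0.1],
}

query = [1.0, 0.0, 0.0]  # hypothetical embedding of the search text
results = top_k(query, toy_index, k=2)
print(results)
```

A brute-force scan like this is fine for a handful of vectors; a vector database exists so the same lookup stays fast over many thousands of embeddings.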

to use

pip install -r requirements
streamlit run app.py

to scrape and retrain model

python3 scraper.py
python3 embeddings.py

make sure you have a Pinecone account and store your key in .env
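
a rough sketch of how a key stored in .env can be read, using only the standard library. The variable name PINECONE_API_KEY and the helper names load_env and get_pinecone_key are assumptions for illustration; check the scripts for the name they actually expect. The project may well use a library such as python-dotenv instead of a hand-written parser.

```python
import os


def load_env(path=".env"):
    """Parse simple KEY=VALUE lines from a .env file into a dict."""
    values = {}
    with open(path, encoding="utf-8") as fh:
        for raw in fh:
            line = raw.strip()
            # Skip blank lines, comments, and anything without '='.
            if not line or line.startswith("#") or "=" not in line:
                continue
            key, _, value = line.partition("=")
            values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def get_pinecone_key(path=".env", name="PINECONE_API_KEY"):
    """Return the key from the environment, falling back to the .env file.

    `name` is an assumed variable name, not one confirmed by the repo.
    """
    key = os.environ.get(name)
    if not key and os.path.exists(path):
        key = load_env(path).get(name)
    if not key:
        raise RuntimeError(f"{name} not found in environment or {path}")
    return key
```

with this in place, a .env file containing a line such as PINECONE_API_KEY=your-key-here would be picked up by get_pinecone_key(). keep .env out of version control so the key is not published.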

note: scraped data was not thoroughly validated and there could be some name/description mismatches


Languages

Python 94.2%, Dockerfile 5.4%, Shell 0.4%