isro01 / ConversationalRobot

ROBO CLUB SUMMER PROJECT

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

ConversationalRobot

CTC_MODEL

This model is implemented to convert the audio messages of the user into text.

ctc

CTC_Model Output

Selection_050

ENCODER - DECODER MODEL

This model is implemtented to conver the response generation part of the conversational bot.

ENCODER

encoder

DECODER

decoder

LDA MODEL

This model is implemented to add topic awareness to ENCODER - DECODER Model for better response generation.

Optimal Number of Topics

This graph shows the optimal number of topics we need to set for news articles dataset.

optimal_topics

Gensim LDA Model parameters

  • corpus — Stream of document vectors or sparse matrix of shape (num_terms, num_documents) <
  • id2word – Mapping from word IDs to words. It is used to determine the vocabulary size, as well as for debugging and topic printing.
  • num_topics — The number of requested latent topics to be extracted from the training corpus.
  • random_state — Either a randomState object or a seed to generate one. Useful for reproducibility.
  • update_every — Number of documents to be iterated through for each update. Set to 0 for batch learning, > 1 for online iterative learning.
  • chunksize — Number of documents to be used in each training chunk.
  • passes — Number of passes through the corpus during training.
  • alpha — auto: Learns an asymmetric prior from the corpus
  • per_word_topics — If True, the model also computes a list of topics, sorted in descending order of most likely topics for each word, along with their phi values multiplied by the feature-length (i.e. word count)

About pyLDAvis

  • The size of the bubbles tells us how dominant a topic is across all the documents (our corpus)
  • The words on the right are the keywords driving that topic
  • The closer the bubbles the more similar the topic. The farther they are apart the less similar
  • Preferably, we want non-overlapping bubbles as much as possible spread across the chart.

About

ROBO CLUB SUMMER PROJECT


Languages

Language:Jupyter Notebook 100.0%Language:Python 0.0%Language:Shell 0.0%