Kavita Ganesan (kavgan)

kavgan

Geek Repo

Company:@opinosis-analytics

Location:Salt Lake City

Home Page:www.opinosis-analytics.com

Twitter:@kavita_ganesan

Github PK Tool:Github PK Tool

Kavita Ganesan's starred repositories

nlp-in-practice

Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.

Language:Jupyter NotebookStargazers:1124Issues:0Issues:0

robustness-gym

Robustness Gym is an evaluation toolkit for machine learning.

Language:PythonLicense:Apache-2.0Stargazers:439Issues:0Issues:0

blog-articles

Curated List of Blog Posts From Opinosis Analytics

Stargazers:2Issues:0Issues:0

word_cloud

Python word cloud library for use within Jupyter notebook and Python apps.

Language:Jupyter NotebookStargazers:47Issues:0Issues:0

tensor2tensor

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Language:PythonLicense:Apache-2.0Stargazers:14985Issues:0Issues:0

OpenNMT-py

Open Source Neural Machine Translation and (Large) Language Models in PyTorch

Language:PythonLicense:MITStargazers:6616Issues:0Issues:0

phrase-at-scale

Detect common phrases in large amounts of text using a data-driven approach. Size of discovered phrases can be arbitrary. Can be used in languages other than English

Language:PythonStargazers:125Issues:0Issues:0
Language:Jupyter NotebookStargazers:29Issues:0Issues:0
Language:Jupyter NotebookStargazers:48Issues:0Issues:0

workshops

A few exercises for use at events.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1455Issues:0Issues:0

keras-text-classification

CNN text classification using keras

Language:PythonStargazers:15Issues:0Issues:0

ROUGE-2.0

ROUGE automatic summarization evaluation toolkit. Support for ROUGE-[N, L, S, SU], stemming and stopwords in different languages, unicode text evaluation, CSV output.

Language:JavaLicense:Apache-2.0Stargazers:206Issues:0Issues:0

awesome-machine-learning-on-source-code

Cool links & research papers related to Machine Learning applied to source code (MLonCode)

License:CC-BY-SA-4.0Stargazers:6154Issues:0Issues:0

clinical-concepts

Discovering Related Clinical Concepts using Large Amounts of Clinical Notes. An unsupervised graphical approach to mine related concepts by leveraging the volume within large amounts of clinical notes.

License:GPL-3.0Stargazers:25Issues:0Issues:0

OpinRank

OpinRank Dataset. Dataset containing user reviews for entities namely cars and hotels. Full reviews from Tripadvisor (~259,000 reviews) and Edmunds (~42,230 reviews)

Stargazers:40Issues:0Issues:0

Mallet

MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text.

Language:JavaLicense:NOASSERTIONStargazers:966Issues:0Issues:0

spark-examples

Examples of code in spark

Language:PythonStargazers:10Issues:0Issues:0

nlp-cloud-apis

RxNLP APIs for clustering sentences, extracting topics, counting words & n-grams, extracting text from html or URL, computing similarity between texts and more.

Stargazers:15Issues:0Issues:0

opinosis-summarization

This repo contains code and dataset for the Opinosis Summarization Framework

License:Apache-2.0Stargazers:51Issues:0Issues:0

java-string-similarity

Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...

Language:JavaLicense:NOASSERTIONStargazers:2666Issues:0Issues:0