fasttext-embeddings

There are 5 repositories under the fasttext-embeddings topic.

jasoncao11 / nlp-notebook
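Implementations of common NLP tasks, including new-word discovery, plus PyTorch-based word embeddings, Chinese text classification, named entity recognition, abstractive text summarization, sentence similarity, triple extraction, pretrained models, and more.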
textcnn textrcnn bilstm-crf-model bilstm-attention fasttext-embeddings transformer-pytorch bert-chinese textrcnn-bert distill-bert seq2seq gpt2 text-classification glove skip-gram nlp pytorch bert natural-language-processing bert-ner electra
Language:Python 533
dccuchile / spanish-word-embeddings
Spanish word embeddings computed with different methods and from different corpora
nlp spanish word-embeddings glove-embeddings fasttext-embeddings word2vec-embeddinngs
359
explosion / floret
🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy
spacy fasttext fasttext-embeddings word-vectors word-embeddings subword-embeddings
Language:C++ 320
avidale / compress-fasttext
Tools for shrinking fastText models (in gensim format)
python nlp word-embeddings fasttext-embeddings fasttext
Language:Jupyter Notebook 180
thinkingmachines / christmAIs
Text to abstract art generation for the holidays!
machine-learning abstract-art perception fasttext-embeddings xmas
Language:Python 90
ikergarcia1996 / MetaVec
A monolingual and cross-lingual meta-embedding generation and evaluation framework
embedding embedding-evaluation embedding-models embedding-vectors embeddings emnlp2021 fasttext fasttext-embeddings meta-embedding meta-embeddings word2vec
Language:Python 80
ashalogic / Persian-Sentiment-Analyzer
Persian sentiment analysis
lstm persian persian-nlp persian-sentiment sentiment-analysis machine-learning python dotnet-core javascript farsi fasttext fasttext-embeddings word2vec embeddings persian-sentiment-analysis tutorial colab persian-sentiment-analyzer tensorflow nlp
Language:Jupyter Notebook 55
hbahadirsahin / nlp-experiments-in-pytorch
PyTorch repository for text categorization and NER experiments in Turkish and English.
pytorch nlp turkish-language text text-categorization ner named-entity-recognition english-language fasttext-embeddings torchtext charcnn textcnn vdcnn conv-deconv transformer padam text-classification text-classifier conditional-random-fields
Language:Python 36
cambridgeltl / ContrastiveBLI
Improving Word Translation via Two-Stage Contrastive Learning (ACL 2022). Keywords: Bilingual Lexicon Induction, Word Translation, Cross-Lingual Word Embeddings.
bilingual-lexicon-induction word-translation contrastive-learning self-learning cross-lingual-word-embeddings mbert pytorch word-alignment cross-lingual-embeddings bilingual-lexicon-extraction bilingual-word-embedding word-embeddings fasttext-embeddings bilingual-dictionary-induction cross-lingual-word-embedding low-resource-machine-translation information-retrieval machine-translation
Language:Python 34
ashokc / Word-Embeddings-and-Document-Vectors
An evaluation of word-embeddings for classification
fasttext-embeddings word2vec elasticsearch scikitlearn-machine-learning naive-bayes-classifier neural-networks
Language:Python 32
JoyeBright / DeepSentiPers
Repository for the experiments described in the paper named "DeepSentiPers: Novel Deep Learning Models Trained Over Proposed Augmented Persian Sentiment Corpus"
sentiment-analysis wordembeddings keras neural-network lstm cnn fasttext-embeddings classification polarity opinion-mining score persian-sentiment persian-sentiment-analysis data-augmentation deep-neural-networks corpus dataset architectures
Language:Jupyter Notebook 32
PlanTL-GOB-ES / lm-legal-es
Language Models for the legal domain in Spanish done @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).
fasttext-embeddings language-model legal-texts roberta spanish-language
28
priyanshu2103 / Sanskrit-Hindi-Machine-Translation
Machine Translation from Sanskrit to Hindi using Unsupervised and Supervised Learning
fasttext-embeddings hindi machine-translation monolingual-corpora parallel-corpus sanskrit sanskrit-english
Language:Jupyter Notebook 19
tien02 / ensemble-roberta-fasttext-vietnamese
Ensemble PhoBERT with FastText Embedding to improve performance on Vietnamese Sentiment Analysis tasks.
bert bert-fine-tuning fasttext fasttext-embeddings fine-tuning gensim-word2vec lstm mlp-classifier nlp pytorch pytorch-lightning svm-classifier text-classification phobert sentiment-analysis sentiment-classification
Language:Python 16
cambridgeltl / BLICEr
Improving Bilingual Lexicon Induction with Cross-Encoder Reranking (Findings of EMNLP 2022). Keywords: Bilingual Lexicon Induction, Word Translation, Cross-Lingual Word Embeddings.
bilingual-dictionary-induction bilingual-lexicon-extraction bilingual-lexicon-induction bilingual-word-embedding cross-encoder cross-lingual-embeddings cross-lingual-word-embeddings fasttext-embeddings pytorch reranking self-learning word-alignment word-embeddings word-translation xlm-r xlm-roberta low-resource-machine-translation cross-lingual-word-embedding information-retrieval machine-translation
Language:Python 13
BlackKakapo / Romanian-Word-Embeddings
Romanian Word Embeddings. Here you can find pre-trained corpora of word embeddings. Current methods: CBOW, Skip-Gram, Fast-Text (from Gensim library). The .vec and .model files are available for download (all in one archive).
nlp word2vec romanian romanian-language cbow skip-gram fasttext fasttext-embeddings vectors corpus sentences words vocabulary genism
12
pranaychandekar / fasttext-embeddings-with-flair
This project contains the code to use custom fastText embeddings with the flair framework.
fasttext-embeddings flair nlp machine-learning
Language:Python 11
pythonandml / dlbook
Repository for the free online book Oddly Satisfying Deep Learning from Scratch (link below!)
backpropagation cnn-classification cnn-keras convolutional-neural-networks elmo-embedding fasttext-embeddings glove-embeddings keras mlp python pytorch word-embeddings word2vec mlp-scratch-numpy
Language:Jupyter Notebook 11
MastersProject / fast-detection-of-duplicate-bug-report
Machine learning-based solution to the problem of duplicate reports in a bug report repository.
duplicate-reports classification clustering detection lda topic-modeling bag-of-words word2vec-model glove-embeddings fasttext-embeddings pca duplicate-bug-report cosine-similarity euclidean-distances top-n-recommendations eclipse bug-tracking-system feature-extraction data-preprocessing multimodality
Language:Jupyter Notebook 10
PlanTL-GOB-ES / Biomedical-Word-Embeddings-for-Spanish
Biomedical Word embeddings generated from Spanish Biomedical corpora.
embeddings spanish-language biomedical fasttext-embeddings
10
BotCenter / spanishWordEmbeddings
Spanish Word Embeddings computed from large corpora and different sizes using fastText.
nlp natural-language-processing embeddings fasttext-embeddings spanish spanish-language
9
manashpratim / Tweet-Classification
Detect hate speech in tweets
natural-language-processing tweet-classifier bidirectional-lstm glove-embeddings fasttext-embeddings
Language:Jupyter Notebook 9
miladfa7 / Persian-Word-Embedding
Persian Word Embedding using FastText, BERT, GPT and GloVe | Persian word embeddings with various methods
bert fasttext-embeddings gpt persian persian-nlp word-embeddings word-vectors
7
TheSaintIndiano / Fake-News-Detection
Let's hunt fake news using Word2Vec, GloVe, FastText, or German embeddings learned from the corpus.
news fake-news articles word2vec glove-embeddings fasttext-embeddings matplotlib seaborn plotly pca tsne sklearn naive-bayes xgboost keras lstm python jupyter
Language:HTML 7
barbaraneves / gender-bias-in-virtual-assistants
Final Project of the Data Science postgraduate class at MDCC/UFC
data-science gender-detection gender-classification toxic-comment-classification toxicity virtual-assistant dialogue-systems deep-learning bert-model lstm-model fasttext-embeddings gender-bias gender-based-violence
Language:Jupyter Notebook 5
apdullahyayik / Tr-topicter
🔍 A simple topic detector.
turkish-nlp turkish-language topic-classification text-classification fasttext fasttext-embeddings python
Language:Python 4
quamernasim / Role-Based-Access-Control-of-Qdrant-Vector-Database
Explore how to perform Role-Based Access Control in the Qdrant Vector Database
fasttext-embeddings qdrant qdrant-client qdrant-vector-database rbac rbac-configuration rbac-roles role-based-access-control
Language:Jupyter Notebook 4
mhmdsab / Spam-Classifier
Spam classifier with a dataset of 5000 emails
spam-classifier machine-learning python tf-idf-vectorizer word2vec fasttext fasttext-embeddings fasttext-model spam-filtering keras embeddings
Language:Jupyter Notebook 3
reyeon1209 / PressCheck
✔ Machine learning-based online article analysis service ✔
beautifulsoup4 expressjs fasttext-embeddings kobert node-js nodemailer reactjs word-rank
Language:Python 3
sambitbhaumik / siamese-nn-sts
Project files contain PyTorch implementations for Siamese BiLSTM models for Semantic Text Similarity task on the SICK Dataset using FastText embeddings. Also contains Siamese BiLSTM-Transformer Encoder and SBERT fine-tuning implementations on the STS Data tasks.
bilstm nlp sbert self-attention sentence-embeddings sentence-similarity sentence-transformers transformers fasttext-embeddings pytorch
Language:Jupyter Notebook 3
Vidhi1290 / Word2Vec-and-FastText-Word-Embedding-with-Gensim-in-Python
This project explores the realm of Natural Language Processing (NLP) using Word2Vec and FastText models. Dive into domain-specific embeddings, analyze clinical trials data related to Covid-19, and uncover the power of AI and ML in understanding textual data.🌟
fasttext-embeddings genism jupyter-notebook machine-learning matplotlib nlp nltk numpy pandas plotly python streamlit word2vec
Language:Jupyter Notebook 3
FrederickRoman / fasttextAPI
Unofficial minified fastText API. Use it to run NLP DL models that require word embeddings on the client side.
fasttext fasttext-embeddings machine-learning natural-language-processing nextjs nlp-apis public-api rest-api word-embeddings pwa-app
Language:TypeScript 2
mmarouen / marabou
Natural language processing and computer vision use cases for non-technical users
cnn computer-vision deep-learning elmo embeddings fasttext-embeddings lstm machine-learning natural-language-processing tf-idf word2vec
Language:CSS 2
shamiul94 / Amazon-Review-Classifier-FastText-LSTM
This is one of my fun projects. It's a review classifier based on Amazon's reviews dataset hosted on Kaggle. I used FastText and Deep Learning model LSTM to build it.
fasttext fasttext-embeddings fasttext-python deep-learning lstm-neural-networks lstm word2vec-model word2vec classifier-model review amazon kaggle kaggle-dataset python rnn keras gensim tutorial
Language:Jupyter Notebook 2
mariagabv / lasolana-embeddings
Word Embeddings for the town of La Solana (Ciudad Real)
fasttext-embeddings wordembeddings lasolana
Language:HTML 1
quamernasim / Hindi-Language-AI-Chatbot-for-Enterprises-using-Qdrant-LangChain-Ollama-Llama3-FastText-and-MLFlow
RAG-powered AI chatbot for an Indian language (Hindi) using LangChain, Ollama, Qdrant, and MLFlow
docker fasttext-embeddings gradio langchain langchain-python llama3 llama3-meta-ai mlflow mlflow-tracking ollama qdrant qdrant-client qdrant-vector-database
Language:Jupyter Notebook 1
