David S. Batista's repositories
Annotated-Semantic-Relationships-Datasets
A collections of public and free annotated datasets of relationships between entities/nominals (Portuguese and English)
NER-datasets
Datasets to train supervised classifiers for Named-Entity Recognition in different languages (Portuguese, German, Dutch, French, English)
NER-Evaluation
An implementation of a full named-entity evaluation metrics based on SemEval'13 Task 9 - not at tag/token level but considering all the tokens that are part of the named-entity
Aspect-Based-Sentiment-Analysis
Aspect-Based Sentiment Analysis Experiments
text-classification
An example on how to train supervised classifiers for multi-label text classification using sklearn pipelines
ConvNets-for-Sentence-Classification
"Convolutional Neural Networks for Sentence Classification" (Kim 2014) - https://www.aclweb.org/anthology/D14-1181
machine-learning-notebooks
Assorted exercises and proof-of-concepts to understand and study machine learning and statistical learning theory
REACTION-resources
Resources developed by and for the project REACTION (Retrieval, Extraction and Aggregation Computing Technology for Integrating and Organizing News) an initiative for developing a computational journalism platform (mostly) for Portuguese.
SLANG-Sequence-LAbeliNG
Sequence LAbeliNG with Neural Networks: "Neural Architectures for Named Entity Recognition" (Lample et al., 2016) and "End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF" (Ma, 2016)
Toponym-Disambiguation-Using-Ontology-Based-Semantic-Similarity
Toponym Disambiguation using Ontology-based Semantic Similarity.
GermEval-2019-Task_1
GermEval 2019 Task 1 - Shared Task on Hierarchical Classification of Blurbs
sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
davidsbatista.net
my personal homepage and blog
politiquices
Explore relações de apoio e oposição, entre personalidades políticas, expressas em títulos de notícias preservadas no arquivo.pt
Awesome-CV
:page_facing_up: Awesome CV is LaTeX template for your outstanding job application
chilosopher.com
webpage for my musical experiments
jena-docker
Docker image for Apache Jena riot
Multimodal-Toolkit
Multimodal model for text and tabular data with HuggingFace transformers as building block for text data
spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python