There are 2 repositories under lexical-semantics topic.
[NAACL'21 & ACL'21] SapBERT: Self-alignment pretraining for BERT & XL-BEL: Cross-Lingual Biomedical Entity Linking.
OpenWordnet-PT: an open access wordnet for Portuguese
STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)
Data Sets and Models for Evaluation of Lexical Semantic Change Detection
An R-based guide to sampling Google n-gram data, building historical term-feature matrices & investigating lexical semantic change historically.
The implementation for "Measuring Fine-Grained Domain Relevance of Terms: A Hierarchical Core-Fringe Approach" (ACL '21)
Data for the DiMSUM shared task at SEMEVAL 2016
Code for EMNLP'20 paper "When Hearst Is not Enough: Improving Hypernymy Detection from Corpus with Distributional Models"
The Dataset and Official Implementation for <The ELCo Dataset: Bridging Emoji and Lexical Composition> @ LREC-COLING 2024
A Typed Event-Focused Lexical Inference Benchmark for Evaluating Natural Language Inference
Resources developed by and for the project REACTION (Retrieval, Extraction and Aggregation Computing Technology for Integrating and Organizing News) an initiative for developing a computational journalism platform (mostly) for Portuguese.
A systematic NLP framework that uses diachronic word embeddings to trace semantic shifts or variations in the context of words over time in the Greek language.
Towards Semantics-Enhanced Pre-Training: Can Lexicon Definitions Help Learning Sentence Meanings? (AAAI 2021)
A web service that exposes semantic similarity search via a web GUI and a RESTful API.
[ACL 2024] TaxoLLaMA: WordNet-based Model for Solving Multiple Lexical Sematic Tasks
An application for extracting certain data from BabelNet.
Creates a Neo4j graph database from Gavagai Living Lexicon entries
A system for inducing distributional sense-aware semantic classes labeled with hypernyms
Supplementary data for the COLING 2018 paper "Automatically Creating a Lexicon of Verbal Polarity Shifters: Mono- and Cross-lingual Methods for German" by Schulder, Wiegand and Ruppenhofer.
Supplementary data for the LREC 2018 paper "Introducing a Lexicon of Verbal Polarity Shifters for English" by Schulder, Wiegand, Ruppenhofer and Köser.
The code and data for "Understanding Jargon: Combining Extraction and Generation for Definition Modeling" (EMNLP '22)
A corpus of supersense-annotated adpositions and case markers in German natural-language text.
Path-based hypernym prediction models in WordNet (WN18RR-hp)
Hindi SNACS (Semantic Network of Adposition and Case Supersenses; Schneider et al., 2018) annotation scheme and guidelines.
Visualizations of diachronic word embeddings
Implementation of Lexical Analyzer ( Scanner ) part of compiler
Source code for DURel Annotation Tool
Complex Lexical Analyzer Using Python 3