Dan Lou's repositories

LMMS

Language Modelling Makes Sense - WSD (and more) with Contextual Embeddings

Language: Python | License: NOASSERTION | Stargazers: 92 | Issues: 5 | Issues: 4

MedLinker

ECIR 2020 - MedLinker: Medical Entity Linking with Neural Representations and Dictionary Matching

safespace

Your local AI counselor. LLM app that runs offline from a single binary.

Language: Python | License: MIT | Stargazers: 27 | Issues: 1 | Issues: 3

telegram-reddit

Unofficial Telegram Bot for Reddit

Language: Python | License: MIT | Stargazers: 23 | Issues: 3 | Issues: 0

bert-disambiguation

Code and CoarseWSD-20 datasets for "Language Models and Word Sense Disambiguation: An Overview and Analysis"

Language: Python | Stargazers: 22 | Issues: 3 | Issues: 0

dspt8

Jupyter Notebook about Exploring Embeddings for NLP used in DSPT #8

Language: Jupyter Notebook | License: MIT | Stargazers: 2 | Issues: 1 | Issues: 0

MedLinker-Social

UMLS Medical Entity Linking. Adaptation of MedLinker (ECIR 2020) for the Social Domain.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2 | Issues: 1 | Issues: 1

BERT-related-papers

BERT-related papers

Stargazers: 0 | Issues: 1 | Issues: 0

bertram

This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations".

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

danlou.github.io

Personal page

Stargazers: 0 | Issues: 2 | Issues: 0

DEKCOR-CommonsenseQA

Official code for the paper "Fusing Context Into Knowledge Graph for Commonsense Question Answering"

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

freqpickle

Pickled word frequencies extracted from wordfreq for direct usage

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

mlm-scoring

Python library & examples for Masked Language Model Scoring (ACL 2020)

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

pelican-blog

Pelican generator for danlou.github.io

Language: Makefile | Stargazers: 0 | Issues: 0 | Issues: 0

pelican-hyde

Pelican theme based on Hyde Jekyll theme

Language: CSS | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

scispacy

A full spaCy pipeline and models for scientific/biomedical documents.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0
Stargazers: 0 | Issues: 0 | Issues: 0

synbert

Probing Commonsense Knowledge in Pre-trained Language Models with Sense-level Precision and Expanded Vocabulary

License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

timelms

TimeLMs

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

UWA

Word Sense Disambiguation (WSD) dataset - Unambiguous Word Annotations (UWA)

License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

wn-lists

Useful lists generated from WordNet data

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0

ytdata

Obtain video details for YouTube channels.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0