Leon Derczynski (leondz)


Company: NVIDIA · ITU Copenhagen

Location: Copenhagen · Seattle

Home Page: https://www.derczynski.com

Twitter: @leonderczynski



Organizations
ITUnlp

Leon Derczynski's repositories

emerging_entities_17

Dataset for the Emerging & Novel Entity NER task (WNUT '17)

dagw_page

The Danish Gigaword project

generalised-brown

C++ implementation of Generalised Brown clustering and Python scripts for feature generation (AAAI 2016)

Language: C++ · Stargazers: 2 · Issues: 2 · Issues: 0

iu-nlpml

Innopolis Natural Language Processing & Machine Learning

Language: Jupyter Notebook · Stargazers: 2 · Issues: 3 · Issues: 0

awesome-danish

A curated list of awesome resources for Danish language technology

Language: Python · Stargazers: 1 · Issues: 3 · Issues: 0

medtermfilter

Medical term filtering for Twitter capture archives

Language: Python · Stargazers: 1 · Issues: 2 · Issues: 0

authorless-tms

Repository for code and metadata to support work described in "Authorless Topic Models: Biasing Models Away from Known Structure"

Language: Python · Stargazers: 0 · Issues: 1 · Issues: 0

blog

Public repo for HF blog posts

Language: Jupyter Notebook · Stargazers: 0 · Issues: 2 · Issues: 0

branchLSTM

branchLSTM model from Turing at SemEval-2017 Task 8: Sequential Approach to Rumour Stance Classification with Branch-LSTM

Language: Python · License: MIT · Stargazers: 0 · Issues: 3 · Issues: 0

carbontracker

Track and predict the energy consumption and carbon footprint of training deep learning models.

Language: Python · License: MIT · Stargazers: 0 · Issues: 2 · Issues: 0

Creative-Commons-Markdown

Markdown-formatted Creative Commons licenses

Stargazers: 0 · Issues: 1 · Issues: 0

danlp

DaNLP is a repository for Natural Language Processing resources for the Danish Language.

Language: Python · License: BSD-3-Clause · Stargazers: 0 · Issues: 0 · Issues: 0

datasets

🤗 The largest hub of ready-to-use NLP datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Language: Python · License: Apache-2.0 · Stargazers: 0 · Issues: 2 · Issues: 0

entity-recognition-datasets

A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.

Language: Python · License: MIT · Stargazers: 0 · Issues: 3 · Issues: 0
Language: Python · License: MIT · Stargazers: 0 · Issues: 1 · Issues: 0

fastText

Library for fast text representation and classification.

Language: HTML · License: MIT · Stargazers: 0 · Issues: 3 · Issues: 0
Language: Jupyter Notebook · License: MIT · Stargazers: 0 · Issues: 1 · Issues: 0

huggingface_hub

All the open source things related to the Hugging Face Hub.

Language: Python · License: Apache-2.0 · Stargazers: 0 · Issues: 1 · Issues: 0

itu-algorithms.github.io

Web pages for ITU Algorithms research group

Language: HTML · Stargazers: 0 · Issues: 2 · Issues: 0

iu-misinfo

Innopolis Misinformation course

Language: Jupyter Notebook · Stargazers: 0 · Issues: 3 · Issues: 0

NLP-progress

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

License: MIT · Stargazers: 0 · Issues: 2 · Issues: 0
Language: Jupyter Notebook · Stargazers: 0 · Issues: 2 · Issues: 0
Language: Python · Stargazers: 0 · Issues: 2 · Issues: 0

pheme-twitter-conversation-collection

Twitter conversation collection script, which collects all replies to a given tweet

Language: Python · License: Apache-2.0 · Stargazers: 0 · Issues: 2 · Issues: 0

RWKV-LM

RWKV v2 is an RNN with transformer-level performance. It can be trained directly like a GPT transformer (parallelizable), so it combines the best of RNNs and transformers: great performance, fast inference, low VRAM use, fast training, "infinite" ctx_len, and free sentence embedding.

Language: Python · License: BSD-2-Clause · Stargazers: 0 · Issues: 1 · Issues: 0

RWKV-v2-RNN-Pile

RWKV-v2-RNN trained on the Pile. See https://github.com/BlinkDL/RWKV-LM for details.

Language: Python · License: BSD-2-Clause · Stargazers: 0 · Issues: 2 · Issues: 0

slurm

yet another network load monitor

Language: C · License: GPL-2.0 · Stargazers: 0 · Issues: 1 · Issues: 0
Stargazers: 0 · Issues: 1 · Issues: 0

zfs

ZFS on Linux - the official OpenZFS implementation for Linux.

Language: C · License: NOASSERTION · Stargazers: 0 · Issues: 2 · Issues: 0