ltgoslo / talk-of-norway

This repository makes available the Talk of Norway (ToN) dataset, a collection of Norwegian parliament speeches from 1998 to 2016. Every speech is richly annotated with metadata pulled from different sources, and augmented with sentence, token, lemma, part-of-speech and morphological feature annotations.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Huggingface

simeneide opened this issue · comments

Great dataset, think it will do exactly what I want!

Do you have any plans on publishing it as a huggingface dataset?

Thanks for the interest! We have no plans of publishing the data on Huggingface at the moment, unfortunately.

@erikve has mentioned the possibility for updating the data in the future(?)

Ok. just wanted to add that comment here as I almost didnt find this dataset. its not too hard and great for discoverability :) I could be happy to help out