KennethEnevoldsen / scandinavian-sentence-transformers

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Scandinavian Sentence Transformers

Black

This project is intended for training Danish, Swedish and Norwegian sentence transformers. The project is an extension of the Danish Foundation models project.

Installation

You can install scandinavian-sentence-transformers via pip:

git clone {repo url}
cd scandinavian-sentence-transformers
pip install -e .

but we recommend using invoke for the setup:

git clone {repo url}
cd scandinavian-sentence-transformers

# install invoke
pip install invoke
# setup up virtual environment and install dependencies
inv setup

Train model

To train the models you wi

inv prepare_dataset --lang da
inv train --model_name vesteinn/DanskBERT

Evaluate

pip install scandeval
scandeval --model-id dfm-sentence-encoder-medium

About

License:MIT License


Languages

Language:Python 100.0%