JaynouOliver / transtokenizers

transtokenizers

Token translation for language models

Features

  • TODO

Usage

from transtokenizers import transform_model
from transformers import AutoTokenizer, AutoModelForCausalLM

# Tokenizer of the model to convert, and the tokenizer it should use afterwards
source_tokenizer = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-v0.1")
target_tokenizer = AutoTokenizer.from_pretrained("pdelobelle/robbert-2023-dutch-base")

# Load the source language model
source_model = AutoModelForCausalLM.from_pretrained("mistralai/Mistral-7B-v0.1")

# Build a model that works with the target tokenizer
target_model = transform_model(source_model, source_tokenizer=source_tokenizer, target_tokenizer=target_tokenizer)
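
This README does not describe what transform_model does internally. As a purely conceptual illustration, the sketch below shows one way token translation can be approached: each token in the target vocabulary gets an embedding that is a weighted average of the source-token embeddings it is aligned with. The function name translate_embeddings, the toy vocabularies, and the alignment weights are all hypothetical and are not part of the transtokenizers API; the package itself may work differently.

```python
from typing import Dict, List, Tuple

Vector = List[float]


def translate_embeddings(
    source_embeddings: Dict[str, Vector],
    token_mapping: Dict[str, List[Tuple[str, float]]],
) -> Dict[str, Vector]:
    """Hypothetical helper: build target-token vectors from aligned source tokens.

    token_mapping maps each target token to (source_token, weight) pairs.
    """
    dim = len(next(iter(source_embeddings.values())))
    n_source = len(source_embeddings)
    # Fallback vector: the mean of all source embeddings.
    mean_vector = [
        sum(vec[i] for vec in source_embeddings.values()) / n_source
        for i in range(dim)
    ]

    target_embeddings: Dict[str, Vector] = {}
    for target_token, aligned in token_mapping.items():
        # Keep only alignments that point at known source tokens.
        known = [(s, w) for s, w in aligned if s in source_embeddings and w > 0]
        total = sum(w for _, w in known)
        if total == 0:
            # No usable alignment for this token: use the fallback.
            target_embeddings[target_token] = list(mean_vector)
            continue
        # Weighted average of the aligned source vectors.
        target_embeddings[target_token] = [
            sum(w * source_embeddings[s][i] for s, w in known) / total
            for i in range(dim)
        ]
    return target_embeddings


# Toy data (assumed for illustration): an English source and a Dutch target.
source_embeddings = {"the": [1.0, 0.0], "house": [0.0, 1.0], "home": [0.0, 3.0]}
token_mapping = {
    "de": [("the", 1.0)],
    "huis": [("house", 3.0), ("home", 1.0)],
    "xyz": [],  # no alignment available
}
target_embeddings = translate_embeddings(source_embeddings, token_mapping)
print(target_embeddings["huis"])  # [0.0, 1.5]
```

In this toy setup, "huis" lands between "house" and "home", weighted 3:1, while the unaligned token "xyz" falls back to the mean of all source vectors.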

Credits

About

License: MIT License

