argilla-io / argilla-plugins

🔌 Open-source plugins for with practical features for Argilla using listeners.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

`paraphrase` text of less represented records

davidberenstein1957 opened this issue · comments

Automatically paraphrase text from records that are less represented within the entire dataset.

MVP

from argilla_plugins.inference import paraphraser.

paraphraser(name="dataset", model="model_name", vector_name="vector", sim_threshold=0.9)
paraphraser.start()

Stretch
filtering variables like query could be added to limit the sync.