SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batching, and more. Supports datasets from Huggingface, torchdata iterables, or simple lists of dictionaries.
Home Page:https://pypi.org/project/smashed
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool