allenai / smashed

SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batching, and more. Supports datasets from Huggingface, torchdata iterables, or simple lists of dictionaries.

Home Page:https://pypi.org/project/smashed

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

allenai/smashed Issues