Mario Šaško's repositories
datasets_sql
An extension package of 🤗 Datasets that provides support for executing arbitrary SQL queries on HF datasets
dask
Parallel computing with task scheduling
Language:PythonBSD-3-Clause000
filesystem_spec
A specification that python filesystems should adhere to.
gcsfs
Pythonic file-system interface for Google Cloud Storage
Language:PythonBSD-3-Clause000
tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Language:PythonMIT000
transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.