allenai / dolma

Data and tools for generating and inspecting OLMo pre-training data.

Home Page:https://allenai.github.io/dolma/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Progress Bar may use more resources than necessary

soldni opened this issue · comments

from @dirkgr:

When I have a console window open that watches dolma tokens, it literally takes 1.5 cores permanently on my Mac.