d33bs / demo-dvc-possible-push-bug

Demonstrating a DVC bug with pushing data to remotes in certain circumstances.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

demo-dvc-possible-push-bug

Demonstrating a possible DVC bug with pushing data to remotes in certain circumstances.

Installation

Please use Python poetry to run and install related content. The Poetry environment for this project includes dependencies which help run IDE environments, manage the data, and run workflows.

# after installing poetry, create the environment
poetry install

Poe the Poet

Use Poe the Poet to define and run tasks defined within pyproject.toml under the section [tool.poe.tasks*]. This allows for the definition and use of a task workflow when implementing multiple procedures in sequence.

For example, use the following to run the dvc_possible_bug task:

# run data_prep task using poethepoet defined within `pyproject.toml`
poetry run poe dvc_possible_bug

Output

There are two files which may help demonstrate the findings: dvc_list_output_1.txt and dvc_list_output_2.txt.

  • dvc_list_output_1.txt: shows a listing of files within the dir after data generation, dvc add, and dvc push.
  • dvc_list_output_2.txt: shows a listing of files within the dir after data removal, and dvc pull.

About

Demonstrating a DVC bug with pushing data to remotes in certain circumstances.

License:BSD 3-Clause "New" or "Revised" License


Languages

Language:Python 100.0%