Mechanism for rebuilding database file from scratch

simonw opened this issue 2 years ago

Simon Willison commented 2 years ago

Will need this to fix:

#2
#4

Simon Willison · Answer 1 · Tue Nov 22 2022 02:01:01 GMT+0800 (China Standard Time)

Can copy this pattern:

https://github.com/simonw/datasette.io/blob/5455068f5ffdb8cd3f09a4d84d94b7512a46b18e/.github/workflows/deploy.yml?q=%22if%3A%22+user%3Asimonw+path%3A.github%2Fworkflows%2F*.yml#L34-L37

on:
  workflow_dispatch:
    inputs:
      from_scratch:
        description: Enter 'skip' to create a new database from scratch

    - name: Download previous content.db
      if: github.event.inputs.from_scratch != 'skip'
      run: |
        curl -O https://datasette.io/content.db
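
A minimal sketch of how the two fragments above might sit together in one workflow file. The workflow name, job name and runner are placeholders that do not come from the linked file, and the download URL is the one from the copied pattern, so it would need to point at this project's own database.

```yaml
name: Build database          # hypothetical workflow name

on:
  workflow_dispatch:
    inputs:
      from_scratch:
        description: Enter 'skip' to create a new database from scratch
        required: false

jobs:
  build:                      # hypothetical job name
    runs-on: ubuntu-latest
    steps:
      - name: Download previous content.db
        # Skipped only when a manual run is started with from_scratch set to 'skip'.
        # Runs from other triggers have no inputs, so the download still happens.
        if: github.event.inputs.from_scratch != 'skip'
        run: |
          curl -O https://datasette.io/content.db
```

With this shape, typing `skip` into the manual trigger's input box leaves no previous database in place, so whatever builds the file afterwards starts from an empty state.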

Simon Willison · Answer 2 · Tue Nov 22 2022 02:04:22 GMT+0800 (China Standard Time)

I also want to be sure that things don't get weird if I'm trying to run a "rebuild" task but one of the scheduled tasks kicks in and runs at the same time.

https://docs.github.com/en/actions/using-jobs/using-concurrency can help there:

concurrency: scraper
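
The same setting can be written in its expanded form, which makes the behaviour explicit. This is a sketch, assuming the key is placed at the top level of the workflow and that an in-progress run should be allowed to finish:

```yaml
# Runs that share the group name "scraper" never execute at the same time.
# A run that starts while another is in progress waits as "pending".
concurrency:
  group: scraper
  # Assumption: do not cancel a run that is already in progress (this is the default).
  cancel-in-progress: false
```

Note that GitHub keeps at most one pending run per group: if a third run is queued while one is running and one is pending, the older pending run is cancelled.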

Simon Willison · Answer 3 · Tue Nov 22 2022 02:29:57 GMT+0800 (China Standard Time)

It's not working right:

[{"table": "namespaces", "count": 1},
 {"table": "commits", "count": 2},
 {"table": "item", "count": 1},
 {"table": "item_version", "count": 2},
 {"table": "columns", "count": 3},
 {"table": "item_changed", "count": 5}]

I think that's because I need to check out the full repo history, not the default shallow checkout.

with:
  fetch-depth: 0
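
For context, a sketch of the checkout step that those two lines would belong to. The step name and the `@v3` version tag are assumptions, not taken from the workflow in this issue:

```yaml
steps:
  - name: Check out full repo history
    uses: actions/checkout@v3   # version tag is an assumption
    with:
      # The default depth is 1 (only the latest commit).
      # 0 fetches all history for all branches and tags.
      fetch-depth: 0
```

A full clone takes longer on a large repository, but anything that walks the commit log needs every commit to be present.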

Simon Willison · Answer 4 · Tue Nov 22 2022 02:34:34 GMT+0800 (China Standard Time)

This works correctly now.