Auto-AbDab is a Pathogen-Specific Automated Antibody Database Builder.
The pipeline works as follows:
- The user inputs keywords related to a disease (e.g. 'SARS-CoV-2', 'COVID-19', 'coronavirus', 'SARS-CoV', 'MERS-CoV', and 'SARS').
- The pipeline searches the following sources for antibodies:
  - the full texts and supplementary data of publications retrieved from PubMed or bioRxiv,
  - the Protein Data Bank (PDB),
  - the National Genetic Sequence Database (GenBank),
  - patents.
- The pipeline obtains biological information for each antibody, such as its sequence, germline, and structure, using ANARCI and SAbDab.
- The pipeline returns the pathogen-specific antibody database to the user.
The pipeline is demonstrated for SARS-CoV-2, though antibody databases for other pathogens may be generated as well.
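
To illustrate the keyword step, here is a minimal sketch of how records gathered from the sources above might be filtered against the user's keywords. It is not Auto-AbDab's actual code: the `Record` class, `compile_keywords`, and `filter_records` are hypothetical names introduced for this example, and the matching rule (case-insensitive, whole-token) is an assumption.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Record:
    """Hypothetical container for one search hit (not an Auto-AbDab type)."""
    source: str      # e.g. "PubMed", "PDB", "GenBank", "patent"
    identifier: str  # accession, DOI, or patent number
    text: str        # title, abstract, or description to search


def compile_keywords(keywords):
    """Build one case-insensitive pattern that matches any keyword as a whole token."""
    # Longest first, so 'SARS-CoV-2' is tried before 'SARS-CoV' and 'SARS'.
    ordered = sorted(keywords, key=len, reverse=True)
    alternatives = "|".join(re.escape(k) for k in ordered)
    # The lookarounds stop a keyword from matching inside a longer alphanumeric word.
    return re.compile(
        rf"(?<![A-Za-z0-9])(?:{alternatives})(?![A-Za-z0-9])",
        re.IGNORECASE,
    )


def filter_records(records, keywords):
    """Return the records whose text mentions a keyword, skipping repeated entries."""
    matcher = compile_keywords(keywords)
    seen = set()
    hits = []
    for record in records:
        key = (record.source, record.identifier)
        if key in seen:
            continue  # same entry already examined
        seen.add(key)
        if matcher.search(record.text):
            hits.append(record)
    return hits


KEYWORDS = ["SARS-CoV-2", "COVID-19", "coronavirus", "SARS-CoV", "MERS-CoV", "SARS"]
```

Calling `filter_records(records, KEYWORDS)` on a list of `Record` objects returns only the entries that mention one of the keywords. Sequence annotation and structure lookup would follow as separate steps.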