docs on how to apply DeepRC to BCR

Question

docs on how to apply DeepRC to BCR

antonkulaga opened this issue 3 years ago · comments

Unfortunately, in the paper I did not understand if I can make a training dataset from BCR AIRR seq. Could you clarify in the readme and/or manuscript if it is possible and if yes if any additional steps (in comparison with TCR-s) are needed

Michael Widrich · Answer 1 · Thu Apr 15 2021 22:01:02 GMT+0800 (China Standard Time)

Hi! The implementation in the repo is rather general and should work with any kind of repertoires, as long as you can provide it in a suitable text-based format (please see https://github.com/ml-jku/DeepRC/blob/master/deeprc/datasets/README.md for the expected text-based data format).
In the paper we only conducted experiments with TCR data or simulated repertoire data but I would expect it to work similarly well for BCR data if you have a large enough dataset.
I would recommend to start by using https://github.com/ml-jku/DeepRC/blob/master/deeprc/examples/example_single_task_cnn.py with your dataset. Your text-based dataset will be automatically compressed to a hdf5 file for performance reasons. Preprocessing shouldn't be required.
Did this answer your questions? If yes, I will incorporate this into the readme.

Michael Widrich · Answer 2 · Fri May 28 2021 16:48:49 GMT+0800 (China Standard Time)

marked as resolved since no response