jerryji1993 / DNABERT

DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome

Home Page: https://doi.org/10.1093/bioinformatics/btab083


Customizing training data for fine-tuning the model

sumin5784 opened this issue · comments

Hello,

I'm trying to fine-tune DNABERT-5/6 on my own dataset.
I have a couple of questions about this.

  1. Do all input sequences need to be the same length, or can they vary in length?
  2. Do the labels have to be 0/1? In other words, can the classification task have more than two classes?

Any feedback would be appreciated.
Thank you!
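
For anyone preparing similar data, here is a minimal sketch of how a fine-tuning file could be laid out, assuming the tab-separated `sequence<TAB>label` format used by the sample data in this repo, where each sequence is written as space-separated overlapping k-mers. The file name, header row, example sequences, and labels below are made up for illustration, not taken from this thread:

```python
def seq2kmer(seq, k):
    """Split a raw DNA sequence into overlapping k-mers joined by spaces."""
    return " ".join(seq[i:i + k] for i in range(len(seq) - k + 1))

# Hypothetical examples: the raw sequences have different lengths, and the
# labels are integer class ids (0, 1, 2, ...). Whether more than two classes
# work depends on the classifier head being configured with a matching
# number of labels.
examples = [
    ("ATCGTACGATCGATCG", 0),
    ("GGCATCGATT", 1),
    ("TTAGCCGTAGCTAGCTA", 2),
]

k = 6  # 6 for DNABERT-6; use 5 for DNABERT-5
with open("train.tsv", "w") as f:
    f.write("sequence\tlabel\n")  # header row, mirroring the repo's sample data
    for seq, label in examples:
        f.write(f"{seq2kmer(seq, k)}\t{label}\n")
```

The `seq2kmer` helper here is a small re-implementation for illustration; the produced k-mer strings are what the tokenizer consumes during fine-tuning.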

Excuse me, did you ever figure out the answers to these two questions?