graykode / ALBERT-Pytorch

PyTorch implementation of ALBERT (A Lite BERT for Self-supervised Learning of Language Representations)

Home Page: https://arxiv.org/pdf/1909.11942.pdf



Number of Transformer layers

IwasakiYuuki opened this issue

Thank you for the PyTorch implementation of ALBERT.
I have a question about the model construction.

h = self.transformer(input_ids, segment_ids, input_mask)

Why is the number of transformer layers one?
Is this correct?

Please see the following loop in the model code:

for _ in range(self.n_layers):

I hope this code helps you. Thanks!
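
For context, the loop above is the core of ALBERT's cross-layer parameter sharing: only one Transformer block is instantiated, but its weights are reused n_layers times in the forward pass. Below is a minimal sketch of this pattern (illustrative names and default sizes only, not the repository's actual code):

import torch
import torch.nn as nn

class SharedLayerEncoder(nn.Module):
    """One Transformer block applied repeatedly, ALBERT-style."""
    def __init__(self, hidden_dim=256, n_heads=4, n_layers=12):
        super().__init__()
        self.n_layers = n_layers
        # A single encoder block; its parameters are shared by every layer.
        self.block = nn.TransformerEncoderLayer(
            d_model=hidden_dim, nhead=n_heads, batch_first=True)

    def forward(self, h):
        # The same block (same weights) is applied n_layers times,
        # mirroring the `for _ in range(self.n_layers)` loop above.
        for _ in range(self.n_layers):
            h = self.block(h)
        return h

x = torch.randn(2, 16, 256)    # (batch, seq_len, hidden_dim)
encoder = SharedLayerEncoder()
print(encoder(x).shape)        # torch.Size([2, 16, 256])

So the encoder still has a depth of n_layers; it simply does not allocate separate parameters for each layer, which is what makes ALBERT "lite" compared to BERT.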

Sorry, I missed it...
Thank you for answering my question.