grammarly / gector

Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite" (BEA-20) and "Text Simplification by Tagging" (BEA-21)

Use GEC with latest transformers and allennlp modules

Jiltseb opened this issue

I want to use GEC with the latest transformers library (v4.4.2). However, this causes several module errors in gector that seem difficult to fix. I have also tried allennlp v1.5.0 but ran into errors.

Note: there is no issue getting GEC to work with the versions specified in requirements.txt. It's just that I want to use it in a virtual environment with the latest transformers/allennlp versions.

Any help is highly appreciated! @skurzhanskyi

I also tried with the latest versions. It seems a lot of the code uses deprecated functionality that needs to be rewritten.

@skurzhanskyi any news on this? We are still stuck with this.

Hi @Jiltseb
We have plans to update transformers this month

Hi, any update on this?

I had to change the code to make it fit the new allennlp (I can do a PR if needed), but I'm still facing many issues while loading models or running predictions.

I tried all 3 pretrained models and cannot make any of them work...

Thanks in advance

Hi, there's a branch with transformers==4.2.2. You can check it here:
https://github.com/grammarly/gector/tree/update_transformers_support_fasttokenizers
At the same time, the pretrained models produce poor output with this code; we're in the middle of retraining the models.

Hi @skurzhanskyi, I just tried but unfortunately, I got the same errors...

I'm trying to use the GecBERTModel class directly to integrate it into my code.
I have a higher version of allennlp, and I modified the imports to work with it; however, most of the errors come from missing keys or bad loading of the models.
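
For reference, this is roughly how I'm loading it. The constructor arguments are what I inferred from predict.py and the README, so treat them as assumptions rather than the exact API:

from gector.gec_model import GecBERTModel

# Rough sketch of my integration (argument names inferred from predict.py /
# the README; they may differ in your version of the repo).
model = GecBERTModel(
    vocab_path="data/output_vocabulary",   # vocabulary directory from the repo
    model_paths=["xlnet_0_gector.th"],     # one of the pretrained checkpoints
    model_name="xlnet",                    # transformer backbone
    special_tokens_fix=0,                  # depends on the checkpoint, per the README
    iterations=5,                          # number of correction passes
    min_error_probability=0.0,
    lowercase_tokens=False,
)

# handle_batch expects a list of tokenized sentences and returns the
# corrected batch plus the number of edits that were applied.
batch = ["How ar you my firend ?".split()]
corrected, total_updates = model.handle_batch(batch)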

I will wait for the new release.

Have a great day

Hi, sorry to post again...

I managed to make it work with transformers 4.6.1 and allennlp 2.6.0.

However, the output of handle_batch(my_string_sentence.split()) doesn't correct anything...
For example:

handle_batch("How ar you my firend ?".split())
[['How', 'ar', 'you', 'my', 'firend', '?']] 0

To do this, I removed some @overrides decorators and added the function

def as_padded_tensor_dict(
    self,
    tokens: Dict[str, List[int]],
    padding_lengths: Dict[str, int],
) -> Dict[str, torch.Tensor]:
    # Convert the indexed "bert" ids and word-piece offsets into tensors,
    # matching the dict-of-tensors interface that newer allennlp expects.
    return {
        "input_ids": torch.tensor(tokens["bert"]),
        "offsets": torch.tensor(tokens["bert-offsets"]),
    }

in tokenizer_indexers.py, which, as far as I can tell, replaces the old pad_token_sequence.

I also had to remove the mask in the seq2labels_model, as it was always true and not of the same size...
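
To be concrete, my workaround for the mask amounted to something like this sketch (my own hack, not the original gector code; I assume 0 is the padding id):

import torch

# Rebuild the padding mask from the padded input ids instead of using the
# incoming mask, which for me was all-True and had the wrong length.
def padding_mask(input_ids: torch.Tensor, pad_id: int = 0) -> torch.Tensor:
    # input_ids: (batch_size, seq_len); True for real tokens, False for padding.
    return input_ids != pad_id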

I know I did some hacky things and was hoping it would work, as I don't know the codebase.

I hope it can help you and that we can have a really nice open-source, state-of-the-art grammar corrector (which we can train in other languages) :)

Have a great day

@skurzhanskyi Any update on the release with the new retrained models?

Hi @Jiltseb
Sorry for the late reply.
Unfortunately, we've run into problems getting the same quality of models with the branch code, so we cannot move to it completely.
In case you don't need the pretrained model, you can try using this branch.

Hi @Jiltseb @ierezell @abhinavdayal
We have great news: we just merged #133, a new GECToR version that supports the latest transformers & torch. There are also new pretrained models (BERT, RoBERTa, XLNet). The scores are slightly different but still comparable.