Problem with sentence level reduction

pengshancai opened this issue 7 months ago · comments

It seems that when I attempted sentence-level reduction, the variables self.keep_leading_word, self.num_lead_words and self.mask_token were never declared. Any clarification? Thanks!

liyucheng09 · Answer 1 · Thu Jan 11 2024 06:36:17 GMT+0800 (China Standard Time)

Any error message? How can I reproduce your error?

pengshancai · Answer 2 · Fri Jan 12 2024 11:14:17 GMT+0800 (China Standard Time)

Guess I figured it out.
The selective_context.py in the GitHub repo is different from the selective_context.py in the pip-installed version.
If you look at selective_context.py in the GitHub repo, you will find that some variables (e.g. self.keep_leading_word) are used without being declared.

Also, it seems some libs installed together with your package (e.g. spacy) are out of date and are not compatible with other packages.
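
For anyone who wants to check the same thing on their own machine, here is a rough sketch of how the mismatch could be confirmed. It uses only the Python standard library. The module name selective_context, the distribution name spacy, and the repo checkout path are assumptions on my side, so adjust them to your setup.

```python
import difflib
import importlib.metadata
import importlib.util
from pathlib import Path


def installed_source_path(module_name):
    """Return the file Python would load for a top-level module, or None if not found."""
    spec = importlib.util.find_spec(module_name)
    return spec.origin if spec is not None else None


def installed_version(dist_name):
    """Return the installed version of a distribution, or None if it is not installed."""
    try:
        return importlib.metadata.version(dist_name)
    except importlib.metadata.PackageNotFoundError:
        return None


def diff_against_repo(installed_path, repo_path):
    """Return unified-diff lines between the installed file and a local repo copy."""
    installed = Path(installed_path).read_text(encoding="utf-8").splitlines()
    repo = Path(repo_path).read_text(encoding="utf-8").splitlines()
    return list(difflib.unified_diff(installed, repo, "installed", "repo", lineterm=""))


if __name__ == "__main__":
    # Assumed names: change these to whatever is installed in your environment.
    print(installed_source_path("selective_context"))
    print(installed_version("spacy"))
```

If diff_against_repo returns a non-empty list for the installed file and your local checkout of the repo file, the two versions differ, which would match what I saw.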

liyucheng09 · Answer 3 · Fri Jan 12 2024 15:14:04 GMT+0800 (China Standard Time)

See this:

class SelectiveContext:

    def __init__(self, model_type = 'gpt2', lang = 'en'):

        self.model_type = model_type
        self.lang = lang
        self.device = DEVICE

        # this means we calculate self-information sentence by sentence
        self.sent_level_self_info = True

        self._prepare_phrase_tokenizer()
        self.sent_tokenize_pattern = r"(?<!\w\.\w.)(?<![A-Z][a-z]\.)(?<=\.|\?)\s"
        self.phrase_mask_token = ''
        self.sent_mask_token = "<...some content omitted.>"
        self.keep_leading_word = False
        self.mask_token = ''
        self._prepare_model()

I don't see any undeclared attributes here. What error did you find exactly?

Please share more info so I can improve this project; other users may find it helpful.
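
A minimal sketch of the kind of failure being described, and one way to report it, assuming the problem is an attribute read in a method but never assigned in __init__. ToyContext and missing_attributes are hypothetical names for illustration only; this is not the real SelectiveContext class.

```python
class ToyContext:
    """Hypothetical stand-in for illustration, not the real SelectiveContext."""

    def __init__(self):
        self.sent_mask_token = "<...some content omitted.>"
        self.mask_token = ""
        # keep_leading_word and num_lead_words are deliberately NOT assigned here.

    def mask_sentence(self, sent):
        # Reading an attribute that was never assigned raises AttributeError.
        if self.keep_leading_word:
            lead = " ".join(sent.split()[: self.num_lead_words])
            return lead + " " + self.sent_mask_token
        return self.sent_mask_token


def missing_attributes(obj, names):
    """Return the names from `names` that are not set on `obj`."""
    return [name for name in names if not hasattr(obj, name)]


if __name__ == "__main__":
    ctx = ToyContext()
    print(missing_attributes(ctx, ["keep_leading_word", "num_lead_words", "mask_token"]))
    try:
        ctx.mask_sentence("Some example sentence.")
    except AttributeError as err:
        print("AttributeError:", err)
```

Running the same hasattr check on a real instance, together with the full traceback, would show exactly which attributes are missing in a given version of the file.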

pengshancai · Answer 4 · Fri Jan 12 2024 22:30:41 GMT+0800 (China Standard Time)

The undeclared variable is in this file: https://github.com/liyucheng09/Selective_Context/blob/main/selective_context.py

liyucheng09 · Answer 5 · Fri Jan 12 2024 22:47:13 GMT+0800 (China Standard Time)

I see. Thanks! Would you mind opening a PR for this if you have already solved it?

liyucheng09 · Answer 6 · Sun Jan 14 2024 13:57:23 GMT+0800 (China Standard Time)

Never mind, issue solved.