microsoft / kernel-memory

RAG architecture: index and query any data using LLM and natural language, track sources, show citations, asynchronous memory patterns.

Home Page: https://microsoft.github.io/kernel-memory


TextGenerationOptions is totally not used

AsakusaRinne opened this issue

TextGenerationOptions is a parameter of ITextGeneration.GenerateTextAsync. However, it currently does not appear to be used anywhere.
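For reference, the abstraction in question looks roughly like this (a sketch; the exact signature may differ between versions):

```csharp
// Sketch of the ITextGeneration abstraction referenced above; the exact
// signature may vary by version. The options parameter is accepted here,
// but per this issue it is never populated or honored anywhere.
public interface ITextGeneration
{
    IAsyncEnumerable<string> GenerateTextAsync(
        string prompt,
        TextGenerationOptions options,
        CancellationToken cancellationToken = default);
}
```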

For API services like OpenAI ChatGPT, stop sequences are not that important. For local model inference, however, the model will keep generating output endlessly without a stop sequence.
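For illustration, this is the kind of configuration a local backend needs to receive and honor; the property names (MaxTokens, StopSequences) assume the current shape of TextGenerationOptions and should be checked against your version:

```csharp
// Illustrative only: the settings a local inference backend must honor for
// generation to terminate. Property names assumed from TextGenerationOptions.
var options = new TextGenerationOptions
{
    MaxTokens = 512,                     // hard cap so generation always stops
    StopSequences = { "</s>", "User:" }  // end-of-turn markers for local models
};
```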

Could you please expose TextGenerationOptions in the AskAsync API, so that users can configure these settings themselves? It would help a lot with local LLM inference integration.
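Something like this hypothetical overload would cover it (not an existing API, just a sketch of the request):

```csharp
// Hypothetical: AskAsync does not currently accept TextGenerationOptions;
// this only illustrates the shape of the API being requested.
var answer = await memory.AskAsync(
    "What is Kernel Memory?",
    options: new TextGenerationOptions
    {
        MaxTokens = 512,
        StopSequences = { "</s>" }
    });
```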

If possible, I would also like the method used to count tokens to be configurable.
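One possible shape for that, sketched here as an assumption rather than an existing abstraction:

```csharp
// Hypothetical interface for pluggable token counting. Local models tokenize
// differently from OpenAI models, so a fixed GPT-style count can be wrong.
public interface ITextTokenCounter
{
    int CountTokens(string text);
}
```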

Any updates? @dluc I understand that in the early stages of a project there is always a shortage of hands. Please at least let us know whether this will be addressed in the future.

Sorry, we haven't had an opportunity to look into this yet, but we always keep an eye on the list of open issues, so we'll provide an update as soon as possible.

OK, I'm looking forward to it. Thank you for your work.

I noticed that LLaMA generates tokens almost ad infinitum (at some point it throws an exception). SearchClientConfig.AnswerTokens will be passed as TextGenerationOptions.MaxTokens.
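That means capping AnswerTokens should bound the output length; a minimal sketch, assuming the builder exposes a way to pass SearchClientConfig:

```csharp
// Workaround based on the note above: AnswerTokens is forwarded to the text
// generator as TextGenerationOptions.MaxTokens, bounding generation length.
// The builder method name is an assumption; check the version you are using.
var memory = new KernelMemoryBuilder()
    .WithSearchClientConfig(new SearchClientConfig { AnswerTokens = 300 })
    .Build<MemoryServerless>();
```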

I'll look into adding the options to the Ask API, so the behavior can be managed more easily.

Thanks a lot! I'm looking forward to it.