transformers-stream-generator

Description

A text generation method for Hugging Face Transformers that returns a generator, streaming out each token in real time during inference.
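To make the idea concrete, here is a minimal sketch of token streaming with a plain Python generator. It is not this package's implementation: `generate_stream` and `toy_next_token` are hypothetical names, and the toy function only stands in for a real model's sampling step.

```python
from typing import Callable, Iterator, List, Sequence


def generate_stream(
    next_token: Callable[[List[int]], int],
    prompt_ids: Sequence[int],
    max_new_tokens: int,
    eos_id: int,
) -> Iterator[int]:
    """Yield each new token id as soon as it is chosen (hypothetical helper)."""
    ids = list(prompt_ids)
    for _ in range(max_new_tokens):
        token = next_token(ids)  # one decoding step
        ids.append(token)
        yield token              # the caller sees the token immediately
        if token == eos_id:      # stop once the end-of-sequence id appears
            break


def toy_next_token(ids: List[int]) -> int:
    """Stand-in for a model (assumption): returns the previous id plus one."""
    return ids[-1] + 1


# Tokens arrive one by one instead of as a finished sequence.
for token in generate_stream(toy_next_token, prompt_ids=[1], max_new_tokens=10, eos_id=4):
    print(token)
```

Because the function yields inside the decoding loop, the caller can display or forward each token while later tokens are still being computed.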

Web Demo

[Demo images: original, stream]

Installation

pip install transformers-stream-generator

Usage

Just add two lines of code before your original code:

from transformers_stream_generator import init_stream_support
init_stream_support()

Add do_stream=True to the model.generate call and keep do_sample=True; generate then returns a generator:

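# model, tokenizer, and input_ids come from your existing Transformers setup
generator = model.generate(input_ids, do_stream=True, do_sample=True)
for token in generator:
    # decode each token as soon as it is produced
    word = tokenizer.decode(token)
    print(word)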
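Decoding each token on its own can split words or multi-byte characters, since many tokenizers work on sub-word pieces. One possible workaround, sketched below, is to re-decode the accumulated ids and emit only the new suffix. `stream_text` is a hypothetical helper, not part of this package, and it assumes the decoded text of a longer id list starts with the decoded text of the shorter one. The toy `decode_bytes` function stands in for `tokenizer.decode`.

```python
from typing import Callable, Iterable, Iterator, List


def stream_text(
    token_ids: Iterable[int],
    decode: Callable[[List[int]], str],
) -> Iterator[str]:
    """Yield only the text that is new after each token (hypothetical helper)."""
    seen: List[int] = []
    emitted = ""
    for token_id in token_ids:
        seen.append(int(token_id))
        text = decode(seen)
        # A trailing U+FFFD means the last character is still incomplete,
        # so wait for the next token before emitting anything.
        if text.endswith("\ufffd"):
            continue
        yield text[len(emitted):]
        emitted = text


def decode_bytes(ids: List[int]) -> str:
    """Toy decoder (assumption): treats each id as one UTF-8 byte."""
    return bytes(ids).decode("utf-8", errors="replace")


# "h" followed by the two bytes of "é": the half character is held back.
for piece in stream_text([0x68, 0xC3, 0xA9], decode_bytes):
    print(piece, end="", flush=True)
print()
```

With a real model, one would pass something like `(int(t) for t in generator)` and `tokenizer.decode`, assuming each yielded item holds a single token id. Re-decoding the whole sequence at every step costs more than decoding one token, which is usually acceptable for short outputs.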

Example

Run the Python script example with gpt2.
Run the web example with gpt2 and test it with the client example.

License

MIT License