Parallel decoding
jhen0409 opened this issue · comments
llama.cpp now supports parallel decoding in one context so we can support.
Breaking change: Deprecate stopCompletion
method and move to return values of completion.
React Native binding of llama.cpp
jhen0409 opened this issue · comments
llama.cpp now supports parallel decoding in one context so we can support.
Breaking change: Deprecate stopCompletion
method and move to return values of completion.