Android: Cannot load models, stopCompletions not working.

Question

Android: Cannot load models, stopCompletions not working.

Vali-98 opened this issue 6 months ago · comments

Vali-98 commented 6 months ago

As it says on the tin. Loading small 3b models ala Tiny Llama or StableLM models do not work. Tested models:

Attempting to call initLlama results in

Error: Failed to initialize context

Which I can only assume is here:

https://github.com/mybigday/llama.rn/blob/main/android/src/main/java/com/rnllama/RNLlama.java?plain=1#L55

I do not know enough about native functions to investigate further.

In addition, stopCompletions() does not stop a completion on Android.
Thanks for your work, the project is fantastic otherwise.

Jhen-Jie Hong · Answer 1 · Wed Dec 27 2023 13:59:31 GMT+0800 (China Standard Time)

Can you share the following info? Thanks!

llama.rn version (latest: 0.3.0-rc.9)
Used model quantized type or file size
Android device version & arch & total memory

Also you can use adb logcat -s "RNLLAMA_LOG_ANDROID","RNLLAMA_ANDROID_JNI" to get logs.

Vali-98 · Answer 2 · Wed Dec 27 2023 16:59:55 GMT+0800 (China Standard Time)

Sure thing:

llama.rn version: 0.3.0-rc.9
Models used & quantizations:
Not working:
-- phi-2.Q3_K_M.gguf
-- stablelm-zephyr-3b.Q3_K_L.gguf
Working:
-- tinyllama-1.1b-chat-v0.3.Q3_K_L.gguf
Android Devices Tested on:
-- Samsung A71 - Android 13 - 8GB
-- Emulated Pixel 3a - Android 14 - 8GB allocated
logcat log:

RNLLAMA_ANDROID_JNI: [RNLlama] is_model_loaded false
RNLLAMA_ANDROID_JNI: [RNLlama] is_model_loaded false
RNLLAMA_ANDROID_JNI: [RNLlama] is_model_loaded false

Vali-98 · Answer 3 · Wed Dec 27 2023 17:24:42 GMT+0800 (China Standard Time)

After further testing, this seems to be caused by an older version of the package being installed instead. Purging cache and downloading latest fixed the issue.

A bit of an embarassing mistake by me, but I will leave this up for future reference. Thank you for your time.