Plans to Support other LLMs?
DeveloperPaul123 opened this issue · comments
ChatGPT is great, but it would be nice to also have the option to use local models via something like llama.cpp. Is this something that you are interested in incorporating?
Came to ask the same thing. It would be great not to have to send your data to "Open"AI's products, especially now that we can run some pretty capable, fine-tunable models locally or on our own servers with decent APIs, libraries, etc.
Hi @DeveloperPaul123 @sammcj I believe we can help with this issue. I’m the maintainer of LiteLLM https://github.com/BerriAI/litellm
TL;DR:
We let you use any supported LLM as a drop-in replacement for gpt-3.5-turbo.
You can use our proxy server or spin up your own proxy server using LiteLLM.
Usage
This calls the provider API directly:

```python
from litellm import completion
import os

## set ENV variables
os.environ["OPENAI_API_KEY"] = "your-key"

messages = [{"content": "Hello, how are you?", "role": "user"}]

# openai call
response = completion(model="gpt-3.5-turbo", messages=messages)

# falcon call
response = completion(model="falcon-40b", messages=messages)
```
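The "drop-in replacement" idea works because a LiteLLM proxy (or any compatible server) speaks the OpenAI chat-completions wire format, so swapping providers is just a matter of changing the base URL. Here's a minimal stdlib-only sketch of that pattern; the localhost port and helper names are illustrative assumptions, not part of LiteLLM's API:

```python
import json
import urllib.request

# Hypothetical local proxy address -- an OpenAI-compatible
# proxy would listen somewhere like this.
BASE_URL = "http://localhost:8000"

def build_chat_request(base_url: str, model: str, messages: list) -> tuple:
    """Build the URL and JSON body for an OpenAI-style /chat/completions call."""
    return f"{base_url}/chat/completions", {"model": model, "messages": messages}

def chat_completion(base_url: str, model: str, messages: list, api_key: str = "") -> dict:
    """POST the request to whatever endpoint base_url points at."""
    url, payload = build_chat_request(base_url, model, messages)
    req = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())
```

Because only `base_url` changes, the same client code can talk to OpenAI, a LiteLLM proxy, or any other server exposing the same endpoint.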
This is not something I'm interested in. I want to run the LLM locally, directly on my machine, without having to spin up my own server, even a local one.
@DeveloperPaul123 I'd recommend checking out ollama for this https://ollama.ai/
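For anyone following along: Ollama runs models locally and exposes a small HTTP API on localhost. A minimal stdlib-only sketch of calling it, assuming Ollama's default port 11434 and using `llama2` as an illustrative model name:

```python
import json
import urllib.request

# Ollama's default local endpoint (assumed; configurable in Ollama itself).
OLLAMA_URL = "http://localhost:11434/api/generate"

def build_generate_request(model: str, prompt: str) -> dict:
    """Build the JSON body for Ollama's /api/generate endpoint.

    stream=False asks for a single JSON response instead of a
    stream of partial chunks.
    """
    return {"model": model, "prompt": prompt, "stream": False}

def generate(model: str, prompt: str) -> str:
    """Send a completion request to a locally running Ollama server."""
    body = json.dumps(build_generate_request(model, prompt)).encode("utf-8")
    req = urllib.request.Request(
        OLLAMA_URL, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]
```

With Ollama running, `generate("llama2", "Hello, how are you?")` returns the model's reply as a string; no external server or account is involved.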