# Use Databricks Serving Endpoints for Prompt Engineering, RAG, and Fine-Tuned LLMs
This bot was created using the Bot Framework. It shows how to build a simple bot that accepts input from the user and pipes it to a Databricks served model for securely hosted LLM inference. Use it for model optionality, for querying your RAG architectures, or for your fine-tuned or pretrained models.
The only custom code beyond the sample echo chatbot is the `dbrx.py` file. For enterprise use, it will need to be extended and secured.
You can easily change the model used in `dbrx.py`. The model can even be specified at runtime, letting the user choose between different models.
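The actual contents of `dbrx.py` are not shown here, but the call it makes can be sketched with the standard library alone. The function and endpoint names below (`invocation_url`, `query_model`, `databricks-dbrx-instruct`) are illustrative assumptions, not the sample's real identifiers; passing the endpoint name as a parameter is what makes runtime model selection possible:

```python
import json
import os
import urllib.request


def invocation_url(endpoint_name: str) -> str:
    """Build the invocations URL for a named serving endpoint.

    DATABRICKS_URL is expected to already end in /serving-endpoints,
    as described in the prerequisites below.
    """
    return f"{os.environ['DATABRICKS_URL']}/{endpoint_name}/invocations"


def query_model(endpoint_name: str, prompt: str) -> str:
    """POST a chat-style prompt to the endpoint and return the reply text."""
    request = urllib.request.Request(
        invocation_url(endpoint_name),
        data=json.dumps({"messages": [{"role": "user", "content": prompt}]}).encode(),
        headers={
            "Authorization": f"Bearer {os.environ['DATABRICKS_TOKEN']}",
            "Content-Type": "application/json",
        },
    )
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    # Chat-style serving endpoints return an OpenAI-style response shape;
    # adjust the parsing if your endpoint returns a different schema.
    return body["choices"][0]["message"]["content"]
```

Swapping `endpoint_name` (for example between a pay-per-token foundation model and your own fine-tuned endpoint) is all that model optionality requires on the client side.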
To run this sample, complete the following prerequisites:
- Run `export DATABRICKS_TOKEN=your_token`
- Run `export DATABRICKS_URL=your_workspace_url`
  - `your_workspace_url` is in the format `https://{workspace-identifier}/serving-endpoints`
  - e.g. `https://adb-984752964297111.11.azuredatabricks.net/serving-endpoints`
- Run `pip install -r requirements.txt` to install all dependencies
- Run `python app.py`
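Because both environment variables are required, the bot could fail fast at startup with a clear message instead of erroring on the first request. A minimal sketch (the `check_config` helper is hypothetical, not part of the sample):

```python
import os


def check_config() -> list[str]:
    """Return the names of required environment variables that are unset."""
    required = ("DATABRICKS_TOKEN", "DATABRICKS_URL")
    return [name for name in required if not os.environ.get(name)]


if __name__ == "__main__":
    missing = check_config()
    if missing:
        raise SystemExit(f"Missing required environment variables: {', '.join(missing)}")
```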
Bot Framework Emulator is a desktop application that allows bot developers to test and debug their bots on localhost or running remotely through a tunnel.
- Install the Bot Framework Emulator version 4.3.0 or greater from here
- Launch Bot Framework Emulator
- Enter a Bot URL of `http://localhost:3978/api/messages`