MiguelPeralvo / large-language-models

Notebooks for Large Language Models (LLMs) Specialization

Large Language Models

This repo contains the notebooks for the Large Language Models: Application through Production course on edX & Databricks Academy.

How to Import the Repo into Databricks?

  1. First, add your Git credentials to Databricks. Refer to the Databricks documentation on Git credentials for the steps.

  2. Click Repos in the sidebar, then click Add Repo at the top right.

  3. Copy the HTTPS clone URL from GitHub, or copy https://github.com/databricks-academy/large-language-models.git, and paste it into the Git repository URL box. The remaining fields, Git provider and Repository name, are populated automatically. Click Create Repo at the bottom right.

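The same import can also be scripted rather than clicked through. The sketch below only builds the request for the Databricks Repos REST API (`POST /api/2.0/repos`) and does not send it. The workspace URL and the workspace path are hypothetical placeholders, and the helper name `build_create_repo_request` is invented for illustration. To actually send the request, you would also need to supply a personal access token as a bearer token.

```python
import json


def build_create_repo_request(workspace_url, repo_url, workspace_path):
    """Return (endpoint, payload) for a Repos API create call.

    Nothing is sent over the network; this only assembles the request.
    """
    endpoint = workspace_url.rstrip("/") + "/api/2.0/repos"
    payload = {
        "url": repo_url,          # the HTTPS clone URL from step 3
        "provider": "gitHub",     # Git provider identifier used by the Repos API
        "path": workspace_path,   # where the repo appears in the workspace
    }
    return endpoint, payload


endpoint, payload = build_create_repo_request(
    "https://example.cloud.databricks.com/",  # hypothetical workspace URL
    "https://github.com/databricks-academy/large-language-models.git",
    "/Repos/someone@example.com/large-language-models",  # hypothetical path
)

print(endpoint)
print(json.dumps(payload, indent=2))
```

The UI fills in Git provider and Repository name automatically. In a scripted call they have to be given explicitly, which is why `provider` and `path` appear in the payload.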

Which Databricks cluster should I use?

  1. First, select Single Node.

  2. This courseware has been tested on Databricks Runtime 13.1 for Machine Learning. The ML Runtime pre-installs many commonly used machine learning packages, so without access to a 13.1 ML Runtime cluster you will need to install many additional libraries yourself, and the courseware is not guaranteed to run.

    All notebooks except LLM 04a - Fine-tuning LLMs and LLM04L - Fine-tuning LLMs Lab run fine on a CPU. We recommend either i3.xlarge or i3.2xlarge (i3.2xlarge is slightly faster).

    The notebooks LLM 04a - Fine-tuning LLMs and LLM04L - Fine-tuning LLMs Lab require the Databricks Runtime 13.1 for Machine Learning with GPU.

    Select the g5.2xlarge GPU instance type.

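The cluster choices above can be summarized as a small helper that returns a cluster specification in the shape the Databricks Clusters API accepts. This is a sketch under stated assumptions, not part of the courseware. The function name `cluster_spec_for` and the cluster names are hypothetical, and the runtime version keys (`13.1.x-cpu-ml-scala2.12`, `13.1.x-gpu-ml-scala2.12`) are assumed to match what your workspace lists for the 13.1 ML runtimes. Check the keys available in your workspace before using them.

```python
# Notebooks that need a GPU, named as in this README.
GPU_NOTEBOOKS = (
    "LLM 04a - Fine-tuning LLMs",
    "LLM04L - Fine-tuning LLMs Lab",
)


def cluster_spec_for(notebook_name):
    """Return a single-node cluster spec suited to the given notebook."""
    needs_gpu = notebook_name in GPU_NOTEBOOKS
    return {
        # Hypothetical cluster names, chosen for illustration only.
        "cluster_name": "llm-course-gpu" if needs_gpu else "llm-course-cpu",
        # Assumed version keys for Databricks Runtime 13.1 ML (GPU / CPU).
        "spark_version": (
            "13.1.x-gpu-ml-scala2.12" if needs_gpu else "13.1.x-cpu-ml-scala2.12"
        ),
        # Instance types recommended above.
        "node_type_id": "g5.2xlarge" if needs_gpu else "i3.xlarge",
        # Single Node: no workers, Spark runs locally on the driver.
        "num_workers": 0,
        "spark_conf": {
            "spark.databricks.cluster.profile": "singleNode",
            "spark.master": "local[*]",
        },
        "custom_tags": {"ResourceClass": "SingleNode"},
    }


gpu_spec = cluster_spec_for("LLM 04a - Fine-tuning LLMs")
cpu_spec = cluster_spec_for("some other notebook")  # hypothetical name

print(gpu_spec["node_type_id"], gpu_spec["spark_version"])
print(cpu_spec["node_type_id"], cpu_spec["spark_version"])
```

For the CPU case, swapping `i3.xlarge` for `i3.2xlarge` gives the slightly faster option mentioned above.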

License: Other

Languages: Python 100.0%