nebuly-ai / optimate

A collection of libraries to optimise AI model performances

Home Page:https://www.nebuly.com/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

OptiMate

[Legacy]

This repository is now in a legacy phase and is no longer actively maintained. Although the source code is still available in the Git history, there will be no additional updates or official support.

[About Nebuly]

Our team is fully committed on creating the best user-experience platform for LLMs so that companies can understand user behavior at scale when interacting with their LLM-based products.

[About optimate]

We have open-sourced a couple of internal projects to the community, but we are not currently maintaining them. Optimate is a collection of libraries designed to help you optimize your AI models. It is an open-source project developed by Nebuly AI but is not actively maintained.

The tools available to assist you in your optimization are:

Speedster: reduce inference costs by leveraging SOTA optimization techniques that best couple your AI models with the underlying hardware (GPUs and CPUs)

Nos: reduce infrastructure costs by leveraging real-time dynamic partitioning and elastic quotas to maximize the utilization of your Kubernetes GPU cluster

ChatLLaMA: reduce hardware and data costs by leveraging fine-tuning optimization techniques and RLHF alignment

About

A collection of libraries to optimise AI model performances

https://www.nebuly.com/

License:Apache License 2.0


Languages

Language:Python 79.3%Language:Jupyter Notebook 16.8%Language:CMake 3.3%Language:Shell 0.5%Language:Dockerfile 0.2%Language:Makefile 0.0%