cin-hubert's starred repositories
LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs
ml-engineering
Machine Learning Engineering Open Book
PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
postgresml
The GPU-powered AI application database. Get your app to market faster using the simplicity of SQL and the latest NLP, ML + LLM models.
mlx-examples
Examples in the MLX framework
FlagEmbedding
Retrieval and Retrieval-augmented LLMs
MixtralKit
A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI
GPT-RAG
Sharing the learnings we have been gathering along the way to enable Azure OpenAI at enterprise scale in a secure manner. GPT-RAG core is a Retrieval-Augmented Generation pattern running in Azure, using Azure Cognitive Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.
DISC-MedLLM
Repository of DISC-MedLLM, a comprehensive solution that leverages Large Language Models (LLMs) to provide accurate and truthful medical responses in end-to-end conversational healthcare services.
llama-mistral
Inference code for Mistral and Mixtral, hacked into the original Llama implementation
local-llm-function-calling
A tool for generating function arguments and choosing what function to call with local LLMs
Machine-Learning-for-Imbalanced-Data
Machine Learning for Imbalanced Data, published by Packt
sandbox-topically
Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models.
Selective_Context
Compress your input to ChatGPT or other LLMs, to let them process 2x more content and save 40% memory and GPU time.
backbone-learn
A Library for Scaling Mixed-Integer Optimization-Based Machine Learning.