There are 8 repositories under unsloth topic.
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
Code for Deep Learning for Modern AI
数字分身项目,并且包含了搭建(复现)教程 Qing's digital self, including setup tutorial
本项目利用医学领域的 CoT 数据对 Deepseek-R1-Distill-Qwen-7B 进行微调,通过 QLoRA 量化和 Unsloth 加速训练,显著提升模型在复杂医学推理任务中的慢思考能力。知识蒸馏技术使轻量级模型获得大模型的推理优势,实现高效、准确且具有解释性的医学问答系统。
Advanced 4-bit QLoRA fine-tuning pipeline for LLaMA 3 8B with production-grade optimization. Memory-efficient training on consumer GPUs for instruction-following specialization. Demonstrates cutting-edge parameter-efficient fine-tuning with Unsloth integration.
Instruction Fine-Tuning of Meta Llama 3.2-3B Instruct on Kannada Conversations. Tailoring the model to follow specific instructions in Kannada, enhancing its ability to generate relevant, context-aware responses based on conversational inputs. Using the Kannada Instruct dataset for fine-tuning! Happy Finetuning 🎋
AstorAI is a user-friendly medical chatbot powered by Retrieval-Augmented Generation (RAG) and the advanced LLama 3 model. It offers real-time, accurate responses to a wide range of medical queries, ensuring privacy and security in every interaction. Designed for ease of use, AstorAI provides reliable health information on various topics 24/7.
Cloning Yourself using your whatsapp chat history and training a model on it.
Fine-Tuning of DeepSeek-Style Reasoning Models | RL + Quantization Implementation
Information extraction from unstructured text to build a knowledge graph using techniques from traditional NLP to pre-trained transformers and LLMs for NER and Linking, and Relation Extraction.
Fine-Tuning LLMs (Gemma, LLaMA, Mistral, etc.) A practical guide to fine-tuning various large language models using popular frameworks. Includes examples, scripts, and tips for efficient training on custom datasets.
PTIT's Major Project: Website Programming - This repo contains a chatbot for a clothing store. The chatbot acts as an employee with specific knowledge about clothing consultation, website support, and store information.
Finetune Web UI is a user-interface for training and deploying pre-trained models.
Materials for CSE Summer School Hackathon 2024
Finetuning of Gemma-2 2B for structured output
Fine-tuning Llama 3.2 3B Instruct model for text generation using Unsloth AI
LLM-powered financial analyst using LoRA-tuned Llama-3 and RAG pipeline to answer complex queries over SEC 10-K filings with contextual accuracy.
ResurrectAI is an AI-driven chat application designed to bring the wisdom and knowledge of great historical personalities to life. Leveraging advanced language models and fine-tuning techniques, ResurrectAI enables users to interact with AI avatars of iconic figures, gaining access to their insights, guidance, and philosophical teaching in realtime
LLM finetuning for Sudoku solving
PDF 문서에서 GPU 가속 처리로 고품질 질의응답(QA) 데이터를 자동 생성하고 LLM을 효율적으로 파인튜닝하는 솔루션입니다. Unstructured 라이브러리와 AWS Bedrock Claude로 도메인 특화 QA 쌍을 생성하고, LoRA 기법으로 경량 모델을 훈련합니다.
🤖 AI of Pwo fine tuned on Llama-3.1-8B Instruct
This project demonstrates the process of fine-tuning the Qwen2.5-3B-Instruct model using GRPO (Generalized Reward Policy Optimization) on the GSM8K dataset.
⚕️ Multi-agent SLM system for medical question answering — domain-aware, private-by-design, cloud-free
Inspired by mathematical reasoning models like DeepSeekMath, this framework applies CoT to SQL generation and fine-tunes distilled models using GRPO to enhance both accuracy and interpretability.
This project is an AI-powered Contract Risk Assessment and Legal Assistant designed to analyze legal documents, extract key clauses, assess risks, and provide actionable recommendations. Additionally, a fine-tuned conversational chatbot is integrated for interactive legal Q&A based on contract-specific knowledge.
Fine-tuning GPT-3.5 and Llama3 LLMs for enhanced persona consistency in chatbots using Google's Synthetic Persona Chat dataset
we finetune unsloth llama model to extract mathematical fomulas in the images with optical character recognition(OCR)
In this we finetune Pixtral-12B-2409 model using unsloth for visual Question Answering(NLP Task)
Fine-Tuning Llama 3-8B for Structured Math Reasoning: Fine-tuning Llama3 8b to generate JSON formats for arithmetic questions and process the output to perform calculations.
Open Source Verilog Copilot: Fine Tune LLM: Fine-Tuning LLM with QLoRA and VeriGen Dataset using Unsloth
This project demonstrates how to setup a complete Retrieval Augmented Generation (RAG) pipeline on medical data using Llama-3-8B model
DL24-25 project. The goal is Fine-Tuning a LLM on Italian Dialect.
GPT-OSS-20B fine-tuned for multilingual reasoning with LoRA (trained on Google Colab GPU). Trained on 1k Multilingual-Thinking samples across multiple languages Features 4-bit quantization and chain-of-thought reasoning. Optimized with Unsloth for efficient training.