gemma

There are 12 repositories under the gemma topic.

ollama / ollama
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
llama llm llama2 llms go golang ollama mistral gemma llama3 llava phi4 deepseek gemma3 qwen gemma3n gpt-oss
Language:Go 152347
unslothai / unsloth
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.
fine-tuning llama llms lora mistral gemma llama3 unsloth llm deepseek deepseek-r1 gemma3 text-to-speech tts qwen qwen3 agent ai openai gpt-oss
Language:Python 45475
mudler / LocalAI
🤖 The free, open-source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more model architectures. Features: text, audio, video, and image generation; voice cloning; distributed, P2P inference.
llama rwkv ai llm stable-diffusion api kubernetes gpt4all tts musicgen mamba audio-generation image-generation text-generation gemma mistral llama3 rerank distributed libp2p
Language:Go 35288
GaiZhenbiao / ChuanhuChatGPT
GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.
chatbot chatgpt-api chatglm claude dalle3 ernie gemini gemma llama midjourney minimax moss ollama qwen spark stablelm inspurai
Language:Python 15437
xorbitsai / inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
ggml pytorch chatglm deployment flan-t5 llm wizardlm artificial-intelligence machine-learning whisper inference openai-api mistral gemma llama llamacpp vllm qwen llama3 glm4
Language:Python 8525
LostRuins / koboldcpp
Run GGUF models easily with a KoboldAI UI. One File. Zero Install.
koboldcpp llamacpp llm koboldai llama ggml gguf gemma language-model mistral
Language:C++ 8193
yangjianxin1 / Firefly
Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
gpt alpaca llm baichuan llama lora qlora peft llama2 internlm chatglm qwen aquila mistral mixtral zephyr minicpm gemma llama3 qwen2
Language:Python 6546
google / gemma_pytorch
The official PyTorch implementation of Google's Gemma models
gemma google pytorch
Language:Python 5545
clusterzx / paperless-ai
An automated document analyzer for Paperless-ngx using OpenAI API, Ollama, Deepseek-r1, Azure and all OpenAI API compatible Services to automatically analyze and tag your documents.
ai automation paperless paperless-ngx gemma llama mistral ollama phi
Language:JavaScript 4202
johnbean393 / Sidekick
A native macOS app that allows users to chat with a local LLM that can respond with information from files, folders and websites on your Mac without installing any other software. Powered by llama.cpp.
ai llama llm macos rag swift swiftui deepseek deepseek-r1 aichat chatbot qwen llama4 qwen3 agentic-ai ai-agents gemma3 agents deep-research
Language:Swift 3074
darrenburns / elia
A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more.
python ai chatgpt gpt terminal tui claude large-language-models llama llama3 llm mistral mixtral ollama gemma mistral-ai ollama-client ollama-interface phi-3
Language:Python 2287
google / generative-ai-docs
Documentation for Google's Gen AI site - including the Gemini API and Gemma
ai chatbot documentation embeddings llm machine-learning gemini gemini-api gemma
Language:Jupyter Notebook 2146
papersgpt / papersgpt-for-zotero
A powerful Zotero AI plugin with ChatGPT, Gemini, Claude, Grok, DeepSeek, OpenRouter, Kimi, GLM, SiliconFlow, GPT-oss, Gemma 3, Qwen 3
zotero mistral claude chatgpt llama ai chat pdf deepseek gemini qwen3 grok4 gpt-5 gpt-oss glm kimi openrouter siliconflow gemma3
Language:JavaScript 1862
jakobhoeg / nextjs-ollama-llm-ui
Fully-featured web interface for Ollama LLMs
ai chatbot llm mistral nextjs ollama openai tailwindcss localstorage offline shadcn react local nextjs14 typescript mistral-7b gemma
Language:TypeScript 1315
google-gemini / gemma-cookbook
A collection of guides and examples for the Gemma open models from Google.
codegemma gemma paligemma recurrentgemma
Language:Jupyter Notebook 1304
magpie-align / magpie
[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data generation pipeline!
alignment llama2 llama3 llm nlp paper phi3 qwen2 synthetic-data synthetic-dataset-generation dataset gemma supervised-finetuning
Language:Python 768
mlc-ai / web-llm-chat
Chat with AI large language models running natively in your browser. Enjoy private, server-free, seamless AI conversations.
ai chat chat-application chatbot gemma generative-ai hermes large-language-models llama llm mistral phi2 privacy redpajama tinyllama nextjs qwen chatgpt webgpu
Language:TypeScript 710
sozercan / aikit
🏗️ Fine-tune, build, and deploy open-source LLMs easily!
ai buildkit chatgpt docker fine-tuning finetuning gemma gpt inference kubernetes large-language-models llama llm localllama mistral mixtral nvidia open-llm open-source-llm openai
Language:Go 440
InternLM / InternEvo
InternEvo is an open-source, lightweight training framework that aims to support model pre-training without the need for extensive dependencies.
910b deepspeed-ulysses flash-attention gemma internlm internlm2 llama3 llava llm-framework llm-training multi-modal pipeline-parallelism pytorch ring-attention sequence-parallelism tensor-parallelism transformers-models zero3
Language:Python 407
AI-Hypercomputer / JetStream
JetStream is a throughput- and memory-optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in the future; PRs welcome).
gemma gpt gpu inference jax large-language-models llama llama2 llm model-serving pytorch tpu llm-inference llmops mlops transformer
Language:Python 375
Beomi / InfiniTransformer
Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
gemma huggingface pytorch transformers infinitransformer llama llama3
Language:Python 359
Picovoice / picollm
On-device LLM Inference Powered by X-Bit Quantization
compression efficient-inference gemma generative-ai language-model language-models large-language-model llama llama2 llama3 llm llm-inference llms mistral mixtral model-compression natural-language-processing quantization self-hosted
Language:Python 268
inferflow / inferflow
Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).
llama2 llamacpp llm-inference model-quantization multi-gpu-inference mixture-of-experts moe gemma falcon minicpm mistral bloom deepseek internlm phi-2 baichuan2 mixtral m2m100 qwen
Language:C++ 248
masterFoad / NanoSage
Local LLM Powered Recursive Search & Smart Knowledge Explorer
ai-researcher gemma knowledge-base local monte-carlo-search ollama python3 rag recursive-search report small-language-models nanosage algorithms cli
Language:Python 232
Genta-Technology / Kolosal
Kolosal AI is an open-source, lightweight alternative to LM Studio for running LLMs 100% offline on your device.
llm c cpp deepseek gemma gemma2 gemma3 gpt llama llama2 llama3 llamacpp llava llms localai mistral phi3 phi4 qwen self-hosted
Language:C++ 183
KudoAI / googlegpt
🤖 Adds AI chat and search summaries to Google Search, powered by the latest LLMs like Google Gemma + GPT-4o!
ai bot chatbot chatbots chatgpt experimental generative-ai google gpt gpt-4 llm machine-learning openai search search-engine greasemonkey userscripts gemma gpt-4o kudoai
Language:JavaScript 163
jorge-armando-navarro-flores / chat_with_your_docs
Discover and converse with advanced AI models like Mistral, LLAMA2, and GPT-3.5 from leading sources like OLLAMA, Hugging Face, and OpenAI. Easily extract insights from PDFs, web pages, and YouTube videos with our intuitive interface. Unlock the power of knowledge with seamless chat interactions.
faiss gemma gpt-3-5-turbo gpt-4 gradio huggingface llama2 llms mistral ollama openai python docs pdf web youtube chatbot retrieval-chatbot langchain vectorstore
Language:Python 147
ganeshnikhil / J.A.R.V.I.S.2.0
Open-source assistant using small models (2B-5B), with agentic and tool-calling capabilities and RAG integration with efficient memory. Android support using adb.
ai api gemini granite-20b-multilingual jarvis-assistant llm llm-agents ollama rag tools gemma
Language:Python 124
marklysze / LlamaIndex-RAG-WSL-CUDA
Examples of RAG using Llamaindex with local LLMs - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B
llama-2 llamaindex mistral-7b orca-2 retrieval-augmented-generation yi-34b windows-10 windows-11 wsl2 mixtral mixtral-8x7b phi-2 microsoft-phi-2 neural-7b neural-chat-7b gemma gemma-2b gemma-7b
Language:Jupyter Notebook 124
Mobile-Artificial-Intelligence / llama_sdk
lcpp is a Dart implementation of llama.cpp used by the Mobile Artificial Intelligence Distribution (Maid)
facebook flutter-ai gemma ggml gguf llama llama2 llamacpp llm llm-inference local-ai meta mistral mixtral mobile-ai
Language:C++ 102
BodhiSearch / BodhiApp
Run Open Source/Open Weight LLMs locally with OpenAI compatible APIs
gemma generative-ai llama llm local-llm localllm mistral open-source-llm private-llm
Language:Rust 97
Upsonic / Client
Self-Driven Autonomous Python Libraries
autonomous gemma library module openai python
Language:Python 94
fly-apps / ollama-open-webui
Self-host a ChatGPT-style web interface for Ollama 🦙
gpu ollama ollama-webui llama3 mixtral gemma llava mistral ai
Language:Shell 84
adithya-s-k / YoloGemma
Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detection and segmentation.
gemma paligemma vlm
Language:Python 80
ErickWendel / ollama-webui-traefik-docker
deepseek-r1 docker gemma gemma-2b hostinger letsencrypt llm llms ollama open-webui traefik vps
Language:Shell 75
QuantiusBenignus / BlahST
Input text from speech in any Linux window, the lean, fast and accurate way, using whisper.cpp OFFLINE. Speak with local LLMs.
accessibility ai bloat-free cli command-line command-line-tool desktop-integration gemma gnome kiss llama llm llms machine-learning no-nonsense speech-recognition speech-to-text tts whisper whisper-cpp
Language:Shell 68
