Jan's repositories
cortex.cpp
Run and customize local LLMs.
cortex.tensorrt-llm
cortex.tensorrt-llm is a C++ inference library that can be loaded by any server at runtime. It includes NVIDIA's TensorRT-LLM as a submodule for GPU-accelerated inference on NVIDIA GPUs.
cortex.llamacpp
cortex.llamacpp is a high-efficiency C++ inference engine for edge computing, packaged as a dynamic library that any server can load at runtime (see the loading sketch after this list).
cortex.python
C++ code that embeds and runs Python.
infinity
The AI-native database built for LLM applications, providing incredibly fast vector and full-text search
llama.cpp-avx-vnni
Port of Facebook's LLaMA model in C/C++
openai_trtllm
OpenAI-compatible API for the TensorRT-LLM Triton backend (request shape sketched after this list).
pymaker
Make the py
tensorrtllm_backend
The Triton TensorRT-LLM Backend
trt-llm-as-openai-windows
A reference implementation that lets existing OpenAI-integrated apps run against local TRT-LLM inference on a GeForce GPU on Windows instead of the cloud.
winget-pkgs
The Microsoft community Windows Package Manager manifest repository
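The cortex engines above describe themselves as dynamic libraries that a server loads at runtime. Below is a minimal sketch of that loading pattern, assuming a POSIX dlopen-based host; the library path ./libengine.so and the exported create_engine symbol are illustrative assumptions, not the actual cortex interface.

```cpp
// Sketch: loading an inference engine shared library at runtime.
// Path and symbol name are hypothetical, not the cortex API.
#include <dlfcn.h>
#include <cstdio>

int main() {
  // Open the engine shared object at runtime (path is illustrative).
  void* handle = dlopen("./libengine.so", RTLD_NOW | RTLD_LOCAL);
  if (!handle) {
    std::fprintf(stderr, "dlopen failed: %s\n", dlerror());
    return 1;
  }

  // Look up an exported factory symbol (name is illustrative).
  using CreateFn = void* (*)();
  auto create = reinterpret_cast<CreateFn>(dlsym(handle, "create_engine"));
  if (!create) {
    std::fprintf(stderr, "dlsym failed: %s\n", dlerror());
    dlclose(handle);
    return 1;
  }

  void* engine = create();  // a server would now route requests to the engine
  (void)engine;

  dlclose(handle);
  return 0;
}
```

Loading engines this way keeps the server binary engine-agnostic: swapping llama.cpp for TensorRT-LLM means shipping a different shared library, not rebuilding the server.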
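Several repositories above (openai_trtllm, trt-llm-as-openai-windows) exist to put an OpenAI-compatible API in front of local TRT-LLM inference. A minimal sketch of what that compatibility means on the client side, assuming a server at localhost:3000 and a model named "ensemble" (both illustrative assumptions): the request is the same /v1/chat/completions JSON that OpenAI-integrated apps already send.

```cpp
// Sketch: POSTing an OpenAI-style chat completion request with libcurl.
// URL, port, and model name are assumptions, not values from the repos.
#include <curl/curl.h>
#include <cstdio>
#include <string>

// libcurl write callback: append response bytes to a std::string.
static size_t write_cb(char* data, size_t size, size_t nmemb, void* userp) {
  static_cast<std::string*>(userp)->append(data, size * nmemb);
  return size * nmemb;
}

int main() {
  CURL* curl = curl_easy_init();
  if (!curl) return 1;

  // Same request shape as OpenAI's /v1/chat/completions.
  const char* body =
      R"({"model": "ensemble", "messages": [{"role": "user", "content": "Hello"}]})";

  std::string response;
  struct curl_slist* headers = nullptr;
  headers = curl_slist_append(headers, "Content-Type: application/json");

  curl_easy_setopt(curl, CURLOPT_URL, "http://localhost:3000/v1/chat/completions");
  curl_easy_setopt(curl, CURLOPT_HTTPHEADER, headers);
  curl_easy_setopt(curl, CURLOPT_POSTFIELDS, body);
  curl_easy_setopt(curl, CURLOPT_WRITEFUNCTION, write_cb);
  curl_easy_setopt(curl, CURLOPT_WRITEDATA, &response);

  CURLcode rc = curl_easy_perform(curl);
  if (rc == CURLE_OK) std::printf("%s\n", response.c_str());

  curl_slist_free_all(headers);
  curl_easy_cleanup(curl);
  return rc == CURLE_OK ? 0 : 1;
}
```

Because the endpoint and payload match OpenAI's, existing clients only need their base URL pointed at the local server.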