SUGIYAMA Yoshio (IMOKURI)

IMOKURI

Geek Repo

Company:@HewlettPackard

Location:Tokyo, Japan

Home Page:https://imokuri.com/

Twitter:@imokurity

Github PK Tool:Github PK Tool

SUGIYAMA Yoshio's starred repositories

k8s-dra-driver

Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes

Language:GoLicense:Apache-2.0Stargazers:177Issues:0Issues:0

FlashRAG

⚡FlashRAG: A Python Toolkit for Efficient RAG Research

Language:PythonLicense:MITStargazers:746Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:2227Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:826Issues:0Issues:0

nos

Module to Automatically maximize the utilization of GPU resources in a Kubernetes cluster through real-time dynamic partitioning and elastic quotas - Effortless optimization at its finest!

Language:GoLicense:Apache-2.0Stargazers:587Issues:0Issues:0

llm-on-openshift

Resources, demos, recipes,... to work with LLMs on OpenShift with OpenShift AI or Open Data Hub.

Language:DockerfileLicense:Apache-2.0Stargazers:67Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:2258Issues:0Issues:0

langroid

Harness LLMs with Multi-Agent Programming

Language:PythonLicense:MITStargazers:1811Issues:0Issues:0

chat_templates

Chat Templates for 🤗 HuggingFace Large Language Models

Language:JinjaStargazers:290Issues:0Issues:0

llama-inference

experiments with inference on llama

Language:PythonStargazers:102Issues:0Issues:0

intro-llm-rag

LLM Models and RAG Hands-on guide

Language:PythonLicense:MITStargazers:193Issues:0Issues:0

ragapp

The easiest way to use Agentic RAG in any enterprise

Language:TypeScriptLicense:Apache-2.0Stargazers:2392Issues:0Issues:0

ChatRTX

A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM

Language:PythonLicense:NOASSERTIONStargazers:2469Issues:0Issues:0

openai_trtllm

OpenAI compatible API for TensorRT LLM triton backend

Language:RustLicense:MITStargazers:105Issues:0Issues:0

tensorrtllm_backend

The Triton TensorRT-LLM Backend

Language:PythonLicense:Apache-2.0Stargazers:554Issues:0Issues:0

cognita

RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry

Language:PythonLicense:Apache-2.0Stargazers:2691Issues:0Issues:0

unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

Language:HTMLLicense:Apache-2.0Stargazers:7147Issues:0Issues:0

ts-comments.nvim

Tiny plugin to enhance Neovim's native comments

Language:LuaLicense:Apache-2.0Stargazers:241Issues:0Issues:0

nvim-best-practices

Collection of DOs and DON'Ts for modern Neovim Lua plugin development

License:CC0-1.0Stargazers:196Issues:0Issues:0

GPU-Benchmarks-on-LLM-Inference

Multiple NVIDIA GPUs or Apple Silicon for Large Language Model Inference?

Language:Jupyter NotebookStargazers:534Issues:0Issues:0
Language:ShellStargazers:2Issues:0Issues:0
Stargazers:4Issues:0Issues:0

ez-cheat

Unofficial Cheat Book for HPE Ezmeral Products

Stargazers:2Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:120Issues:0Issues:0

tegon

Tegon is an open-source, AI-first alternative to Jira, Linear

Language:TypeScriptLicense:MITStargazers:751Issues:0Issues:0

IC-Light

More relighting!

Language:PythonLicense:Apache-2.0Stargazers:3723Issues:0Issues:0

timesfm

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.

Language:PythonLicense:Apache-2.0Stargazers:2854Issues:0Issues:0

llm.nvim

LLM powered development for Neovim

Language:LuaLicense:Apache-2.0Stargazers:595Issues:0Issues:0

llm-ls

LSP server leveraging LLMs for code completion (and more?)

Language:RustLicense:Apache-2.0Stargazers:501Issues:0Issues:0

llama.cpp

LLM inference in C/C++

Language:C++License:MITStargazers:60050Issues:0Issues:0