Shotaro Ishihara's starred repositories

langchain

🦜🔗 Build context-aware reasoning applications

Language: Python | License: MIT | Stargazers: 86993 | Issues: 666 | Issues: 6933

openai-cookbook

Examples and guides for using the OpenAI API

text-generation-webui

A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

Language: Python | License: AGPL-3.0 | Stargazers: 37648 | Issues: 325 | Issues: 3463

DocsGPT

GPT-powered chat for documentation, chat with your documents

Language: Python | License: MIT | Stargazers: 14334 | Issues: 87 | Issues: 351

FinGPT

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.

Language: Jupyter Notebook | License: MIT | Stargazers: 12338 | Issues: 244 | Issues: 101

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 9034 | Issues: 95 | Issues: 619

NeMo-Guardrails

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

Language: Python | License: NOASSERTION | Stargazers: 3611 | Issues: 34 | Issues: 269

trafilatura

Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments

Language: Python | License: Apache-2.0 | Stargazers: 3091 | Issues: 30 | Issues: 325

awesome-local-ai

An awesome repository of local AI tools

KnowledgeEditingPapers

Must-read Papers on Knowledge Editing for Large Language Models.

vec2text

utilities for decoding deep representations (like sentence embeddings) back to text

Language: Python | License: NOASSERTION | Stargazers: 627 | Issues: 13 | Issues: 38

embetter

just a bunch of useful embeddings

Language: Python | License: MIT | Stargazers: 438 | Issues: 8 | Issues: 51

relora

Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 411 | Issues: 8 | Issues: 16

starting-kit

Starting kit for the NeurIPS 2023 unlearning challenge

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 374 | Issues: 22 | Issues: 8

GPTSAN

General-purpose Switch Transformer based Japanese language model

Language: Python | License: MIT | Stargazers: 114 | Issues: 5 | Issues: 9

HojiChar

The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.

Language: Python | License: Apache-2.0 | Stargazers: 110 | Issues: 4 | Issues: 1

kaggle-book-gokui

Appendix code

Language: Python | License: MIT | Stargazers: 108 | Issues: 4 | Issues: 16

lmppl

Calculate perplexity on a text with pre-trained language models. Supports MLM (e.g. DeBERTa), recurrent LM (e.g. GPT3), and encoder-decoder LM (e.g. Flan-T5).

Language: Python | License: MIT | Stargazers: 102 | Issues: 4 | Issues: 9

ir100

100 Information Retrieval Exercises

llm-japanese-dataset

Japanese chat dataset for building LLMs

SkillSpan

SKILLSPAN: Competences as Spans for Skill Extraction from Job Postings

Language: Perl | License: MIT | Stargazers: 51 | Issues: 8 | Issues: 15

Extracting-Training-Data-from-Large-Langauge-Models

A re-implementation of the "Extracting Training Data from Large Language Models" paper by Carlini et al., 2020

Language: Python | License: MIT | Stargazers: 28 | Issues: 3 | Issues: 0

shirokumas

A set of scikit-learn style transformers for Polars

Language: Python | License: MIT | Stargazers: 26 | Issues: 2 | Issues: 5

instruction_ja

Japanese instruction data

Language: Python | License: MIT | Stargazers: 22 | Issues: 3 | Issues: 0

semantic-shift-stability

Implementation of Semantic Shift Stability (AACL 2022)

Language: Python | License: MIT | Stargazers: 14 | Issues: 25 | Issues: 0
Language: Python | License: MIT | Stargazers: 12 | Issues: 1 | Issues: 0
Language: C++ | License: MIT | Stargazers: 4 | Issues: 0 | Issues: 0
Language: Python | License: MIT | Stargazers: 3 | Issues: 1 | Issues: 0
Stargazers: 2 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 1 | Issues: 0 | Issues: 0