Beast code in Giters

PN's repositories

awesome-japanese-nlp-resources

A curated list of resources dedicated to Python libraries, LLMs, dictionaries, and corpora of NLP for Japanese

License: CC0-1.0

10x-research-culture

Research in Cultural Understanding and Biases in LLMs

advertools

advertools - online marketing productivity and analysis tools - in dash

Language: Python · License: MIT

AIF360

A comprehensive set of fairness metrics for datasets and machine learning models, explanations for these metrics, and algorithms to mitigate bias in datasets and models.

Language: Python · License: Apache-2.0

ATLAS

A principled instruction benchmark on formulating effective queries and prompts for large language models (LLMs). Our paper: https://arxiv.org/abs/2312.16171

License: Apache-2.0

Awesome-LLM4IE-Papers

Awesome papers about generative Information Extraction (IE) using Large Language Models (LLMs)

bert_for_longer_texts

BERT classification model for processing texts longer than 512 tokens. The text is first divided into smaller chunks; each chunk is fed to BERT, and the intermediate results are pooled. The implementation allows fine-tuning.

Language: Python · License: NOASSERTION
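
The chunk-then-pool idea in the description above can be illustrated with a minimal sketch. This is not the repository's code or API: the function names, the overlapping stride, and the mean/max pooling options are all assumptions made for illustration, and `score_chunk` is a hypothetical stand-in for a BERT forward pass on one chunk. The default chunk size of 510 assumes two positions of the 512-token limit are reserved for the `[CLS]` and `[SEP]` special tokens.

```python
from typing import Callable, List, Sequence


def split_into_chunks(
    token_ids: Sequence[int], chunk_size: int = 510, stride: int = 255
) -> List[List[int]]:
    """Split a token sequence into windows of at most chunk_size tokens.

    Consecutive windows start `stride` tokens apart, so they overlap
    whenever stride < chunk_size (an assumption of this sketch).
    """
    if chunk_size <= 0 or stride <= 0:
        raise ValueError("chunk_size and stride must be positive")
    chunks: List[List[int]] = []
    start = 0
    while True:
        chunks.append(list(token_ids[start:start + chunk_size]))
        # Stop once the current window reaches the end of the sequence.
        if start + chunk_size >= len(token_ids):
            break
        start += stride
    return chunks


def pool_scores(scores: Sequence[float], strategy: str = "mean") -> float:
    """Combine per-chunk scores into one score for the whole text."""
    if not scores:
        raise ValueError("no scores to pool")
    if strategy == "mean":
        return sum(scores) / len(scores)
    if strategy == "max":
        return max(scores)
    raise ValueError(f"unknown pooling strategy: {strategy}")


def classify_long_text(
    token_ids: Sequence[int],
    score_chunk: Callable[[List[int]], float],  # hypothetical per-chunk model call
    chunk_size: int = 510,
    stride: int = 255,
    strategy: str = "mean",
) -> float:
    """Score every chunk independently, then pool the chunk scores."""
    chunks = split_into_chunks(token_ids, chunk_size, stride)
    return pool_scores([score_chunk(chunk) for chunk in chunks], strategy)
```

In practice `score_chunk` would wrap a tokenizer and a fine-tuned classifier; here any callable that maps a list of token IDs to a float will do.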

crosscheckgpt-dev

CrossCheckGPT internal dev

data

Language: HTML · License: MIT

data2

Text NLP data

dotfiles

:wrench: .files, including ~/.macos — sensible hacker defaults for macOS

License: MIT

fast-forward-indexes

Efficient interpolation-based ranking on CPUs

Language: Python · License: MIT

fastapi-ml-skeleton

FastAPI Skeleton App to serve machine learning models production-ready.

Language: Python · License: Apache-2.0

fastembed-rs

Library to generate text embeddings in Rust

Language: Rust · License: Apache-2.0

G-Retriever

Repository for G-Retriever

License: MIT

GLiNER

Generalist model for NER (Extract any entity types from texts)

Language: Python · License: Apache-2.0

GPT4DFCI

generative AI tool, based on GPT-4 and deployed for non-clinical

Language: TypeScript · License: GPL-2.0

graph-rag

Graph based retrieval + GenAI = Better RAG in production

LexicHash

A novel method for sequence similarity estimation

llm-jp-tokenizer

Language: Roff · License: Apache-2.0

Local-Qdrant-RAG

Local Ollama with Qdrant RAG: Embed, index, and enhance models for retrieval-augmented generation. Get started with easy setup for powerful language processing. #NLP #Qdrant #Embedding #Indexing

Language: Python

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language: Python · License: Apache-2.0

ranx

⚡️A Blazing-Fast Python Library for Ranking Evaluation, Comparison, and Fusion 🐍

Language: Python · License: MIT

RefChecker

RefChecker provides an automatic checking pipeline and a benchmark dataset for detecting fine-grained hallucinations generated by Large Language Models.

License: Apache-2.0

selfcheckgpt

SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models

License: MIT

zdocs

Docs of repo

Language: Jupyter Notebook

arita37
