PN's repositories
awesome-japanese-nlp-resources
A curated list of resources dedicated to Python libraries, LLMs, dictionaries, and corpora of NLP for Japanese
10x-research-culture
Research in Cultural Understanding and Biases in LLMs
advertools
advertools - online marketing productivity and analysis tools - in dash
AIF360
A comprehensive set of fairness metrics for datasets and machine learning models, explanations for these metrics, and algorithms to mitigate bias in datasets and models.
ATLAS
A principled instruction benchmark on formulating effective queries and prompts for large language models (LLMs). Our paper: https://arxiv.org/abs/2312.16171
Awesome-LLM4IE-Papers
Awesome papers about generative Information Extraction (IE) using Large Language Models (LLMs)
bert_for_longer_texts
BERT classification model for processing texts longer than 512 tokens. Text is first divided into smaller chunks; after the chunks are fed to BERT, the intermediate results are pooled. The implementation allows fine-tuning.
crosscheckgpt-dev
CrossCheckGPT internal dev
data2
Text NLP data
dotfiles
:wrench: .files, including ~/.macos — sensible hacker defaults for macOS
fast-forward-indexes
Efficient interpolation-based ranking on CPUs
fastapi-ml-skeleton
Production-ready FastAPI skeleton app for serving machine learning models.
fastembed-rs
Library to generate text embeddings in Rust
G-Retriever
Repository for G-Retriever
GLiNER
Generalist model for NER (extracts any entity type from text)
GPT4DFCI
generative AI tool, based on GPT-4 and deployed for non-clinical
graph-rag
Graph-based retrieval + GenAI = Better RAG in production
LexicHash
A novel method for sequence similarity estimation
Local-Qdrant-RAG
Local Ollama with Qdrant RAG: Embed, index, and enhance models for retrieval-augmented generation. Get started with easy setup for powerful language processing. #NLP #Qdrant #Embedding #Indexing
ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
ranx
⚡️A Blazing-Fast Python Library for Ranking Evaluation, Comparison, and Fusion 🐍
RefChecker
RefChecker provides an automatic checking pipeline and a benchmark dataset for detecting fine-grained hallucinations generated by Large Language Models.
selfcheckgpt
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models
zdocs
Docs of repo