Jina AI's repositories
node-DeepResearch
Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)
late-chunking
Code for explaining and evaluating late chunking (chunked pooling)
correlations
Simple UI for debugging correlations of text embeddings
meta-prompt
For LLMs to better code with Jina API
mlx-retrieval
Train embedding and reranker models for retrieval tasks on Apple Silicon with MLX
deepsearch-ui
Jina DeepSearch UI
submodular-optimization
Submodular optimization for context engineering: query fan-out, text selection, passage reranking
llm-query-expansion
Query Expension for Better Query Embedding using LLMs
jina-embeddings-v4-gguf
A collection of GGUF and quantizations for jina-embeddings-v4
jina-sagemaker
Jina Embedding Models on AWS SageMaker
mteb-long-documents
MTEB: Massive Text Embedding Benchmark
puppeteer-extra-plugin-page-proxy
Additional module to use with 'puppeteer' for setting proxies per page basis.
bof-emnlp2025-embeddings-rerankers-smallLMs-for-better-search
Nov. 7 EMNLP2025 BoF: Embeddings, Rerankers, Small LMs for Better Search
jina-reranker-m0-gguf
A collection of GGUF and quantizations for jina-embeddings-v4
mteb-jinavdr
MTEB: Massive Text Embedding Benchmark
gpt-oss
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
multimodal-reranker-test
samples to evaluate a multimodal neural reranker