datalee's starred repositories

LocalAI

🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more model architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed inference

searxng

SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.

Language: Python · License: AGPL-3.0 · Stargazers: 12662 · Issues: 111 · Issues: 1297

LLMsPracticalGuide

A curated list of practical guide resources for LLMs (LLMs Tree, Examples, Papers)

supermemory

Build your own second brain with supermemory. It's a ChatGPT for your bookmarks. Import tweets or save websites and content using the chrome extension.

Language: TypeScript · License: MIT · Stargazers: 6599 · Issues: 27 · Issues: 142

transformer-explainer

Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization

Language: JavaScript · License: MIT · Stargazers: 2669 · Issues: 27 · Issues: 16

llm_aided_ocr

Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.

auto-cot

Official implementation for "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned & more will be updated)

Language: Jupyter Notebook · License: Apache-2.0 · Stargazers: 1458 · Issues: 13 · Issues: 6

Memary

The Open Source Memory Layer For Autonomous Agents

Language: Jupyter Notebook · License: MIT · Stargazers: 1403 · Issues: 14 · Issues: 28

openai-forward

🚀 An efficient forwarding service designed for LLMs. · OpenAI API Reverse Proxy

Language: Python · License: MIT · Stargazers: 818 · Issues: 5 · Issues: 60

JSON-Viewer

A JSON viewer plugin for Notepad++. Displays the selected JSON string in a tree view.

Language: C++ · License: MIT · Stargazers: 742 · Issues: 35 · Issues: 133

MInference

MInference speeds up long-context LLM inference by computing attention with approximate, dynamic sparse methods, which reduces pre-filling latency by up to 10x on an A100 while maintaining accuracy.

Language: Python · License: MIT · Stargazers: 717 · Issues: 6 · Issues: 53
Language: Python · License: BSD-3-Clause · Stargazers: 544 · Issues: 7 · Issues: 66

LabelLLM

The Open-Source Data Annotation Platform

Language: TypeScript · License: Apache-2.0 · Stargazers: 513 · Issues: 9 · Issues: 25

search_with_ai

🤖 Free Search with AI, 💡 Open-Source Perplexity, 📍 Support Ollama/SearXNG, Support Docker deployment. Let large AI models and search engines answer your questions; supports local LLMs (Ollama) and the SearXNG metasearch engine, with one-click Docker deployment.

Language: TypeScript · License: MIT · Stargazers: 489 · Issues: 9 · Issues: 35

ragbuilder

A toolkit to create an optimal production-ready RAG setup for your data

Language: Python · License: Apache-2.0 · Stargazers: 420 · Issues: 6 · Issues: 8

metaso-free-api

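🚀 Reverse-engineered free API for Metaso AI Search, for testing [strengths: powerful retrieval and very long output]. Supports high-speed streaming output and powerful web search (whole-web or academic scope, with concise, in-depth, and research modes), zero-configuration deployment, and multiple tokens.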

Language: TypeScript · License: GPL-3.0 · Stargazers: 418 · Issues: 10 · Issues: 21

langsmith-sdk

LangSmith Client SDK Implementations

Language: Python · License: MIT · Stargazers: 384 · Issues: 7 · Issues: 227

freshqa

Data and code for FreshLLMs (https://arxiv.org/abs/2310.03214)

Language: Jupyter Notebook · License: Apache-2.0 · Stargazers: 321 · Issues: 17 · Issues: 2
Language: Python · License: Apache-2.0 · Stargazers: 199 · Issues: 6 · Issues: 7

filco

[Preprint] Learning to Filter Context for Retrieval-Augmented Generation

Language: Python · License: CC-BY-SA-4.0 · Stargazers: 182 · Issues: 2 · Issues: 9

flexible-clustering

Clustering for arbitrary data and dissimilarity function

Language: Python · License: BSD-3-Clause · Stargazers: 85 · Issues: 4 · Issues: 6

sdft

[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".

Language: Shell · License: Apache-2.0 · Stargazers: 85 · Issues: 6 · Issues: 12

recomp

RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation.

Language: Python · License: MIT · Stargazers: 81 · Issues: 4 · Issues: 8

rag-best-practices

Best practices for retrieval-augmented generation (RAG) with large language models.

Language: Python · License: LGPL-3.0 · Stargazers: 33 · Issues: 0 · Issues: 0

Llama-3-SynE

Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | Continued pre-training to improve Llama-3's scientific reasoning and Chinese language capabilities

ConvGQR

ConvGQR: Generative Query Reformulation for Conversational Search. A codebase for ACL 2023 accepted paper.

lmdeploy-build

Nightly Build for LMDeploy

Language: PowerShell · License: MIT · Stargazers: 9 · Issues: 0 · Issues: 0

rrf

Reciprocal rank fusion with BM25 and semantic search

Language: Jupyter Notebook · Stargazers: 4 · Issues: 0 · Issues: 0

conv-llm

Official repository of the paper: Galimzhanova et al., "Rewriting Conversational Utterances with Instructed Large Language Models", Long Paper @ WI-IAT 2023.

Language: Python · License: MIT · Stargazers: 3 · Issues: 4 · Issues: 0
Language: Python · Stargazers: 2 · Issues: 0 · Issues: 0