zhch158

zhch158

Geek Repo

Location:china

Github PK Tool:Github PK Tool

zhch158's starred repositories

quivr

Open-source RAG Framework for building GenAI Second Brains 🧠 Build productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, Groq that you can share with users ! Efficient retrieval augmented generation framework

Language:PythonLicense:NOASSERTIONStargazers:36287Issues:0Issues:0

prometheus-eval

Evaluate your LLM's response with Prometheus and GPT4 💯

Language:PythonLicense:Apache-2.0Stargazers:772Issues:0Issues:0

Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Language:Jupyter NotebookLicense:MITStargazers:9218Issues:0Issues:0

self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Language:PythonLicense:MITStargazers:1779Issues:0Issues:0

vscode-remote-try-python

Python sample project for trying out Dev Containers

Language:PythonLicense:MITStargazers:766Issues:0Issues:0

table-transformer

Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.

Language:PythonLicense:MITStargazers:2241Issues:0Issues:0

nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Language:PythonLicense:MITStargazers:8865Issues:0Issues:0

layout-parser

A Unified Toolkit for Deep Learning Based Document Image Analysis

Language:PythonLicense:Apache-2.0Stargazers:4844Issues:0Issues:0

unsloth

Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonLicense:Apache-2.0Stargazers:16825Issues:0Issues:0

llamafile

Distribute and run LLMs with a single file.

Language:C++License:NOASSERTIONStargazers:19741Issues:0Issues:0

Mr.-Ranedeer-AI-Tutor

A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.

Stargazers:28657Issues:0Issues:0

chatgptProxyAPI

🔥 使用cloudflare 搭建免费的 OpenAI api代理 ,解决网络无法访问问题。支持流式输出

Language:HTMLLicense:MITStargazers:2941Issues:0Issues:0

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language:PythonLicense:Apache-2.0Stargazers:19286Issues:0Issues:0

sec-insights

A real world full-stack application using LlamaIndex

Language:TypeScriptLicense:MITStargazers:2340Issues:0Issues:0

Stirling-PDF

#1 Locally hosted web application that allows you to perform various operations on PDF files

Language:JavaLicense:MITStargazers:43596Issues:0Issues:0

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language:PythonLicense:Apache-2.0Stargazers:13692Issues:0Issues:0

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language:PythonLicense:MITStargazers:7125Issues:0Issues:0

llama_parse

Parse files for optimal RAG

Language:PythonLicense:MITStargazers:2808Issues:0Issues:0

ImageMagick

🧙‍♂️ ImageMagick 7

Language:CLicense:NOASSERTIONStargazers:12099Issues:0Issues:0

SLICEmyPDF

This project uses SLICE algorithm to extract information from a text-based PDF page containing financial statements (tabular data). It can also be used to extract regular tables but will contain all text on a page.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:60Issues:0Issues:0

pdfminer.six

Community maintained fork of pdfminer - we fathom PDF

Language:PythonLicense:MITStargazers:5877Issues:0Issues:0

tabula-py

Simple wrapper of tabula-java: extract table from PDF into pandas DataFrame

Language:PythonLicense:MITStargazers:2169Issues:0Issues:0

camelot

Camelot: PDF Table Extraction for Humans

Language:PythonLicense:NOASSERTIONStargazers:3647Issues:0Issues:0

tesseract

Tesseract Open Source OCR Engine (main repository)

Language:C++License:Apache-2.0Stargazers:61643Issues:0Issues:0

tesseract

Tesseract Open Source OCR Engine (main repository)

Language:C++License:Apache-2.0Stargazers:3093Issues:0Issues:0

poppler-windows

Download Poppler binaries packaged for Windows with dependencies

Language:ShellLicense:MITStargazers:542Issues:0Issues:0

onnx

Open standard for machine learning interoperability

Language:PythonLicense:Apache-2.0Stargazers:17757Issues:0Issues:0

pdf2htmlEX

Convert PDF to HTML without losing text or format.

Language:HTMLLicense:NOASSERTIONStargazers:3771Issues:0Issues:0
Language:PythonLicense:MITStargazers:5189Issues:0Issues:0

leptonai

A Pythonic framework to simplify AI service building

Language:PythonLicense:Apache-2.0Stargazers:2639Issues:0Issues:0