Tracy Shen (tbs17)

tbs17

Geek Repo

Company:Unstructured

Location:State College, PA

Home Page:https://thinkregressively.netlify.app/

Github PK Tool:Github PK Tool

Tracy Shen's starred repositories

mmdetection

OpenMMLab Detection Toolbox and Benchmark

Language:PythonLicense:Apache-2.0Stargazers:28529Issues:0Issues:0

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language:PythonLicense:Apache-2.0Stargazers:11197Issues:0Issues:0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:18014Issues:0Issues:0

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

Stargazers:10500Issues:0Issues:0

mPLUG-DocOwl

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Language:PythonLicense:Apache-2.0Stargazers:1129Issues:0Issues:0

detr

End-to-End Object Detection with Transformers

Language:PythonLicense:Apache-2.0Stargazers:13100Issues:0Issues:0

DAVAR-Lab-OCR

OCR toolbox from Davar-Lab

Language:PythonLicense:Apache-2.0Stargazers:721Issues:0Issues:0

llama_parse

Parse files for optimal RAG

Language:PythonLicense:MITStargazers:1824Issues:0Issues:0
Language:Jupyter NotebookLicense:NOASSERTIONStargazers:380Issues:0Issues:0
License:Apache-2.0Stargazers:28Issues:0Issues:0

DocLayNet

DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis

License:NOASSERTIONStargazers:205Issues:0Issues:0

SciTSR

Table structure recognition dataset of the paper: Complicated Table Structure Recognition

Language:PythonLicense:MITStargazers:336Issues:0Issues:0

StringZilla

Up to 10x faster strings for C, C++, Python, Rust, and Swift, leveraging SWAR and SIMD on Arm Neon and x86 AVX2 & AVX-512-capable chips to accelerate search, sort, edit distances, alignment scores, etc 🦖

Language:C++License:Apache-2.0Stargazers:1894Issues:0Issues:0

CascadeTabNet

This repository contains the code and implementation details of the CascadeTabNet paper "CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents"

Language:PythonLicense:MITStargazers:1464Issues:0Issues:0

dvc

🦉 ML Experiments and Data Management with Git

Language:PythonLicense:Apache-2.0Stargazers:13384Issues:0Issues:0

YOLOX

YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/

Language:PythonLicense:Apache-2.0Stargazers:9164Issues:0Issues:0

RAGatouille

Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.

Language:PythonLicense:Apache-2.0Stargazers:2468Issues:0Issues:0

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

Language:PythonLicense:Apache-2.0Stargazers:40317Issues:0Issues:0

TableBank

TableBank: A Benchmark Dataset for Table Detection and Recognition

License:Apache-2.0Stargazers:987Issues:0Issues:0

table-transformer

Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.

Language:PythonLicense:MITStargazers:2027Issues:0Issues:0

01

The open-source language model computer

Language:PythonLicense:AGPL-3.0Stargazers:4758Issues:0Issues:0

LLM-FineTuning-Large-Language-Models

LLM (Large Language Model) FineTuning

Language:Jupyter NotebookStargazers:393Issues:0Issues:0

reor

Private & local AI personal knowledge management app.

Language:TypeScriptLicense:AGPL-3.0Stargazers:6526Issues:0Issues:0

tesseract

Tesseract Open Source OCR Engine (main repository)

Language:C++License:Apache-2.0Stargazers:2933Issues:0Issues:0

unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

Language:HTMLLicense:Apache-2.0Stargazers:7510Issues:0Issues:0

codellama

Inference code for CodeLlama models

Language:PythonLicense:NOASSERTIONStargazers:15435Issues:0Issues:0

ParlayANN

A library of algorithms for approximate nearest neighbor search in high dimensions, along with a set of useful tools for designing such algorithms.

Language:C++License:MITStargazers:85Issues:0Issues:0

financial-vss

Notebooks demonstrating vector search & RAG design patterns with Redis Python clients.

Language:Jupyter NotebookLicense:MITStargazers:7Issues:0Issues:0

chatgpt-memory

Allows to scale the ChatGPT API to multiple simultaneous sessions with infinite contextual and adaptive memory powered by GPT and Redis datastore.

Language:PythonLicense:Apache-2.0Stargazers:513Issues:0Issues:0

redis-product-search

Visual and semantic vector similarity with Redis Stack, FastAPI, PyTorch and Huggingface.

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:151Issues:0Issues:0