Tracy Shen (tbs17)

Company: Unstructured

Location: State College, PA

Home Page: https://thinkregressively.netlify.app/

Tracy Shen's starred repositories

dspy

DSPy: The framework for programming—not prompting—foundation models

Language: Python | License: MIT | Stargazers: 13515 | Issues: 0

langchain-pandas

Example of how to use LangChain and Vertex AI Generative AI to ask plain English questions about Pandas dataframes.

Language: Python | Stargazers: 4 | Issues: 0

uform

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

Language: Python | License: Apache-2.0 | Stargazers: 954 | Issues: 0

pdfplumber

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.

Language: Python | License: MIT | Stargazers: 5868 | Issues: 0

unstructured-python-client

A Python client for the Unstructured hosted API

Language: Python | License: MIT | Stargazers: 55 | Issues: 0

CLIP

CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image

Language: Jupyter Notebook | License: MIT | Stargazers: 23260 | Issues: 0

sentence-transformers

Multilingual Sentence & Image Embeddings with BERT

Language: Python | License: Apache-2.0 | Stargazers: 14267 | Issues: 0

InferSent

InferSent sentence embeddings

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 2278 | Issues: 0

audio-examples

Sample audio files

Stargazers: 3 | Issues: 0

EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.

Language: Python | License: Apache-2.0 | Stargazers: 22664 | Issues: 0

donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Language: Python | License: MIT | Stargazers: 5478 | Issues: 0

llm-finetuning

Guide for fine-tuning Llama/Mistral/CodeLlama models and more

Language: Python | License: MIT | Stargazers: 472 | Issues: 0

UniTAB

UniTAB: Unifying Text and Box Outputs for Grounded VL Modeling, ECCV 2022 (Oral Presentation)

Language: Python | License: MIT | Stargazers: 82 | Issues: 0

examples

📝 Examples of how to use Neptune for different use cases and with various MLOps tools

Language: Jupyter Notebook | License: MIT | Stargazers: 72 | Issues: 0

cjm-yolox-pytorch

A PyTorch implementation of the YOLOX object detection model based on the mmdetection implementation.

Language: Jupyter Notebook | License: MIT | Stargazers: 5 | Issues: 0

modal-examples

Examples of programs built using Modal

Language: Python | License: MIT | Stargazers: 629 | Issues: 0

project-images-segmentation

Experiment tracking and model registry in the image segmentation project

Language: Jupyter Notebook | License: MIT | Stargazers: 2 | Issues: 0

unitable

UniTable: Towards a Unified Table Foundation Model

Language: Jupyter Notebook | License: MIT | Stargazers: 257 | Issues: 0

SuperCLUE-RAG

A native Chinese evaluation benchmark for retrieval-augmented generation (RAG)

Stargazers: 74 | Issues: 0

MTL-TabNet

MTL-TabNet: Multi-task Learning based Model for Image-based Table Recognition

Language: Python | License: Apache-2.0 | Stargazers: 79 | Issues: 0

DCNv2_latest

DCNv2 with support for recent PyTorch versions, such as torch 1.5+ (now 1.8+)

Language: C++ | License: BSD-3-Clause | Stargazers: 604 | Issues: 0

CenterNet

Object detection, 3D detection, and pose estimation using center point detection

Language: Python | License: MIT | Stargazers: 7194 | Issues: 0

pytorch-retinanet

RetinaNet in PyTorch

Language: Python | Stargazers: 989 | Issues: 0

cocoapi

COCO API - Dataset @ http://cocodataset.org/

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 5999 | Issues: 0

AdvancedLiterateMachinery

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

Language: C++ | License: Apache-2.0 | Stargazers: 1104 | Issues: 0

tessdata

Trained models with fast variant of the "best" LSTM models + legacy models

License: Apache-2.0 | Stargazers: 6065 | Issues: 0

tesseract

Tesseract Open Source OCR Engine (main repository)

Language: C++ | License: Apache-2.0 | Stargazers: 59286 | Issues: 0

chug

Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.

Language: Python | License: Apache-2.0 | Stargazers: 137 | Issues: 0

mmdetection

OpenMMLab Detection Toolbox and Benchmark

Language: Python | License: Apache-2.0 | Stargazers: 28398 | Issues: 0

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language: Python | License: Apache-2.0 | Stargazers: 9650 | Issues: 0