Tracy Shen (tbs17)

Company: Unstructured

Location: State College, PA

Home Page: https://thinkregressively.netlify.app/

Tracy Shen's starred repositories

promptfoo

Test your prompts, agents, and RAGs. Redteaming, pentesting, vulnerability scanning for LLMs. Improve your app's quality and catch problems. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.

Language: TypeScript | License: MIT | Stargazers: 3782 | Issues: 0

pathway

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

Language: Python | License: NOASSERTION | Stargazers: 2865 | Issues: 0

dspy

DSPy: The framework for programming—not prompting—foundation models

Language: Python | License: MIT | Stargazers: 15069 | Issues: 0

langchain-pandas

Example of how to use LangChain and Vertex AI Generative AI to ask plain English questions about Pandas dataframes.

Language: Python | Stargazers: 4 | Issues: 0

uform

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

Language: Python | License: Apache-2.0 | Stargazers: 989 | Issues: 0

pdfplumber

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.

Language: Python | License: MIT | Stargazers: 6130 | Issues: 0

unstructured-python-client

A Python client for the Unstructured hosted API

Language: Python | License: MIT | Stargazers: 63 | Issues: 0

CLIP

CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image

Language: Jupyter Notebook | License: MIT | Stargazers: 23974 | Issues: 0

sentence-transformers

Multilingual Sentence & Image Embeddings with BERT

Language: Python | License: Apache-2.0 | Stargazers: 14570 | Issues: 0

InferSent

InferSent sentence embeddings

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 2278 | Issues: 0

audio-examples

Sample audio files

Stargazers: 3 | Issues: 0

EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.

Language: Python | License: Apache-2.0 | Stargazers: 23081 | Issues: 0

donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Language: Python | License: MIT | Stargazers: 5587 | Issues: 0

llm-finetuning

Guide for fine-tuning Llama/Mistral/CodeLlama models and more

Language: Python | License: MIT | Stargazers: 497 | Issues: 0

UniTAB

UniTAB: Unifying Text and Box Outputs for Grounded VL Modeling, ECCV 2022 (Oral Presentation)

Language: Python | License: MIT | Stargazers: 84 | Issues: 0

examples

📝 Examples of how to use Neptune for different use cases and with various MLOps tools

Language: Jupyter Notebook | License: MIT | Stargazers: 75 | Issues: 0

cjm-yolox-pytorch

A PyTorch implementation of the YOLOX object detection model based on the mmdetection implementation.

Language: Jupyter Notebook | License: MIT | Stargazers: 5 | Issues: 0

modal-examples

Examples of programs built using Modal

Language: Python | License: MIT | Stargazers: 660 | Issues: 0

project-images-segmentation

Experiment tracking and model registry in the image segmentation project

Language: Jupyter Notebook | License: MIT | Stargazers: 2 | Issues: 0

unitable

UniTable: Towards a Unified Table Foundation Model

Language: Jupyter Notebook | License: MIT | Stargazers: 302 | Issues: 0

SuperCLUE-RAG

Chinese-native retrieval-augmented generation (RAG) evaluation benchmark

Stargazers: 82 | Issues: 0

MTL-TabNet

MTL-TabNet: Multi-task Learning based Model for Image-based Table Recognition

Language: Python | License: Apache-2.0 | Stargazers: 80 | Issues: 0

DCNv2_latest

DCNv2 with support for recent PyTorch versions such as torch 1.5+ (now 1.8+)

Language: C++ | License: BSD-3-Clause | Stargazers: 616 | Issues: 0

CenterNet

Object detection, 3D detection, and pose estimation using center point detection

Language: Python | License: MIT | Stargazers: 7217 | Issues: 0

pytorch-retinanet

RetinaNet in PyTorch

Language: Python | Stargazers: 991 | Issues: 0

cocoapi

COCO API - Dataset @ http://cocodataset.org/

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 6034 | Issues: 0

AdvancedLiterateMachinery

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

Language: C++ | License: Apache-2.0 | Stargazers: 1239 | Issues: 0

tessdata

Trained models with fast variant of the "best" LSTM models + legacy models

License: Apache-2.0 | Stargazers: 6163 | Issues: 0

tesseract

Tesseract Open Source OCR Engine (main repository)

Language: C++ | License: Apache-2.0 | Stargazers: 60054 | Issues: 0

chug

Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.

Language: Python | License: Apache-2.0 | Stargazers: 139 | Issues: 0