zye1996

zye1996

Geek Repo

Company:GMU

Location:Fairfax, VA

Home Page:zye1996.github.io

Github PK Tool:Github PK Tool

zye1996's starred repositories

meilisearch

A lightning-fast search API that fits effortlessly into your apps, websites, and workflow

slidev

Presentation Slides for Developers

Language:TypeScriptLicense:MITStargazers:32126Issues:134Issues:987

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonLicense:MITStargazers:19240Issues:297Issues:1339

gpt-crawler

Crawl a site to generate knowledge files to create your own custom GPT from a URL

Language:TypeScriptLicense:ISCStargazers:18268Issues:117Issues:112

amplication

🔥🔥🔥 The Only Production-Ready AI-Powered Backend Code Generation

Language:TypeScriptLicense:NOASSERTIONStargazers:14822Issues:91Issues:3625

marker

Convert PDF to markdown quickly with high accuracy

Language:PythonLicense:GPL-3.0Stargazers:14718Issues:60Issues:179

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language:PythonLicense:Apache-2.0Stargazers:12580Issues:78Issues:809

cobalt

save what you love

Language:JavaScriptLicense:AGPL-3.0Stargazers:11689Issues:53Issues:372

gorilla

Gorilla: An API store for LLMs

Language:PythonLicense:Apache-2.0Stargazers:10915Issues:102Issues:194

surya

OCR, layout analysis, reading order, line detection in 90+ languages

Language:PythonLicense:GPL-3.0Stargazers:9302Issues:79Issues:100

awesome-software-architecture

🚀 A curated list of awesome articles, videos, and other resources to learn and practice software architecture, patterns, and principles.

GPTCache

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

Language:PythonLicense:MITStargazers:6949Issues:59Issues:159

ChatLaw

ChatLaw:A Powerful LLM Tailored for Chinese Legal. 中文法律大模型

GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language:PythonLicense:Apache-2.0Stargazers:5787Issues:37Issues:287

YT-Spammer-Purge

Allows you easily scan for and delete scam comments using several methods.

Language:PythonLicense:GPL-3.0Stargazers:4532Issues:47Issues:508

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonLicense:MITStargazers:3879Issues:114Issues:73

EVA

EVA Series: Visual Representation Fantasies from BAAI

Language:PythonLicense:MITStargazers:2132Issues:31Issues:154

autodistill

Images to inference with no labeling (use foundation models to train supervised models).

Language:PythonLicense:Apache-2.0Stargazers:1718Issues:19Issues:93

infinity

Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of text-embedding models and frameworks.

Language:PythonLicense:MITStargazers:1078Issues:17Issues:109

PaddleOCR2Pytorch

PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)

Language:PythonLicense:Apache-2.0Stargazers:827Issues:15Issues:84

CnSTD

CnSTD: 基于 PyTorch/MXNet 的 中文/英文 场景文字检测(Scene Text Detection)、数学公式检测(Mathematical Formula Detection, MFD)、篇章分析(Layout Analysis)的Python3 包

Language:PythonLicense:Apache-2.0Stargazers:657Issues:14Issues:48

DocumentLayoutAnalysis

Document Layout Analysis resources repos for development with PdfPig.

omniglue

Code release for CVPR'24 submission 'OmniGlue'

Language:PythonLicense:Apache-2.0Stargazers:466Issues:11Issues:21

tevatron

Tevatron - A flexible toolkit for neural retrieval research and development.

Language:PythonLicense:Apache-2.0Stargazers:443Issues:10Issues:87
Language:PythonLicense:Apache-2.0Stargazers:248Issues:6Issues:11

EfficientTrain

1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundation visual backbones.

Language:PythonLicense:MITStargazers:192Issues:6Issues:11

reddit-dataset

Dataset of threads and comments from reddit

LexLIP-ICCV23

Official Code for the ICCV23 Paper: "LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Sparse Retrieval"

Language:PythonLicense:Apache-2.0Stargazers:36Issues:2Issues:6
Language:Jupyter NotebookLicense:NOASSERTIONStargazers:31Issues:1Issues:5