zye1996

zye1996

Geek Repo

Company:GMU

Location:Fairfax, VA

Home Page:zye1996.github.io

Github PK Tool:Github PK Tool

zye1996's starred repositories

meilisearch

A lightning-fast search API that fits effortlessly into your apps, websites, and workflow

Language:RustLicense:MITStargazers:45515Issues:0Issues:0

YT-Spammer-Purge

Allows you easily scan for and delete scam comments using several methods.

Language:PythonLicense:GPL-3.0Stargazers:4532Issues:0Issues:0

EfficientTrain

1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundation visual backbones.

Language:PythonLicense:MITStargazers:192Issues:0Issues:0

marker

Convert PDF to markdown quickly with high accuracy

Language:PythonLicense:GPL-3.0Stargazers:14723Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:248Issues:0Issues:0

slidev

Presentation Slides for Developers

Language:TypeScriptLicense:MITStargazers:32127Issues:0Issues:0

autodistill

Images to inference with no labeling (use foundation models to train supervised models).

Language:PythonLicense:Apache-2.0Stargazers:1719Issues:0Issues:0

GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language:PythonLicense:Apache-2.0Stargazers:5792Issues:0Issues:0

gorilla

Gorilla: An API store for LLMs

Language:PythonLicense:Apache-2.0Stargazers:10915Issues:0Issues:0

gpt-crawler

Crawl a site to generate knowledge files to create your own custom GPT from a URL

Language:TypeScriptLicense:ISCStargazers:18267Issues:0Issues:0

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonLicense:MITStargazers:3880Issues:0Issues:0

surya

OCR, layout analysis, reading order, line detection in 90+ languages

Language:PythonLicense:GPL-3.0Stargazers:9302Issues:0Issues:0
Language:Jupyter NotebookLicense:NOASSERTIONStargazers:31Issues:0Issues:0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonLicense:MITStargazers:19241Issues:0Issues:0

DocumentLayoutAnalysis

Document Layout Analysis resources repos for development with PdfPig.

Language:C#Stargazers:563Issues:0Issues:0

CnSTD

CnSTD: 基于 PyTorch/MXNet 的 中文/英文 场景文字检测(Scene Text Detection)、数学公式检测(Mathematical Formula Detection, MFD)、篇章分析(Layout Analysis)的Python3 包

Language:PythonLicense:Apache-2.0Stargazers:656Issues:0Issues:0

PaddleOCR2Pytorch

PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)

Language:PythonLicense:Apache-2.0Stargazers:828Issues:0Issues:0

GaLore

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Language:PythonLicense:Apache-2.0Stargazers:1280Issues:0Issues:0

whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Language:PythonLicense:AGPL-3.0Stargazers:1740Issues:0Issues:0

ai

Build AI-powered applications with React, Svelte, Vue, and Solid

Language:TypeScriptLicense:NOASSERTIONStargazers:8795Issues:0Issues:0

BitNet

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch

Language:PythonLicense:MITStargazers:1466Issues:0Issues:0

AutoPrompt

A framework for prompt tuning using Intent-based Prompt Calibration

Language:PythonLicense:Apache-2.0Stargazers:1918Issues:0Issues:0

Chinese-medical-dialogue-data

Chinese medical dialogue data 中文医疗对话数据集

Language:PythonLicense:MITStargazers:1091Issues:0Issues:0

vearch

Distributed vector search for AI-native applications

Language:GoLicense:Apache-2.0Stargazers:1990Issues:0Issues:0

LESS

[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning

Language:Jupyter NotebookLicense:MITStargazers:293Issues:0Issues:0

magika

Detect file content types with deep learning

Language:RustLicense:Apache-2.0Stargazers:7584Issues:0Issues:0

jieba

结巴中文分词

Language:PythonLicense:MITStargazers:32855Issues:0Issues:0

speechbrain

A PyTorch-based Speech Toolkit

Language:PythonLicense:Apache-2.0Stargazers:8299Issues:0Issues:0

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonLicense:Apache-2.0Stargazers:11055Issues:0Issues:0