SLTK1

Shenzhen, China

Zheng Bowen's starred repositories

langflow

Langflow is a low-code app builder for RAG and multi-agent AI applications. It’s Python-based and agnostic to any model, API, or database.

Language: Python | MIT | 26072 202 1238

generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

Apache-2.0 | 16136 132 123

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language: Python | Apache-2.0 | 15663 101 988

qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Language: Jupyter Notebook | MIT | 9861 84 247

fiftyone

The open-source tool for building high-quality datasets and computer vision models

Language: Python | Apache-2.0 | 8024 55 1496

ARC-AGI

The Abstraction and Reasoning Corpus

Language: JavaScript | Apache-2.0 | 3245 96 65

cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Language: Python | Apache-2.0 | 1673 23 62

torchtitan

A native PyTorch Library for large model training

Language: Python | BSD-3-Clause | 1492 35 125

OpenDiT

OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference

Language: Python | Apache-2.0 | 1413 23 60

License-Plate-Detector

License plate detection based on YOLOv5, faster and more accurate.

Language: Python | 1189 38 72

LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language: Python | MIT | 1164 21 52

LLaVA-pp

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Language: Python | 784 10 32

groundingLMM

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Language: Python | 732 28 68

camera_calibration

Accurate geometric camera calibration with generic camera models

Language: C++ | BSD-3-Clause | 690 29 67

libSGM

Stereo Semi-Global Matching with CUDA

Language: C++ | Apache-2.0 | 609 31 69

Segmentation-Pytorch

Semantic segmentation in PyTorch. Networks include: FCN, FCN_ResNet, SegNet, UNet, BiSeNet, BiSeNetV2, PSPNet, DeepLabv3_plus, HRNet, DDRNet

Language: Python | 452 6 28

chat_templates

Chat Templates for 🤗 HuggingFace Large Language Models

Language: Jinja | MIT | 426 6 10

Rotated_IoU

Differentiable IoU of rotated bounding boxes using PyTorch

Language: Python | MIT | 411 9 54

awesome-large-multimodal-agents

Hi-SAM

[arXiv preprint] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation

Language: Python | Apache-2.0 | 183 12 18

DocScanner

The official repo for “DocScanner: Robust Document Image Rectification with Progressive Learning”.

Language: Python | NOASSERTION | 148 18 9

OneChart

[ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"

Language: Python | Apache-2.0 | 134 1 16

Awesome-Chart-Understanding

A curated list of recent and past chart understanding work based on our survey paper: From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models.

Table-LLaVA

Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train Dataset for table understanding and develop a generalist tabular MLLM named Table-LLaVA.

Language: Python | Apache-2.0 | 121 6 8

Ant-Multi-Modal-Framework

Research Code for Multimodal-Cognition Team in Ant Group

Language: Python | CC-BY-4.0 | 101 4 18

awesome-table-structure-recognition

A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating.

Apache-2.0 | 100 5 3

DocGenome

DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models

Language: Jupyter Notebook | CC-BY-4.0 | 85 4 4

RapidLayout

Layout analysis for Chinese and English documents

Language: Python | Apache-2.0 | 81 40

vllm-fork

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python | Apache-2.0 | 27 20

Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Language: Python | NOASSERTION | 100