Cuog Ng (cuongngm)

Company: ULTeam

Location: Hanoi, Vietnam

Home Page: https://cuongngm.github.io/

Cuog Ng's starred repositories

stable-diffusion-webui

Stable Diffusion web UI

Language: Python | License: AGPL-3.0 | Stargazers: 133033 | Issues: 1038 | Issues: 7441

gpt-engineer

Specify what you want it to build, the AI asks for clarification, and then builds it.

Language: Python | License: MIT | Stargazers: 50972 | Issues: 501 | Issues: 459

DocsGPT

GPT-powered chat for documentation, chat with your documents

Language: Python | License: MIT | Stargazers: 14314 | Issues: 87 | Issues: 346

marker

Convert PDF to markdown quickly with high accuracy

Language: Python | License: GPL-3.0 | Stargazers: 11135 | Issues: 48 | Issues: 123

nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Language: Python | License: MIT | Stargazers: 8283 | Issues: 69 | Issues: 190

accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Language: Python | License: Apache-2.0 | Stargazers: 7190 | Issues: 98 | Issues: 1415

Parsr

Transforms PDF, Documents and Images into Enriched Structured Data

Language: JavaScript | License: Apache-2.0 | Stargazers: 5682 | Issues: 82 | Issues: 162

donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Language: Python | License: MIT | Stargazers: 5435 | Issues: 45 | Issues: 288

OutfitAnyone

Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person

CompreFace

Leading free and open-source face recognition system

Language: Java | License: Apache-2.0 | Stargazers: 4683 | Issues: 78 | Issues: 278
Language: Jupyter Notebook | License: MIT | Stargazers: 4192 | Issues: 71 | Issues: 17

img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Language: Python | License: MIT | Stargazers: 3356 | Issues: 30 | Issues: 249

anylabeling

Effortless AI-assisted data labeling with support from YOLO, Segment Anything, and MobileSAM.

Language: Python | License: GPL-3.0 | Stargazers: 1935 | Issues: 20 | Issues: 119

Caption-Anything

Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything

Language: Python | License: BSD-3-Clause | Stargazers: 1617 | Issues: 15 | Issues: 21

AdvancedLiterateMachinery

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

Language: C++ | License: Apache-2.0 | Stargazers: 1062 | Issues: 26 | Issues: 138

salt

Segment Anything Labelling Tool

Language: Python | License: MIT | Stargazers: 993 | Issues: 9 | Issues: 36

parseq

Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)

Language: Python | License: Apache-2.0 | Stargazers: 511 | Issues: 13 | Issues: 133

OCR-SAM

Combining MMOCR with Segment Anything & Stable Diffusion. Automatically detect, recognize, and segment text instances, with several downstream tasks, e.g., Text Removal and Text Inpainting

vnstock

A powerful Python library for getting rich data from the Vietnam Stock Market using just a few lines of code

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 411 | Issues: 35 | Issues: 52

tryondiffusion

TryOnDiffusion: A Tale of Two UNets Implementation

Language: Jupyter Notebook | Stargazers: 309 | Issues: 32 | Issues: 19

MiVOLO

MiVOLO age & gender transformer neural network

ViFi-CLIP

[CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".

Language: Python | License: MIT | Stargazers: 220 | Issues: 9 | Issues: 20

TableGeneration

Generate table images by rendering them in a browser

Language: Python | License: MIT | Stargazers: 168 | Issues: 5 | Issues: 12

mindocr

A toolbox of OCR models, algorithms, and pipelines based on MindSpore

Language: Python | License: Apache-2.0 | Stargazers: 165 | Issues: 14 | Issues: 91

layoutlmv3-triton-server

An NVIDIA Triton Server workflow for OCR and the LayoutLMv3 Transformer Model

IQA_IE_receipt

Code for the report "MC-OCR Challenge 2021: Simple approach for receipt information extraction and quality evaluation" (RIVF 2021)

vietocr

Vision Transformer OCR

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2 | Issues: 0 | Issues: 0

KAGGLE-RSNA-Screening-Mammography-Breast-Cancer-Detection

Code for the Kaggle competition titled "RSNA Screening Mammography Breast Cancer Detection"

Language: Jupyter Notebook | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0