thinhhnt's repositories

donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Language:PythonLicense:MITStargazers:2Issues:0Issues:0

ang-jsoneditor

Angular Jsoneditor that works with angular 4 to angular 15

Language:TypeScriptLicense:MITStargazers:0Issues:0Issues:0

AnimateDiff

Official implementation of AnimateDiff.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

arcface

Forked from https://github.com/deepinsight/insightface

Language:PythonStargazers:0Issues:0Issues:0

Awesome-Masked-Autoencoders

A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He el al.).

License:MITStargazers:0Issues:0Issues:0

big_vision

Official codebase used to develop Vision Transformer, MLP-Mixer, LiT and more.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

CodeTF

CodeTF: One-stop Transformer Library for State-of-the-art Code LLM

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

DocDiff

DocDiff: Document Enhancement via Residual Diffusion Models. The first diffusion-based models designed for diverse document enhancement tasks. This model is lightweight, efficient, flexible and can also be used for img2img tasks in natural scenes.

Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

jsonformer

A Bulletproof Way to Generate Structured JSON from Language Models

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

LinkBERT

[ACL 2022] LinkBERT: A Knowledgeable Language Model 😎 Pretrained with Document Links

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

llama

Inference code for LLaMA models

License:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

ngx-admin

Customizable admin dashboard template based on Angular 10+

License:MITStargazers:0Issues:0Issues:0

nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

License:MITStargazers:0Issues:0Issues:0

open_flamingo

An open-source framework for training large multimodal models

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

OpenLLM

Operating LLMs in production

License:Apache-2.0Stargazers:0Issues:0Issues:0

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

RFnet

[3DV 22] Official implementation of Robust RGB-D Fusion Network for Saliency Detection

Stargazers:0Issues:0Issues:0

RGBD-SODsurvey

RGB-D Salient Object Detection: A Survey

Language:MATLABStargazers:0Issues:0Issues:0

Swin-MAE

Pytorch implementation of Swin MAE https://arxiv.org/abs/2212.13805

Language:PythonStargazers:0Issues:0Issues:0

thinh-huynh-re

Config files for my GitHub profile.

Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

TTACIL

Rethinking Class-incremental Learning in the Era of Large Pre-trained Models via Test-Time Adaptation

Language:PythonStargazers:0Issues:0Issues:0

unlimiformer

Public repo for the preprint "Unlimiformer: Long-Range Transformers with Unlimited Length Input"

License:MITStargazers:0Issues:0Issues:0

webviewer-angular-sample

Sample to demonstrate integrating WebViewer into Angular

License:NOASSERTIONStargazers:0Issues:0Issues:0