Nguyễn Hoàng Long (oggyfaker)

oggyfaker

Geek Repo

Company:Fruit AI Researcher

Location:Ho Chi Minh city

Github PK Tool:Github PK Tool

Nguyễn Hoàng Long 's starred repositories

LynxHub

Manage and launch all your AI from a single dashboard.

Language:TypeScriptLicense:GPL-3.0Stargazers:99Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:6Issues:0Issues:0

ibis

the portable Python dataframe library

Language:PythonLicense:Apache-2.0Stargazers:5126Issues:0Issues:0

GOT-OCR2.0

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Language:PythonStargazers:4661Issues:0Issues:0

build-nanogpt

Video+code lecture on building nanoGPT from scratch

Language:PythonStargazers:3433Issues:0Issues:0

toma

Helps you write algorithms in PyTorch that adapt to the available (CUDA) memory

Language:PythonLicense:MITStargazers:420Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1428Issues:0Issues:0

flux

Official inference repo for FLUX.1 models

Language:PythonLicense:Apache-2.0Stargazers:14272Issues:0Issues:0

hydra

Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"

Language:PythonStargazers:97Issues:0Issues:0

segment-caption-anything

[CVPR 24] The repository provides code for running inference and training for "Segment and Caption Anything" (SCA) , links for downloading the trained model checkpoints, and example notebooks / gradio demo that show how to use the model.

Language:PythonLicense:Apache-2.0Stargazers:186Issues:0Issues:0

masa

Official Implementation of CVPR24 highligt paper: Matching Anything by Segmenting Anything

Language:PythonLicense:Apache-2.0Stargazers:967Issues:0Issues:0

GPTViet

This project aims to develop a bilingual foundation model with both language and multimodal capabilities. The objective is to enhance an existing open-source English model, optimizing it for the Vietnamese 🇻🇳 language.

Language:PythonLicense:Apache-2.0Stargazers:8Issues:0Issues:0
Language:ShellStargazers:4771Issues:0Issues:0

torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Language:PythonLicense:MITStargazers:926Issues:0Issues:0

MASR

Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。

Language:PythonLicense:Apache-2.0Stargazers:596Issues:0Issues:0

EEG-Conformer

EEG Transformer 2.0. i. Convolutional Transformer for EEG Decoding. ii. Novel visualization - Class Activation Topography.

Language:PythonLicense:GPL-3.0Stargazers:405Issues:0Issues:0

lightning-asr

Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.

Language:PythonLicense:MITStargazers:45Issues:0Issues:0

conformer

PyTorch implementation of Conformer: Convolution-augmented Transformer for Speech Recognition

Language:PythonLicense:MITStargazers:13Issues:0Issues:0

conformer_ocr

Transformer OCR is a Optical Character Recognition tookit built for researchers working on both OCR for both Vietnamese and English. This project only focused on variants of vanilla Transformer (Conformer) and Feature Extraction (CNN-based approach).

Language:PythonStargazers:9Issues:0Issues:0
Language:PythonStargazers:3Issues:0Issues:0

Conformer

Implementing automatic speech recognition Conformer in PyTorch on Librispeech-100

Language:PythonLicense:MITStargazers:2Issues:0Issues:0

x-transformers

A concise but complete full-attention transformer with a set of promising experimental features from various papers

Language:PythonLicense:MITStargazers:4606Issues:0Issues:0

Arc2Face

[ECCV 2024 Oral🔥] Arc2Face: A Foundation Model for ID-Consistent Human Faces

Language:PythonLicense:MITStargazers:567Issues:0Issues:0

mamba

Mamba SSM architecture

Language:PythonLicense:Apache-2.0Stargazers:12702Issues:0Issues:0

speechbrain

A PyTorch-based Speech Toolkit

Language:PythonLicense:Apache-2.0Stargazers:8619Issues:0Issues:0

chat-with-mlx

An all-in-one LLMs Chat UI for Apple Silicon Mac using MLX Framework.

Language:PythonLicense:MITStargazers:1463Issues:0Issues:0

OpenVoice

Instant voice cloning by MIT and MyShell.

Language:PythonLicense:MITStargazers:28639Issues:0Issues:0

clip-retrieval

Easily compute clip embeddings and build a clip retrieval system with them

Language:Jupyter NotebookLicense:MITStargazers:2366Issues:0Issues:0

screenshot-to-code

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Language:PythonLicense:MITStargazers:56369Issues:0Issues:0

SwinTextSpotter

Pytorch re-implementation of Paper: SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition (CVPR 2022)

Language:PythonStargazers:268Issues:0Issues:0