Phan Hoang (huyhoang17)

huyhoang17

Geek Repo

Company:HUST

Location:Hanoi, Vietnam

Home Page:https://viblo.asia/u/phanhoang

Twitter:@__phanhoang__

Github PK Tool:Github PK Tool

Phan Hoang's starred repositories

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:20456Issues:163Issues:145

llm.c

LLM training in simple, raw C/CUDA

Language:CudaLicense:MITStargazers:18899Issues:197Issues:100

uv

An extremely fast Python package installer and resolver, written in Rust.

Language:RustLicense:Apache-2.0Stargazers:11863Issues:27Issues:1437

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookLicense:MITStargazers:11430Issues:98Issues:163

pyo3

Rust bindings for the Python interpreter

Language:RustLicense:NOASSERTIONStargazers:11150Issues:91Issues:1263

MiniCPM

MiniCPM-2B: An end-side LLM outperforms Llama2-13B.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3906Issues:52Issues:110

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonLicense:MITStargazers:3530Issues:109Issues:55

torchtune

A Native-PyTorch Library for LLM Fine-tuning

Language:PythonLicense:BSD-3-ClauseStargazers:3207Issues:38Issues:283

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language:PythonLicense:Apache-2.0Stargazers:3017Issues:25Issues:110

efficient-kan

An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).

Language:PythonLicense:MITStargazers:2497Issues:19Issues:21

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型

Language:Jupyter NotebookLicense:MITStargazers:2358Issues:31Issues:149

prompt2model

prompt2model - Generate Deployable Models from Natural Language Instructions

Language:PythonLicense:Apache-2.0Stargazers:1904Issues:25Issues:167

InternLM-XComposer

InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.

MiniCPM-V

MiniCPM-V 2.0: An Efficient End-side MLLM with Strong OCR and Understanding Capabilities

Language:PythonLicense:Apache-2.0Stargazers:1434Issues:28Issues:72

Monkey

【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Language:PythonLicense:MITStargazers:1433Issues:19Issues:83

schedule_free

Schedule-Free Optimization in PyTorch

Language:PythonLicense:Apache-2.0Stargazers:1108Issues:13Issues:12

mPLUG-DocOwl

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Language:PythonLicense:Apache-2.0Stargazers:986Issues:26Issues:68

Arc2Face

Arc2Face: A Foundation Model of Human Faces

Language:PythonLicense:MITStargazers:453Issues:14Issues:17

segmenteverygrain

A SAM-based model for instance segmentation of images of grains

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:427Issues:16Issues:11

NIST-FRVT-Top-1-Face-Recognition

Face Recognition, Face Liveness Detection, Face Attribute Analysis (Age & Gender, Emotion, Demographics, Ethnicity and many more.)

Language:PythonStargazers:219Issues:5Issues:0

LM4VisualEncoding

[ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"

Language:PythonLicense:MITStargazers:190Issues:4Issues:7

unitable

UniTable: Towards a Unified Table Foundation Model

Language:Jupyter NotebookLicense:MITStargazers:162Issues:9Issues:4

ViTST

[NeurIPS 2023] The official repo for the paper: "Time Series as Images: Vision Transformer for Irregularly Sampled Time Series"."

Jamba

PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"

Language:PythonLicense:MITStargazers:67Issues:0Issues:0

Seg-NN

[CVPR2024 Hightlight] No Time to Train: Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation

prepacking

The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models"

Language:Jupyter NotebookStargazers:45Issues:2Issues:1

UVDoc

Code for the paper "UVDoc: Neural Grid-based Document Unwarping"

Language:C++License:MITStargazers:45Issues:0Issues:0

LaVy

Pioneering in Vietnamese Multimodal Large Language Model

Language:PythonStargazers:28Issues:0Issues:0

distortion-generator

Neural network for creating distortion while keeping embeddings as close as possible

Language:PythonLicense:Apache-2.0Stargazers:18Issues:3Issues:0