Phan Hoang (huyhoang17)

huyhoang17

Geek Repo

Company:HUST

Location:Hanoi, Vietnam

Home Page:https://viblo.asia/u/phanhoang

Twitter:@__phanhoang__

Github PK Tool:Github PK Tool

Phan Hoang's starred repositories

flux

Official inference repo for FLUX.1 models

Language:PythonLicense:Apache-2.0Stargazers:13866Issues:121Issues:122

litellm

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Language:PythonLicense:NOASSERTIONStargazers:12323Issues:67Issues:3093

linkedIn_auto_jobs_applier_with_AI

LinkedIn_AIHawk is a tool that automates the jobs application process on LinkedIn. Utilizing artificial intelligence, it enables users to apply for multiple job offers in an automated and personalized way.

Language:PythonLicense:MITStargazers:12144Issues:82Issues:263

MinerU

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

Language:PythonLicense:AGPL-3.0Stargazers:11428Issues:68Issues:385

RAG_Techniques

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6383Issues:74Issues:13

PDF-Extract-Kit

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Language:PythonLicense:AGPL-3.0Stargazers:4797Issues:30Issues:97

MindSearch

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

Language:PythonLicense:Apache-2.0Stargazers:4639Issues:34Issues:131

vispy

Main repository for Vispy

Language:PythonLicense:NOASSERTIONStargazers:3295Issues:115Issues:1437

Liger-Kernel

Efficient Triton Kernels for LLM Training

Language:PythonLicense:BSD-2-ClauseStargazers:2972Issues:35Issues:71

sports

computer vision and sports

Language:PythonLicense:MITStargazers:2275Issues:46Issues:18

Qwen2-VL

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Language:PythonLicense:Apache-2.0Stargazers:2086Issues:21Issues:187

awesome-llm-json

Resource list for generating JSON using LLMs via function calling, tools, CFG. Libraries, Models, Notebooks, etc.

textgrad

TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.

Language:PythonLicense:MITStargazers:1565Issues:22Issues:66

AdalFlow

AdalFlow: The library to build & auto-optimize any LLM tasks.

Language:PythonLicense:MITStargazers:1333Issues:17Issues:27

BiRefNet

[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation

Language:PythonLicense:MITStargazers:1029Issues:12Issues:59

Show-o

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Language:PythonLicense:Apache-2.0Stargazers:830Issues:0Issues:0

TexTeller

TexTeller can convert image to latex formulas (image2latex, latex OCR) with higher accuracy and exhibits superior generalization ability, enabling it to cover most usage scenarios.

Language:PythonLicense:Apache-2.0Stargazers:304Issues:4Issues:14

EVE

EVE: Encoder-Free Vision-Language Models

Language:PythonLicense:MITStargazers:208Issues:8Issues:14

Hi-SAM

[arXiv preprint] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation

Language:PythonLicense:Apache-2.0Stargazers:189Issues:12Issues:18

UniMERNet

UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition

Language:PythonLicense:Apache-2.0Stargazers:165Issues:9Issues:19

uvtrick

A fun party trick to run Python code from another venv into this one.

Language:PythonLicense:MITStargazers:129Issues:1Issues:3

DocGenome

DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models

Language:Jupyter NotebookLicense:CC-BY-4.0Stargazers:94Issues:5Issues:4

CounTR

CounTR: Transformer-based Generalised Visual Counting

Language:PythonLicense:MITStargazers:93Issues:6Issues:43

StructEqTable-Deploy

A High-efficiency Open-source Toolkit for Table-to-Latex Task

Language:PythonLicense:Apache-2.0Stargazers:90Issues:5Issues:7

helibrunna

A HuggingFace compatible xLSTM trainer.

Language:PythonLicense:AGPL-3.0Stargazers:60Issues:0Issues:0
Language:PythonStargazers:55Issues:0Issues:0

DTrOCR

A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical Character Recognition

Language:PythonLicense:MITStargazers:46Issues:0Issues:0

NAF-DPM

NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement

Language:PythonLicense:MITStargazers:28Issues:1Issues:6

SEMv3

The official PyTorch implementation of SEMv3.

Language:PythonLicense:Apache-2.0Stargazers:21Issues:2Issues:2

SimChart9K

The proposed simulated dataset consisting of 9,536 charts and associated data annotations in CSV format.