Mr.Li (bobo0810)

bobo0810

Geek Repo

Company:North University of China

Location:Beijing

Github PK Tool:Github PK Tool

Mr.Li's starred repositories

hello-algo

《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing

Language:JavaLicense:NOASSERTIONStargazers:90610Issues:516Issues:202

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:18453Issues:158Issues:1423

AISystem

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:9864Issues:139Issues:31

yolov9

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Language:PythonLicense:GPL-3.0Stargazers:8672Issues:57Issues:479

Yi

A series of large language models trained from scratch by developers @01-ai

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:7525Issues:110Issues:291

LeetCode-Book

《剑指 Offer》 Python, Java, C++ 解题代码,LeetBook《图解算法数据结构》配套代码仓

Language:JavaLicense:NOASSERTIONStargazers:5569Issues:46Issues:7

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language:PythonLicense:NOASSERTIONStargazers:5194Issues:56Issues:975

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的可商用开源多模态对话模型

Language:PythonLicense:MITStargazers:4514Issues:45Issues:390

MiniCPM

MiniCPM-2B: An end-side LLM outperforming Llama2-13B.

Language:PythonLicense:Apache-2.0Stargazers:4479Issues:52Issues:143

YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Language:PythonLicense:GPL-3.0Stargazers:4076Issues:37Issues:397

sglang

SGLang is yet another fast serving framework for large language models and vision language models.

Language:PythonLicense:Apache-2.0Stargazers:3659Issues:40Issues:320

Video-LLaVA

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Language:PythonLicense:Apache-2.0Stargazers:2750Issues:26Issues:174

Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Language:PythonLicense:BSD-3-ClauseStargazers:2627Issues:32Issues:154

cv_note

记录cv算法工程师的成长之路,分享计算机视觉和模型压缩部署技术栈笔记。https://harleyszhang.github.io/cv_note/

Language:C++License:Apache-2.0Stargazers:2321Issues:30Issues:4

DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Language:PythonLicense:MITStargazers:1913Issues:18Issues:45

MoE-LLaVA

Mixture-of-Experts for Large Vision-Language Models

Language:PythonLicense:Apache-2.0Stargazers:1863Issues:23Issues:84
Language:PythonLicense:Apache-2.0Stargazers:1714Issues:123Issues:20

MobileVLM

Strong and Open Vision Language Assistant for Mobile Devices

Language:PythonLicense:Apache-2.0Stargazers:906Issues:21Issues:54

Bunny

A family of lightweight multimodal models.

Language:PythonLicense:Apache-2.0Stargazers:826Issues:19Issues:102

LLM-in-Vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

VLMEvalKit

Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 30+ benchmarks

Language:PythonLicense:Apache-2.0Stargazers:796Issues:12Issues:109

LanguageBind

【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

Language:PythonLicense:MITStargazers:644Issues:14Issues:52

MiniGPT4-video

Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding

Language:PythonLicense:BSD-3-ClauseStargazers:487Issues:12Issues:31

VisionLLaMA

VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks

ltu

Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".

ALLaVA

Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model

Language:PythonLicense:Apache-2.0Stargazers:229Issues:11Issues:11

NeteaseTVDemo

NeteaseTVDemo (Vibefy) - tvOS 客户端

Language:SwiftLicense:GPL-2.0Stargazers:215Issues:7Issues:14

DataOptim

A collection of visual instruction tuning datasets.

Language:PythonLicense:MITStargazers:73Issues:5Issues:0