bytemaster's starred repositories

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:66723Issues:559Issues:0

llama.cpp

LLM inference in C/C++

gpt4free

The official gpt4free repository | various collection of powerful language models

Language:PythonLicense:GPL-3.0Stargazers:59790Issues:463Issues:1314

json

JSON for Modern C++

Mr.-Ranedeer-AI-Tutor

A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:25787Issues:212Issues:229

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:18951Issues:158Issues:1458

muduo

Event-driven network library for multi-threaded Linux server in C++11

Language:C++License:NOASSERTIONStargazers:14628Issues:779Issues:374

FinGPT

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.

Language:Jupyter NotebookLicense:MITStargazers:13299Issues:249Issues:105

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language:PythonLicense:Apache-2.0Stargazers:13109Issues:98Issues:1039

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Language:PythonLicense:Apache-2.0Stargazers:11205Issues:96Issues:463

faster-whisper

Faster Whisper transcription with CTranslate2

Language:PythonLicense:MITStargazers:11096Issues:122Issues:658

U-2-Net

The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."

Language:PythonLicense:Apache-2.0Stargazers:8443Issues:142Issues:339

Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Language:PythonLicense:MITStargazers:5263Issues:49Issues:490

X-AnyLabeling

Effortless data labeling with AI support from Segment Anything and other awesome models.

Language:PythonLicense:GPL-3.0Stargazers:3488Issues:30Issues:569

LLaMA2-Accessory

An Open-source Toolkit for LLM Development

Language:PythonLicense:NOASSERTIONStargazers:2670Issues:37Issues:134
Language:PythonLicense:Apache-2.0Stargazers:2159Issues:32Issues:171

Phi-3CookBook

This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks.

Language:Jupyter NotebookLicense:MITStargazers:2058Issues:15Issues:55

Facial-Expression-Recognition.Pytorch

A CNN based pytorch implementation on facial expression recognition (FER2013 and CK+), achieving 73.112% (state-of-the-art) in FER2013 and 94.64% in CK+ dataset

Language:PythonLicense:MITStargazers:1775Issues:31Issues:142

VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Language:PythonLicense:Apache-2.0Stargazers:1667Issues:27Issues:109

chat-with-mlx

An all-in-one LLMs Chat UI for Apple Silicon Mac using MLX Framework.

Language:PythonLicense:MITStargazers:1447Issues:12Issues:50

InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Language:PythonLicense:Apache-2.0Stargazers:1265Issues:29Issues:149

Driving-with-LLMs

PyTorch implementation for the paper "Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous Driving"

Language:PythonLicense:Apache-2.0Stargazers:397Issues:16Issues:24

DINOv

[CVPR 2024] Official implementation of the paper "Visual In-context Learning"

carla_apollo_bridge

This project aims to provide a data and control bridge for the communication between the latest version of Apollo and Carla.

Language:PythonLicense:Apache-2.0Stargazers:318Issues:6Issues:139

RCBEVDet

[CVPR 2024] RCBEVDet: Radar-camera Fusion in Bird’s Eye View for 3D Object Detection

Unitree_Tutorials

宇树机器人具身智能教程-同济子豪兄