bytemaster's starred repositories

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Python | License: MIT | Stargazers: 63436 | Issues: 0

faster-whisper

Faster Whisper transcription with CTranslate2

Language: Python | License: MIT | Stargazers: 9850 | Issues: 0

Facial-Expression-Recognition.Pytorch

A CNN-based PyTorch implementation of facial expression recognition (FER2013 and CK+), achieving 73.112% (state-of-the-art) on FER2013 and 94.64% on the CK+ dataset

Language: Python | License: MIT | Stargazers: 1753 | Issues: 0

Unitree_Tutorials

Unitree robot embodied-intelligence tutorials, by 同济子豪兄 (Tongji Zihao)

Stargazers: 12 | Issues: 0

VILA

VILA - A multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Language: Python | License: Apache-2.0 | Stargazers: 200 | Issues: 0

Driving-with-LLMs

PyTorch implementation for the paper "Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous Driving"

Language: Python | License: Apache-2.0 | Stargazers: 350 | Issues: 0

llama.cpp

LLM inference in C/C++

Language: C++ | License: MIT | Stargazers: 60561 | Issues: 0

LLaMA2-Accessory

An Open-source Toolkit for LLM Development

Language: Python | License: NOASSERTION | Stargazers: 2594 | Issues: 0

carla_apollo_bridge

This project aims to provide a data and control bridge for the communication between the latest version of Apollo and Carla.

Language: Python | License: Apache-2.0 | Stargazers: 306 | Issues: 0

RCBEVDet

[CVPR 2024] RCBEVDet: Radar-camera Fusion in Bird’s Eye View for 3D Object Detection

Stargazers: 49 | Issues: 0
Language: Python | Stargazers: 811 | Issues: 0

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Language: Python | License: Apache-2.0 | Stargazers: 7464 | Issues: 0

Phi-3CookBook

This is a cookbook for getting started with Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and the next size up across a variety of language, reasoning, coding, and math benchmarks.

Language: Jupyter Notebook | License: MIT | Stargazers: 1008 | Issues: 0

InternVideo

Video Foundation Models & Data for Multimodal Understanding

Language: Python | License: Apache-2.0 | Stargazers: 1075 | Issues: 0

X-AnyLabeling

Effortless data labeling with AI support from Segment Anything and other awesome models.

Language: Python | License: GPL-3.0 | Stargazers: 2955 | Issues: 0

Awesome-Multimodal-Large-Language-Models

✨✨ Latest Advances on Multimodal Large Language Models

Stargazers: 10142 | Issues: 0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python | License: Apache-2.0 | Stargazers: 17616 | Issues: 0

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. A commercially usable open-source multimodal chat model with performance approaching GPT-4V.

Language: Python | License: MIT | Stargazers: 3649 | Issues: 0

Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Language: Shell | Stargazers: 5392 | Issues: 0

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language: Python | License: Apache-2.0 | Stargazers: 12220 | Issues: 0

llama3

The official Meta Llama 3 GitHub site

Language: Python | License: NOASSERTION | Stargazers: 22303 | Issues: 0

gpt4free

The official gpt4free repository | a varied collection of powerful language models

Language: Python | License: GPL-3.0 | Stargazers: 58876 | Issues: 0

chat-with-mlx

An all-in-one LLMs Chat UI for Apple Silicon Mac using MLX Framework.

Language: Python | License: MIT | Stargazers: 1337 | Issues: 0
Stargazers: 339 | Issues: 0

json

JSON for Modern C++

Language: C++ | License: MIT | Stargazers: 41001 | Issues: 0

FinGPT

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.

Language: Jupyter Notebook | License: MIT | Stargazers: 12459 | Issues: 0

muduo

Event-driven network library for multi-threaded Linux server in C++11

Language: C++ | License: NOASSERTION | Stargazers: 14438 | Issues: 0

U-2-Net

The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."

Language: Python | License: Apache-2.0 | Stargazers: 8269 | Issues: 0

DINOv

[CVPR 2024] Official implementation of the paper "Visual In-context Learning"

Language: Python | Stargazers: 303 | Issues: 0

Mr.-Ranedeer-AI-Tutor

A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.

Stargazers: 27736 | Issues: 0