Beast code in Giters

免费的ChatGPT API的安卓语音助手，可用音量键唤起并进行语音交流，支持联网、Vision拍照识图、提问模板等功能 | A free ChatGPT API voice assistant for Android, activated via volume keys for voice interaction, supporting features such as network connectivity, Vision photo recognition, and question templates.

Language:JavaGPL-3.0000

gpt-pilot

Dev tool that writes scalable apps from scratch while the developer oversees the implementation

Language:PythonMIT000

groundingLMM

Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

000

InternLM-XComposer

InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.

Language:Python000

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源模型

MIT000

jepa

PyTorch code and models for V-JEPA self-supervised learning from video.

Language:PythonNOASSERTION000

MetaGPT

🌟 The Multi-Agent Framework: Given one line Requirement, return PRD, Design, Tasks, Repo

Language:PythonMIT000

MiniGPT4Qwen

Cleaned Lavis + DeepSpeed Support! Align MiniGPT4 with Qwen-Chat LLM. I just use 18.8k high-quality instruction-tuning data(Bi-lingual, from minigpt4 and llava). Just fine-tune the projection layer.

000

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonApache-2.0000

PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Apache-2.0000

RoadVision

Revolutionizing navigation with AR and MapKit integration, this iOS app offers immersive, real-time directions and customizable UI for an intuitive experience. #iOSDevelopment #AugmentedReality #MapKit #SwiftUI #Innovation

MIT000

SAM_gDINO_AutoLabeling

Auto Segmentation label generation with SAM (Segment Anything) + Grounding DINO

Apache-2.0000

SAMJS

Language:TypeScriptMIT000

screenshot-to-code

Drop in a screenshot and convert it to clean HTML/Tailwind/JS code

Language:TypeScriptMIT010

Segment-and-Track-Anything

An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.

Language:Jupyter NotebookAGPL-3.0000

tfgbestneal

tfgbestneal's repositories

MobileVLM

agentsflow

ComfyUI

DetectSegPlatform

donkeycar

DriveLM

Focal_TSMP

Gemini

gpt-assistant-android

gpt-pilot

groundingLMM

InternLM-XComposer

InternVL

jepa

MetaGPT

MiniGPT4Qwen

Open-Sora

PaddleSpeech

RoadVision

SAM_gDINO_AutoLabeling

SAMJS

screenshot-to-code

Segment-and-Track-Anything

Segment-Everything-Everywhere-All-At-Once

SegmentAnything3D

SpeechAgents

tracking_ros

U-2-Net

X-AnyLabeling

YOLOV8_SAM