SIGMIND

Company: Sigmind Limited

Location: Dhaka, Bangladesh

Home Page: https://sigmind.ai

Twitter: @sigmindAI

SIGMIND's starred repositories

llama.cpp

LLM inference in C/C++

private-gpt

Interact with your documents using the power of GPT, 100% privately, no data leaks

Language: Python | License: Apache-2.0 | Stargazers: 53526 | Issues: 457 | Issues: 1159

grok-1

Grok open release

Language: Python | License: Apache-2.0 | Stargazers: 49411 | Issues: 562 | Issues: 209

Deep-Live-Cam

real time face swap and one-click video deepfake with only a single image

Language: Python | License: AGPL-3.0 | Stargazers: 34588 | Issues: 197 | Issues: 400

ControlNet

Let us control diffusion models!

Language: Python | License: Apache-2.0 | Stargazers: 29712 | Issues: 218 | Issues: 540

OpenVoice

Instant voice cloning by MIT and MyShell.

Language: Python | License: MIT | Stargazers: 28326 | Issues: 212 | Issues: 229

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python | License: Apache-2.0 | Stargazers: 19219 | Issues: 158 | Issues: 1477

piper

A fast, local neural text to speech system

Language: C++ | License: MIT | Stargazers: 5741 | Issues: 72 | Issues: 444

metavoice-src

Foundational model for human-like, expressive TTS

Language: Python | License: Apache-2.0 | Stargazers: 3709 | Issues: 77 | Issues: 123

face.evoLVe

🔥🔥High-Performance Face Recognition Library on PaddlePaddle & PyTorch🔥🔥

Language: Python | License: MIT | Stargazers: 3414 | Issues: 111 | Issues: 187

Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Language: Python | License: BSD-3-Clause | Stargazers: 2701 | Issues: 32 | Issues: 155

jepa

PyTorch code and models for V-JEPA self-supervised learning from video.

Language: Python | License: NOASSERTION | Stargazers: 2607 | Issues: 36 | Issues: 52

GenerativeAIExamples

Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

Language: Python | License: Apache-2.0 | Stargazers: 2118 | Issues: 55 | Issues: 41

QualityScaler

QualityScaler - image/video AI upscaler app

Language: Python | License: MIT | Stargazers: 1969 | Issues: 31 | Issues: 92

VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Language: Python | License: Apache-2.0 | Stargazers: 1761 | Issues: 26 | Issues: 118

Awesome-LLMs-for-Video-Understanding

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

Video-ChatGPT

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

Language: Python | License: CC-BY-4.0 | Stargazers: 1149 | Issues: 15 | Issues: 119

oterm

a text-based terminal client for Ollama

Language: Python | License: MIT | Stargazers: 959 | Issues: 10 | Issues: 65

PLLaVA

Official repository for the paper PLLaVA

MiniGPT4-video

Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding

Language: Python | License: BSD-3-Clause | Stargazers: 531 | Issues: 12 | Issues: 36

JetsonGPIO

A C++ library that enables the use of Jetson's GPIOs

Language: C++ | License: MIT | Stargazers: 277 | Issues: 6 | Issues: 70

awesome-large-action-model

Awesome Large Action Model (LAM): Models that could help get things done.

OnvifDeviceManager

Onvif Device Manager for Linux

Language: C | License: GPL-3.0 | Stargazers: 83 | Issues: 5 | Issues: 25

JETGPIO

C library to manage the GPIO header of the Nvidia Jetson boards

Language: C | License: MIT | Stargazers: 73 | Issues: 6 | Issues: 19

vlm-rlaif

ACL'24 (Oral) Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback

Language: Python | License: Apache-2.0 | Stargazers: 36 | Issues: 3 | Issues: 3

mmj_genai

A reference example for integrating NanoOwl with Metropolis Microservices for Jetson

Language: Python | License: Apache-2.0 | Stargazers: 25 | Issues: 4 | Issues: 3

trt-llm-rag-linux

A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Linux using TensorRT-LLM

Language: Python | License: NOASSERTION | Stargazers: 19 | Issues: 3 | Issues: 6

Multimodal-RAG-on-Jetson

This project has implemented the RAG function on Jetson with video formats.

Language: Python | License: MIT | Stargazers: 6 | Issues: 1 | Issues: 0

Okkhor-Diffusion

Okkhor-Diffusion: Bangla Handwritten Character Generation using DDPM

Language: Python | Stargazers: 5 | Issues: 0 | Issues: 0

linux-copilot

A simple co-pilot for Linux to interpret human language queries into useful Linux terminal commands and execute them

Language: Python | License: MIT | Stargazers: 4 | Issues: 1 | Issues: 0