maxgreco's starred repositories

multimodal-maestro

Effective prompting for Large Multimodal Models like GPT-4 Vision, LLaVA or CogVLM. 🔥

Language: Python | License: MIT | Stargazers: 1005 | Issues: 0

shiny-standalone-webr-demo

Demonstration of using a JavaScript ServiceWorker to communicate with a running Shiny/httpuv session in webR.

Language: JavaScript | License: MIT | Stargazers: 59 | Issues: 0

webr.bundle

Bundle Shiny Applications for serving with WebR.

Language: Rust | Stargazers: 26 | Issues: 0

dsnote

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.

Language: C++ | License: MPL-2.0 | Stargazers: 396 | Issues: 0

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supports a number of candidate inference solutions such as HF TGI and VLLM for local or cloud deployment. Includes demo apps that showcase Meta Llama3 for WhatsApp & Messenger.

Language: Jupyter Notebook | Stargazers: 10524 | Issues: 0

hydrus

A personal booru-style media tagger that can import files and tags from your hard drive and popular websites. Content can be shared with other users via user-run servers.

Language: Python | License: NOASSERTION | Stargazers: 2278 | Issues: 0

LLMLingua

Compresses the prompt and KV-Cache to speed up LLM inference and improve the model's perception of key information, achieving up to 20x compression with minimal performance loss.

Language: Python | License: MIT | Stargazers: 4238 | Issues: 0

machinascript-for-robots

Build LLM-powered robots in your garage with MachinaScript For Robots!

Language: Python | License: Apache-2.0 | Stargazers: 151 | Issues: 0

MoE-LLaVA

Mixture-of-Experts for Large Vision-Language Models

Language: Python | License: Apache-2.0 | Stargazers: 1850 | Issues: 0

RWKV-Runner

An RWKV management and startup tool: fully automated and only 8MB. It also provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for commercial use.

Language: TypeScript | License: MIT | Stargazers: 4838 | Issues: 0
Language: Python | License: MIT | Stargazers: 4265 | Issues: 0

TensorFlow-CUDA-Windows-Installation-Guide

TensorFlow 2 with GPU on Windows: Step-by-step instructions on how to install CUDA and cuDNN on Windows to use TensorFlow with GPU support

License: Apache-2.0 | Stargazers: 3 | Issues: 0

CUDA-Install-Guide

Installation guide for NVIDIA driver, CUDA, cuDNN and TensorRT

Stargazers: 12 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 681 | Issues: 0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python | License: Apache-2.0 | Stargazers: 18249 | Issues: 0

Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Language: Python | License: Apache-2.0 | Stargazers: 2619 | Issues: 0

surya

OCR, layout analysis, reading order, line detection in 90+ languages

Language: Python | License: GPL-3.0 | Stargazers: 9245 | Issues: 0

jan

Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM)

Language: TypeScript | License: AGPL-3.0 | Stargazers: 20981 | Issues: 0

wslcompact

Compacts the size of the ever-growing WSL vhdx images.

Language: PowerShell | License: GPL-3.0 | Stargazers: 690 | Issues: 0

open-interpreter

A natural language interface for computers

Language: Python | License: AGPL-3.0 | Stargazers: 50963 | Issues: 0

ColBERT

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Language: Python | License: MIT | Stargazers: 2715 | Issues: 0

Video-LLaVA

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Language: Python | License: Apache-2.0 | Stargazers: 2711 | Issues: 0

VCoder

VCoder: Versatile Vision Encoders for Multimodal Large Language Models, arXiv 2023 / CVPR 2024

Language: Python | License: Apache-2.0 | Stargazers: 249 | Issues: 0

crewAI

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

Language: Python | License: MIT | Stargazers: 17423 | Issues: 0

unsloth

Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language: Python | License: Apache-2.0 | Stargazers: 12823 | Issues: 0

mamba-chat

Mamba-Chat: A chat LLM based on the state-space model architecture 🐍

Language: Python | License: Apache-2.0 | Stargazers: 879 | Issues: 0

PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

Language: C++ | License: MIT | Stargazers: 7671 | Issues: 0

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language: Python | License: Apache-2.0 | Stargazers: 7362 | Issues: 0

txtai

💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows

Language: Python | License: Apache-2.0 | Stargazers: 7604 | Issues: 0
Language: Python | License: AGPL-3.0 | Stargazers: 3438 | Issues: 0