Beast code in Giters

Brian Mount's starred repositories

loft

LOFT: A 1 Million+ Token Long-Context Benchmark

Language:PythonApache-2.011000

ScribeWizard

ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3

Language:PythonMIT37100

ml-mobileclip

This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" CVPR 2024

Language:PythonNOASSERTION51300

moondream

tiny vision language model

Language:Jupyter NotebookApache-2.0465900

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Language:PythonMIT354300

VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Language:PythonApache-2.0108500

gemma.cpp

lightweight, standalone C++ inference engine for Google's Gemma models.

Language:C++Apache-2.0584100

lag-llama

Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting

Language:PythonApache-2.0114000

Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Language:PythonNOASSERTION448000

opencap-core

Main OpenCap processing pipeline

Language:PythonApache-2.014300

antibioticsai

Supporting code for the paper "Discovery of a structural class of antibiotics with explainable deep learning"

Language:Jupyter NotebookMIT8100

coffee

Build and iterate on your UI 10x faster with AI - right from your own IDE ☕️

Language:PythonApache-2.0141000

Marigold

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Language:PythonApache-2.0208200

screenshot-to-code

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Language:PythonMIT5522700

spin-model-transformers

Physics-inspired transformer modules based on mean-field dynamics of vector-spin models in JAX

Language:PythonApache-2.03100

mamba

Mamba SSM architecture

Language:PythonApache-2.01201100

self-operating-computer

A framework to enable multimodal models to operate a computer.

Language:PythonMIT840200

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonApache-2.03599300

AnimateAnyone

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

Apache-2.01418000

visual_anagrams

Code for the paper "Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models"

Language:Jupyter NotebookMIT79500

scenic

Scenic: A Jax Library for Computer Vision Research and Beyond

Language:PythonApache-2.0318000

webarena

Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"

Language:PythonApache-2.065700

sqlcoder

SoTA LLM for converting natural language questions to SQL queries

Language:Jupyter NotebookApache-2.0318400

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonApache-2.01849600

awesome-openai-vision-api-experiments

Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥

Language:Python161000

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Language:PythonApache-2.0573100

realtime-bakllava

llama.cpp with BakLLaVA model describes what does it see

Language:Python37200

yarn

YaRN: Efficient Context Window Extension of Large Language Models

Language:PythonMIT127600

LLaVA-Interactive-Demo

Language:PythonApache-2.034100

Dynamic3DGaussians

Language:PythonNOASSERTION185200