rhinojosa's repositories
attention_sinks
Extend existing LLMs way beyond the original training length with constant memory usage, and without retraining
autogen
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
autollm
Ship RAG based LLM web apps in seconds.
awesome-openai-vision-api-experiments
Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥
azure-open-ai-embeddings-qna
A simple web application for a OpenAI-enabled document search. This repo uses Azure OpenAI Service for creating embeddings vectors from documents. For answering the question of a user, it retrieves the most relevant document and then uses GPT-3, GPT-3.5 or GPT-4 to extract the matching answer for the question.
business-process-automation
Business process automation solution accelerator using Azure services
Clustering-with-LLM
A customer segmentation project can be approached in multiple ways. In this repository, we will explore advanced techniques for defining clusters and analyzing the results.
cwd-benchmark-data
Data for the Chat With Your Data benchmark.
DemoFusion
Let us democratise high-resolution generation! (arXiv 2023)
dreamgaussian
Generative Gaussian Splatting for Efficient 3D Content Creation
Eureka
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models"
generative-ai-for-beginners
12 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
gpt-crawler
Crawl a site to generate knowledge files to create your own custom GPT from a URL
kani
kani (カニ) is a highly hackable microframework for chat-based language models with tool usage/function calling.
LLaVA
Visual Instruction Tuning: Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.
LLaVA-Plus-Codebase
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
LLM-Finetuning
LLM Finetuning with peft
lm-hackers
Hackers' Guide to Language Models
multimodal-maestro
Effective prompting for Large Multimodal Models like GPT-4 Vision or LLaVA. 🔥
narrator
David Attenborough narrates your life
norfair
Lightweight Python library for adding real-time multi-object tracking to any detector.
rags
Build ChatGPT over your data, all with natural language
screenshot-to-code
Drop in a screenshot and convert it to clean HTML/Tailwind/JS code
TotalSegmentator
Tool for robust segmentation of >100 important anatomical structures in CT images
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
vid2avatar
Vid2Avatar: 3D Avatar Reconstruction from Videos in the Wild via Self-supervised Scene Decomposition (CVPR2023)
Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
webcamGPT
webcamGPT - chat with video stream 💬 + 📸
Yi
A series of large language models trained from scratch by developers @01-ai