tbergman's repositories
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
AgentGPT
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
ai-comic-factory
Generate comic panels using an LLM + SDXL. Powered by Hugging Face 🤗
ai-diagram-generator
An AI diagram generator that builds a diagram from a given topic and streams partial diagrams parsed from the incomplete JSON during generation. Built using LlamaIndex and the Vercel AI SDK.
aws-lambda-tesseract-layer
A layer for AWS Lambda containing the tesseract C libraries and tesseract executable.
cog-mvdream-multiview
Cog wrapper for Multi-View Image Generation with MVDream
CopilotKit
A framework for building custom AI Copilots 🤖: in-app AI chatbots, in-app AI agents, and AI-powered textareas.
cypress-realworld-app
A payment application to demonstrate real-world usage of Cypress testing methods, patterns, and workflows.
GPT-4V-Act
AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI
honeycomb
Create hex grids easily, in node or the browser.
morphic
An AI-powered answer engine with a generative UI
objaverse-xl
🪐 Objaverse-XL is a Universe of 10M+ 3D Objects. Contains API Scripts for Downloading and Processing!
Object-Detection-for-Graphical-User-Interface
Object Detection for Graphical User Interface: Old Fashioned or Deep Learning or a Combination?
OpenAdapt
AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models
roboflow-notebooks
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM.
sagemaker-annotation-converter
React/Next.js app to convert SageMaker annotation formats.
supervision
We write your reusable computer vision tools. 💜
tesseract-bbox-examples
Complex examples for tesseract.js that help users generate and export bbox data for detected words, crop individual images, etc.
UIED
An accurate GUI element detection approach based on old-fashioned CV algorithms [Upgraded on 5/July/2021]
waveformer
Text to waveform video using MusicGen
zero123
Zero-1-to-3: Zero-shot One Image to 3D Object: https://zero123.cs.columbia.edu/
zero123plus
Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.