Mark's repositories
52-technologies-in-2016
Let's learn a new technology every week. A new technology blog every Sunday in 2016.
shot-scraper
A command-line utility for taking automated screenshots of websites
spotify-playlist-archive
Daily snapshots of public Spotify playlists
stable-diffusion-webui
Stable Diffusion web UI
ControlNet
Let us control diffusion models!
ai-notes
Notes for software engineers getting up to speed on new AI developments. Serves as a datastore for https://latent.space writing and product brainstorming, with cleaned-up canonical references under the /Resources folder.
ArtSpew
An infinite number of monkeys randomly throwing paint at a canvas
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
aws-media-replay-engine
Media Replay Engine (MRE) is a framework to build automated video clipping and replay (highlight) generation pipelines for live and video-on-demand content.
bootstrap
The most popular HTML, CSS, and JavaScript framework for developing responsive, mobile-first projects on the web.
changedetection.io
The best and simplest free, open-source website change detection, restock monitoring, and notification service. Designed for simplicity: monitor which websites had a text change, for free. Also covers website defacement monitoring and price change and price drop notifications.
google-research
Google Research
gpt-crawler
Crawl a site to generate knowledge files to create your own custom GPT from a URL
Grounded-Segment-Anything
Marrying Grounding DINO with Segment Anything & Stable Diffusion & Tag2Text & BLIP & Whisper & ChatBot - Automatically Detect, Segment and Generate Anything with Image, Text, and Audio Inputs
imagen-pytorch
Implementation of Imagen, Google's Text-to-Image Neural Network, in PyTorch
kubric
A data generation pipeline for creating semi-realistic synthetic multi-object videos with rich annotations such as instance segmentation masks, depth maps, and optical flow.
ml-stable-diffusion
Stable Diffusion with Core ML on Apple Silicon
muzic
Muzic: Music Understanding and Generation with Artificial Intelligence
prolificdreamer
Official code of ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation (NeurIPS 2023 Spotlight)
Real-Time-Latent-Consistency-Model
Demo showcasing ~real-time Latent Consistency Model pipeline with Diffusers and a MJPEG stream server
roop
One-click face swap
SyntheticMediaGenerator
A deep learning-powered repository for generating personalized video content with user annotations. Utilizes state-of-the-art GANs to synthesize beautiful visuals.
text-generation-webui
A Gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, and GALACTICA.