Roy Hermann's starred repositories

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Language:PythonLicense:MITStargazers:162909Issues:1557Issues:2224

gpt-engineer

Specify what you want it to build, the AI asks for clarification, and then builds it.

Language:PythonLicense:MITStargazers:51006Issues:502Issues:462

MetaGPT

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Language:PythonLicense:MITStargazers:40775Issues:870Issues:542

bark

πŸ”Š Text-Prompted Generative Audio Model

Language:Jupyter NotebookLicense:MITStargazers:33343Issues:309Issues:418

ml-stable-diffusion

Stable Diffusion with Core ML on Apple Silicon

Language:PythonLicense:MITStargazers:16346Issues:140Issues:232

FLEX

An in-app debugging and exploration tool for iOS

Language:Objective-CLicense:NOASSERTIONStargazers:13904Issues:384Issues:392

gaussian-splatting

Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"

Language:PythonLicense:NOASSERTIONStargazers:12079Issues:107Issues:784

magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Language:PythonLicense:BSD-3-ClauseStargazers:10024Issues:103Issues:139

point-e

Point cloud diffusion for 3D model synthesis

Language:PythonLicense:MITStargazers:6370Issues:225Issues:84

ffmpeg-kit

FFmpeg Kit for applications. Supports Android, Flutter, iOS, Linux, macOS, React Native and tvOS. Supersedes MobileFFmpeg, flutter_ffmpeg and react-native-ffmpeg.

Language:CLicense:LGPL-3.0Stargazers:3917Issues:52Issues:830

OAuthSwift

Swift based OAuth library for iOS

Language:SwiftLicense:MITStargazers:3237Issues:91Issues:466

Dreambooth-Stable-Diffusion

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focused on training faces, objects, and styles.

Language:Jupyter NotebookLicense:MITStargazers:3170Issues:39Issues:107

purchases-ios

In-app purchases and subscriptions made easy. Support for iOS, watchOS, tvOS, macOS, and visionOS.

Language:SwiftLicense:MITStargazers:2179Issues:25Issues:487

capture-website

Capture screenshots of websites

Language:JavaScriptLicense:MITStargazers:1891Issues:12Issues:76

audio-ai-timeline

A timeline of the latest AI models for audio generation, starting in 2023!

kangas

🦘 Explore multimedia datasets at scale

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1032Issues:11Issues:15

vid2densepose

Convert your videos to densepose and use it on MagicAnimate

Language:PythonLicense:MITStargazers:944Issues:12Issues:14

latent-nerf

Official Implementation for "Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures"

Language:PythonLicense:MITStargazers:685Issues:36Issues:24

whispering

Streaming transcriber with whisper

Language:PythonLicense:MITStargazers:680Issues:19Issues:41

ChatAnything

Official Repo for the Paper: CHATANYTHING: FACETIME CHAT WITH LLM-ENHANCED PERSONAS

Vega

video editor

Language:TypeScriptLicense:NOASSERTIONStargazers:244Issues:9Issues:23

phenaki

A phenaki reproduction using pytorch.

giffusion

Create GIFs and Videos using Stable Diffusion

async-plus

β›“ A chainable interface for Swift's async/await.

Language:SwiftLicense:MITStargazers:187Issues:1Issues:9

NextMind

Documentation (incl. tutorials, Unity assets and API reference) for the NextMind SDK

InternetOfAgents

Build your Swarm of Internet Agents using MultiOn πŸš€

Language:PythonStargazers:75Issues:3Issues:0

CoreAnimationCode

Code examples of the book "iOS Core Animation Advanced Techniques"

Language:Objective-CStargazers:56Issues:4Issues:0