varunshenoy / latentverse

Portal hopping with Stable Diffusion πŸ‘Ύ

Home Page:https://latentverseai.com

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Latentverse πŸ§‘β€πŸš€

Latentverse uses latent consistency models (LCMs) and a custom LoRA (360Redmond) to generate 360-degree equirectangular photos in seconds.

It's like Rick's portal gun, or Google Street View for your imagination.

example animation

With the right set up, you can achieve near realtime generation. With an A100 and 4 steps, each inference takes ~1s. On an A10G (as shown in the video below), you get ~3s.

Getting started

There are two parts of the project, the Next.js frontend app and the backend server hosted on Baseten.

You can deploy the frontend web app locally as follows:

  1. Clone this repo.
  2. cd into webapp
  3. Run npm install

Unfortunately, you can't run the backend locally quite yet! We use Baseten, a serverless GPU provider for ML applications. The deployment folder has a Truss that can be deployed on Baseten.

  1. Create an account on Baseten.
  2. Run pip install truss
  3. Run truss push deployment --publish. You may have to provide authentication credentials here.
  4. Go to your app's dashboard on Baseten and grab the endpoint that has been set up.
  5. Update the endpoint on line 10 in webapp/src/app/api/portal/route.ts.

The last step is creating a .env.local in webapp with the following Baseten credentials:

BASETEN_API_KEY=YOUR API KEY
BASETEN_ENDPOINT=YOUR API ENDPOINT (e.g. https://model-abcdef.api.baseten.co/production/predict)

You can now run npm run dev in webapp. By default, the web app should now be available at https://localhost:3000. Happy portal hopping!

rick and morty

Contributions and Licensing

LatentVerse was built by Varun Shenoy and David Song.

Feel free to provide any contributions you deem necessary or useful. This project is licensed under the MIT License.

About

Portal hopping with Stable Diffusion πŸ‘Ύ

https://latentverseai.com

License:MIT License


Languages

Language:JavaScript 99.6%Language:TypeScript 0.3%Language:Python 0.1%Language:CSS 0.0%