Beast code in Giters

watayuki's starred repositories

pdf-document-layout-analysis

A Docker-powered service for PDF document layout analysis. It segments and classifies the parts of PDF pages, identifying elements such as text, titles, pictures, and tables.

Language: Python | Apache-2.0 | 11500

deepface

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

Language: Python | MIT | 1195400

Awesome-Open-Vocabulary

(TPAMI 2024) A Survey on Open Vocabulary Learning

80000

magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Language: Python | BSD-3-Clause | 1039700

GoogleEarthVR-saved-renamer

Quick, dirty and portable tool for renaming saved places in Google Earth VR.

Language: C# | GPL-3.0 | 300

use-whisper

React hook for OpenAI Whisper with speech recorder, real-time transcription, and silence removal built-in

Language: TypeScript | MIT | 71000

shap-e

Generate 3D objects conditioned on text or images

Language: Python | MIT | 1157900

chatgpt-web

ChatGPT web interface using the OpenAI API

Language: Svelte | GPL-3.0 | 184900

nvitop

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

Language: Python | Apache-2.0 | 467200

sd-webui-controlnet

WebUI extension for ControlNet

Language: Python | GPL-3.0 | 1688400

clipseg

This repository contains the code of the CVPR 2022 paper "Image Segmentation Using Text and Image Prompts".

Language: Python | NOASSERTION | 110900

mmdet-rfla

ECCV22: RFLA

Language: Python | MIT | 25700

motion-diffusion-model

The official PyTorch implementation of the paper "Human Motion Diffusion Model"

Language: Python | MIT | 307800

stable-dreamfusion

Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.

Language: Python | Apache-2.0 | 819100

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Python | MIT | 6820700

stable-diffusion-webui-docker

Easy Docker setup for Stable Diffusion with user-friendly UI

Language: Shell | NOASSERTION | 663500

stablediffusion-infinity

Outpainting with Stable Diffusion on an infinite canvas

Language: Python | Apache-2.0 | 384400

MotionDiffuse

MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model

Language: Python | NOASSERTION | 83700

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language: Python | Apache-2.0 | 2535800

stable-diffusion-videos

Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts

Language: Python | Apache-2.0 | 441500

vis-network

💫 Display dynamic, automatically organised, customizable network views.

Language: JavaScript | Apache-2.0 | 301400

stable-diffusion

A latent text-to-image diffusion model

Language: Jupyter Notebook | NOASSERTION | 6772100

CLIP4CirDemo

[CVPR 2022 - Demo Track] - Effective conditioned and composed image retrieval combining CLIP-based features

Language: SCSS | 7100

MobileNeRF-Unity-Viewer

An unofficial Unity port of the MobileNeRF viewer

Language: C# | MIT | 39400

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Language: Python | Apache-2.0 | 13279700

yolov7

Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

Language: Jupyter Notebook | GPL-3.0 | 1326000

SpaceNet8

Algorithmic baseline for SpaceNet 8 Challenge

Language: Python | Apache-2.0 | 8000

nice-slam

[CVPR'22] NICE-SLAM: Neural Implicit Scalable Encoding for SLAM

Language: Python | Apache-2.0 | 141800

NeuralRecon

Code for "NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video", CVPR 2021 oral

Language: Python | Apache-2.0 | 204000

superframe

📦 A super collection of A-Frame components.

Language: JavaScript | MIT | 136700