watayuki

watayuki

Geek Repo

0

followers

0

following

Github PK Tool:Github PK Tool

watayuki's starred repositories

pdf-document-layout-analysis

A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The service allows for the segmentation and classification of different parts of PDF pages, identifying the elements such as texts, titles, pictures, tables and so on.

Language:PythonLicense:Apache-2.0Stargazers:115Issues:0Issues:0

deepface

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

Language:PythonLicense:MITStargazers:11954Issues:0Issues:0

Awesome-Open-Vocabulary

(TPAMI 2024) A Survey on Open Vocabulary Learning

Stargazers:800Issues:0Issues:0

magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Language:PythonLicense:BSD-3-ClauseStargazers:10397Issues:0Issues:0

GoogleEarthVR-saved-renamer

Quick, dirty and portable tool for renaming saved places in Google Earth VR.

Language:C#License:GPL-3.0Stargazers:3Issues:0Issues:0

use-whisper

React hook for OpenAI Whisper with speech recorder, real-time transcription, and silence removal built-in

Language:TypeScriptLicense:MITStargazers:710Issues:0Issues:0

shap-e

Generate 3D objects conditioned on text or images

Language:PythonLicense:MITStargazers:11579Issues:0Issues:0

chatgpt-web

ChatGPT web interface using the OpenAI API

Language:SvelteLicense:GPL-3.0Stargazers:1849Issues:0Issues:0

nvitop

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

Language:PythonLicense:Apache-2.0Stargazers:4672Issues:0Issues:0

sd-webui-controlnet

WebUI extension for ControlNet

Language:PythonLicense:GPL-3.0Stargazers:16884Issues:0Issues:0

clipseg

This repository contains the code of the CVPR 2022 paper "Image Segmentation Using Text and Image Prompts".

Language:PythonLicense:NOASSERTIONStargazers:1109Issues:0Issues:0

mmdet-rfla

ECCV22: RFLA

Language:PythonLicense:MITStargazers:257Issues:0Issues:0

motion-diffusion-model

The official PyTorch implementation of the paper "Human Motion Diffusion Model"

Language:PythonLicense:MITStargazers:3078Issues:0Issues:0

stable-dreamfusion

Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.

Language:PythonLicense:Apache-2.0Stargazers:8191Issues:0Issues:0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:68207Issues:0Issues:0

stable-diffusion-webui-docker

Easy Docker setup for Stable Diffusion with user-friendly UI

Language:ShellLicense:NOASSERTIONStargazers:6635Issues:0Issues:0

stablediffusion-infinity

Outpainting with Stable Diffusion on an infinite canvas

Language:PythonLicense:Apache-2.0Stargazers:3844Issues:0Issues:0

MotionDiffuse

MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model

Language:PythonLicense:NOASSERTIONStargazers:837Issues:0Issues:0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonLicense:Apache-2.0Stargazers:25358Issues:0Issues:0

stable-diffusion-videos

Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts

Language:PythonLicense:Apache-2.0Stargazers:4415Issues:0Issues:0

vis-network

:dizzy: Display dynamic, automatically organised, customizable network views.

Language:JavaScriptLicense:Apache-2.0Stargazers:3014Issues:0Issues:0

stable-diffusion

A latent text-to-image diffusion model

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:67721Issues:0Issues:0

CLIP4CirDemo

[CVPR 2022 - Demo Track] - Effective conditioned and composed image retrieval combining CLIP-based features

Language:SCSSStargazers:71Issues:0Issues:0

MobileNeRF-Unity-Viewer

An unofficial Unity port of the MobileNeRF viewer

Language:C#License:MITStargazers:394Issues:0Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:132797Issues:0Issues:0

yolov7

Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:13260Issues:0Issues:0

SpaceNet8

Algorithmic baseline for SpaceNet 8 Challenge

Language:PythonLicense:Apache-2.0Stargazers:80Issues:0Issues:0

nice-slam

[CVPR'22] NICE-SLAM: Neural Implicit Scalable Encoding for SLAM

Language:PythonLicense:Apache-2.0Stargazers:1418Issues:0Issues:0

NeuralRecon

Code for "NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video", CVPR 2021 oral

Language:PythonLicense:Apache-2.0Stargazers:2040Issues:0Issues:0

superframe

:package: A super collection of A-Frame components.

Language:JavaScriptLicense:MITStargazers:1367Issues:0Issues:0