Wastoon

Company: XJTU

Location: Xi'an, China

Home Page: https://wastoon.github.io

Twitter: @Anonymous

Wastoon's starred repositories

espnet

End-to-End Speech Processing Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 8150 | Issues: 179 | Issues: 2333

clone-voice

A sound cloning tool with a web interface, using your voice or any sound to record audio

Language: Python | License: NOASSERTION | Stargazers: 6815 | Issues: 37 | Issues: 121

CogVLM

A state-of-the-art open visual language model | multimodal pre-trained model

Language: Python | License: Apache-2.0 | Stargazers: 5686 | Issues: 66 | Issues: 405

AppAgent

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Language: Python | License: MIT | Stargazers: 4682 | Issues: 61 | Issues: 78

Wonder3D

Single Image to 3D using Cross-Domain Diffusion for 3D Generation

Language: Python | License: AGPL-3.0 | Stargazers: 4545 | Issues: 48 | Issues: 165

drake

Model-based design and verification for robotics.

Language: C++ | License: NOASSERTION | Stargazers: 3192 | Issues: 174 | Issues: 6186

4K4D

[CVPR 2024] 4K4D: Real-Time 4D View Synthesis at 4K Resolution

Language: Python | License: NOASSERTION | Stargazers: 1504 | Issues: 88 | Issues: 41

GaussianSplats3D

Three.js-based implementation of 3D Gaussian splatting

Language: JavaScript | License: MIT | Stargazers: 1194 | Issues: 30 | Issues: 174

EPro-PnP

[CVPR 2022 Oral, Best Student Paper] EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation

Language: Python | License: Apache-2.0 | Stargazers: 1075 | Issues: 14 | Issues: 89

AnimateAnyone-unofficial

Unofficial Implementation of Animate Anyone

ai-comic-factory

Generate comic panels using an LLM + SDXL. Powered by Hugging Face 🤗

Language: TypeScript | License: Apache-2.0 | Stargazers: 909 | Issues: 11 | Issues: 9

mujoco_mpc

Real-time behaviour synthesis with MuJoCo, using Predictive Control

Language: C++ | License: Apache-2.0 | Stargazers: 905 | Issues: 25 | Issues: 87

g2opy

Python binding of SLAM graph optimization framework g2o

gaussian_splatting_notes

A detailed explanation of the formulae behind Gaussian splatting

SceneLib2

SceneLib2 is an open-source C++ library for SLAM originally designed and implemented by Professor Andrew Davison at Imperial College London.

Language: C++ | License: NOASSERTION | Stargazers: 354 | Issues: 42 | Issues: 19

BEV-Perception

Bird's Eye View Perception

Mini-DALLE3

Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models

ufomap

UFOMap: An Efficient Probabilistic 3D Mapping Framework That Embraces the Unknown

Language: C++ | License: BSD-2-Clause | Stargazers: 267 | Issues: 10 | Issues: 14

BVHView

A simple viewer for the .bvh animation file format written using raylib.

Language: C | License: MIT | Stargazers: 231 | Issues: 7 | Issues: 3

metnet3-pytorch

Implementation of MetNet-3, SOTA neural weather model out of Google Deepmind, in Pytorch

Language: Python | License: MIT | Stargazers: 194 | Issues: 5 | Issues: 3

VMA

A general map auto annotation framework based on MapTR, with high flexibility in terms of spatial scale and element type

Language: Python | License: MIT | Stargazers: 182 | Issues: 14 | Issues: 15

Language: Python | License: MIT | Stargazers: 172 | Issues: 5 | Issues: 0

imgfind

A tool for searching local images by text description, powered by Rust + candle + CLIP

autodocodec

Self(auto)-documenting encoders and decoders

Language: Haskell | License: MIT | Stargazers: 115 | Issues: 6 | Issues: 19

openslam_vertigo

vertigo repos from OpenSLAM.org

Language: C++ | Stargazers: 51 | Issues: 2 | Issues: 0

cmu_vla_challenge_unity

CMU Vision-Language-Autonomy Challenge - Unity Setup

Language: C++ | Stargazers: 42 | Issues: 0 | Issues: 0

dog_rl_deploy

Real-world deployment of reinforcement learning on a quadruped robot (Sim to Real)

Language: Python | License: Apache-2.0 | Stargazers: 16 | Issues: 0 | Issues: 0

DINO

The torch.hub release for AnyLoc (in development, do not use).

Language: Python | License: BSD-3-Clause | Stargazers: 3 | Issues: 2 | Issues: 0