XiangBaoSong's starred repositories

BAKU

Code for BAKU: An Efficient Transformer for Multi-Task Policy Learning

Language:PythonStargazers:57Issues:0Issues:0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:20869Issues:0Issues:0

lerobot

🤗 LeRobot: End-to-end Learning for Real-World Robotics in Pytorch

Language:PythonLicense:Apache-2.0Stargazers:4541Issues:0Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:670Issues:0Issues:0

NEKO

In Progress Implementation of GATO style Generalist Multimodal model capable of image, text, RL and Robotics tasks

Language:PythonLicense:GPL-3.0Stargazers:40Issues:0Issues:0

Gato-A-Generalist-Agent

Minimal code for A Generalist Agent

Language:PythonStargazers:34Issues:0Issues:0

gato

Unofficial Gato: A Generalist Agent

Language:PythonLicense:MITStargazers:191Issues:0Issues:0

DiffusionDet

[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)

Language:PythonLicense:NOASSERTIONStargazers:2041Issues:0Issues:0

mvp

Masked Visual Pre-training for Robotics

Language:PythonStargazers:204Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:7029Issues:0Issues:0

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonLicense:Apache-2.0Stargazers:7391Issues:0Issues:0

home-robot

Mobile manipulation research tools for roboticists

Language:PythonLicense:MITStargazers:827Issues:0Issues:0

vlmaps

[ICRA2023] Implementation of Visual Language Maps for Robot Navigation

Language:PythonLicense:MITStargazers:326Issues:0Issues:0

visualnav-transformer

Official code and checkpoint release for mobile robot foundation models: GNM, ViNT, and NoMaD.

Language:PythonLicense:MITStargazers:456Issues:0Issues:0

Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Language:PythonLicense:NOASSERTIONStargazers:4410Issues:0Issues:0

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:54470Issues:0Issues:0

SLBR-Visible-Watermark-Removal

[ACM MM 2021] Visible Watermark Removal via Self-calibrated Localization and Background Refinement

Language:PythonStargazers:197Issues:0Issues:0

XMem

[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model

Language:PythonLicense:MITStargazers:1670Issues:0Issues:0

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:45734Issues:0Issues:0

Track-Anything

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Language:PythonLicense:MITStargazers:6307Issues:0Issues:0

chatgpt-on-wechat

基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择GPT3.5/GPT-4o/GPT4.0/ Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。

Language:PythonLicense:MITStargazers:28569Issues:0Issues:0

Tune-A-Video

[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

Language:PythonLicense:Apache-2.0Stargazers:4155Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:245Issues:0Issues:0

C-VTON

C-VTON: Context-Driven Image-Based Virtual Try-On Network

Language:PythonLicense:EPL-2.0Stargazers:137Issues:0Issues:0

PAL4Inpaint

Perceptual Artifacts Localization for Inpainting, ECCV 2022 (Oral)

Language:PythonLicense:NOASSERTIONStargazers:48Issues:0Issues:0

anomalib

An anomaly detection library comprising state-of-the-art algorithms and features such as experiment management, hyper-parameter optimization, and edge inference.

Language:PythonLicense:Apache-2.0Stargazers:3486Issues:0Issues:0

dressing-in-order

(ICCV'21) Official code of "Dressing in Order: Recurrent Person Image Generation for Pose Transfer, Virtual Try-on and Outfit Editing" by Aiyu Cui, Daniel McKee and Svetlana Lazebnik

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:507Issues:0Issues:0

HR-VITON

Official PyTorch implementation for the paper High-Resolution Virtual Try-On with Misalignment and Occlusion-Handled Conditions (ECCV 2022).

Language:PythonStargazers:818Issues:0Issues:0

Tengine

Tengine is a lite, high performance, modular inference engine for embedded device

Language:C++License:Apache-2.0Stargazers:4592Issues:0Issues:0

ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

Language:C++License:NOASSERTIONStargazers:19827Issues:0Issues:0