杨宇克 (yangyuke001)

yangyuke001

Geek Repo

Company:Zhejiang University

Location:hangzhou

Github PK Tool:Github PK Tool

杨宇克's starred repositories

3D-VLA

[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model

Language:PythonStargazers:250Issues:0Issues:0

SegmentAnything3D

[ICCV'23 Workshop] SAM3D: Segment Anything in 3D Scenes

Language:PythonLicense:MITStargazers:909Issues:0Issues:0

Awesome-Embodied-Agent-with-LLMs

This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥

Stargazers:759Issues:0Issues:0

RT-2

Democratization of RT-2 "RT-2: New model translates vision and language into action"

Language:PythonLicense:MITStargazers:318Issues:0Issues:0

embodied-agents

Seamlessly integrate state-of-the-art transformer models into robotics stacks

Language:PythonLicense:Apache-2.0Stargazers:134Issues:0Issues:0

robotic-transformer-pytorch

Implementation of RT1 (Robotic Transformer) in Pytorch

Language:PythonLicense:MITStargazers:361Issues:0Issues:0

habitat-sim

A flexible, high-performance 3D simulator for Embodied AI research.

Language:C++License:MITStargazers:2481Issues:0Issues:0
License:Apache-2.0Stargazers:204Issues:0Issues:0

ChatGLM2-6B

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

Language:PythonLicense:NOASSERTIONStargazers:15649Issues:0Issues:0

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Language:PythonLicense:Apache-2.0Stargazers:8059Issues:0Issues:0

Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Language:ShellStargazers:6493Issues:0Issues:0

MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。

Language:PythonLicense:Apache-2.0Stargazers:3017Issues:0Issues:0

dino-tracker

Official Pytorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video”

Language:PythonLicense:MITStargazers:316Issues:0Issues:0

narrator

David Attenborough narrates your life

Language:PythonStargazers:4330Issues:0Issues:0

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language:PythonLicense:MITStargazers:6185Issues:0Issues:0

YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Language:PythonLicense:GPL-3.0Stargazers:4047Issues:0Issues:0

Depth-Anything

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Language:PythonLicense:Apache-2.0Stargazers:6526Issues:0Issues:0

modelscope-agent

ModelScope-Agent: An agent framework connecting models in ModelScope with the world

Language:PythonLicense:Apache-2.0Stargazers:2296Issues:0Issues:0

h2o-llmstudio

H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://h2oai.github.io/h2o-llmstudio/

Language:PythonLicense:Apache-2.0Stargazers:3793Issues:0Issues:0

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:27690Issues:0Issues:0

Grounding-DINO-1.5-API

API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Language:PythonLicense:Apache-2.0Stargazers:642Issues:0Issues:0

SEED-X

Multimodal Models in Real World

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:343Issues:0Issues:0

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:24103Issues:0Issues:0

SD-inference

Stable Diffusion inference

Language:PythonLicense:MITStargazers:184Issues:0Issues:0

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language:PythonLicense:Apache-2.0Stargazers:3511Issues:0Issues:0

Monkey

【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Language:PythonLicense:MITStargazers:1593Issues:0Issues:0

Awesome-ChatGPT

ChatGPT资料汇总学习,持续更新......

Stargazers:4035Issues:0Issues:0

ProPainter

[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting

Language:PythonLicense:NOASSERTIONStargazers:5166Issues:0Issues:0

IOPaint

Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.

Language:PythonLicense:Apache-2.0Stargazers:18300Issues:0Issues:0

AnyText

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

Language:PythonLicense:Apache-2.0Stargazers:4067Issues:0Issues:0