luckyhwp's repositories
Domain-private-Factor-Detachment-Network
The proposed DFD combines DiRL, CdFD and CAL into an end-to-end framework.
transferlearning
Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-迁移学习
ArtiBoost
[CVPR 2022 Oral] ArtiBoost: Boosting Articulated 3D Hand-Object Pose Estimation via Online Exploration and Synthesis
assembly101-download-scripts
Python scripts to download Assembly101 from Google Drive
awesome-robot-visual-imitation-learning
A collection of papers, codes and talks of visual imitation learning/imitation learning from video for robotics.
cliport
CLIPort: What and Where Pathways for Robotic Manipulation
CogVideo
Text-to-video generation.
dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
DRSA-Exo2Ego-VideoSynthesis
For cue-free E2VG problem, we propose a cue-free video-based approach termed hierarchical Dynamic memory Refinement and Semantic Alignment (DRSA). Moreover, we create a new DSO ExoEgo dataset with dynamic exocentric scenes and rich interacting objects to advance the E2VG research.
FastSAM
Fast Segment Anything
GA-DDPG
6D Grasping Policy from Point Clouds
handover-sim
A simulation environment and benchmark for human-to-robot object handovers
handover-sim2real
Official code for CVPR'23 paper: Learning Human-to-Robot Handovers from Point Clouds
imitation
Clean PyTorch implementations of imitation and reward learning algorithms
IRRA
Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval (CVPR 2023)
LGGAN
[CVPR 2020] Local Class-Specific and Global Image-Level Generative Adversarial Networks for Semantic-Guided Scene Generation
OMG-Planner
An Optimization-based Motion and Grasp Planner
PerceptualSimilarity
LPIPS metric. pip install lpips
pytorch-CycleGAN-and-pix2pix
Image-to-Image Translation in PyTorch
pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
r3m
Pre-training Reusable Representations for Robotic Manipulation Using Diverse Human Video Data
remp
Rearrangement with Multiple Manipulation Primitives
Sat2StrPanoramaSynthesis
Geometry-Guided Street-View Panorama Synthesis from Satellite Imagery, TPAMI 2022
segment-anything-video
MetaSeg: Packaged version of the Segment Anything repository
Ubuntu_16.04_Cuda_Cudnn
Ubuntu_16.04_Cuda_Cudnn setup
Universal_Robots_ROS2_Driver
Universal Robots ROS2 driver supporting CB3 and e-Series
Universal_Robots_ROS_Driver
Universal Robots ROS driver supporting CB3 and e-Series
ur3
ROS-based UR3/UR3e controller with simulation in Gazebo. Adaptable to other UR robots
VideoCrafter
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models