Steve's starred repositories
VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
DreamCraft3D
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
DeepFashion_Try_On
Official code for "Towards Photo-Realistic Virtual Try-On by Adaptively Generating↔Preserving Image Content",CVPR‘20 https://arxiv.org/abs/2003.05863
Orion
Orion-14B is a family of models includes a 14B foundation LLM, and a series of models: a chat model, a long context model, a quantized model, a RAG fine-tuned model, and an Agent fine-tuned model. Orion-14B 系列模型包括一个具有140亿参数的多语言基座大模型以及一系列相关的衍生模型,包括对话模型,长文本模型,量化模型,RAG微调模型,Agent微调模型等。
pytorch-randaugment
Unofficial PyTorch Reimplementation of RandAugment.
torchlayers
Shape and dimension inference (Keras-like) for PyTorch layers and neural networks
mobilevit-pytorch
A PyTorch implementation of "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer"
fastcampus_slam_codes
Code exercises for the SLAM course in 'Computer Vision, LiDAR processing, and Sensor Fusion for Autonomous Driving' lecture series
airflow-repo-template
The easiest way to run Airflow locally, with linting & tests for valid DAGs and Plugins.
Active-Passive-Losses
[ICML2020] Normalized Loss Functions for Deep Learning with Noisy Labels
Streamlit-Image-Annotation
Streamlit component for image annotation.
MobileOne-PyTorch
A PyTorch implementation of MobileOne
Convolution-From-Scratch
Implementation of the generalized 2D convolution with dilation from scratch in Python and NumPy
vision_transformer_tf
This repository contains the TensorFlow implementation of the paper "AN IMAGE IS WORTH 16X16 WORDS: TRANSFORMERS FOR IMAGE RECOGNITION AT SCALE" known as vision transformers.