owl-10's starred repositories

faster-whisper

Faster Whisper transcription with CTranslate2

Language:PythonLicense:MITStargazers:11479Issues:121Issues:683
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:7387Issues:64Issues:188

ConvNeXt

Code release for ConvNeXt model

Language:PythonLicense:MITStargazers:5707Issues:32Issues:130

YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Language:PythonLicense:GPL-3.0Stargazers:4390Issues:40Issues:427

whisper-jax

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4360Issues:43Issues:178

introRL

Intro to Reinforcement Learning (强化学习纲要)

Video-LLaVA

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Language:PythonLicense:Apache-2.0Stargazers:2861Issues:28Issues:179

DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Language:PythonLicense:MITStargazers:2019Issues:19Issues:46

ISAT_with_segment_anything

Labeling tool with SAM(segment anything model),supports SAM, SAM2, sam-hq, MobileSAM EdgeSAM etc.交互式半自动图像标注工具

Language:PythonLicense:NOASSERTIONStargazers:1211Issues:11Issues:170

ovsam

[ECCV 2024] The official code of paper "Open-Vocabulary SAM".

Language:PythonLicense:NOASSERTIONStargazers:905Issues:13Issues:43

Awesome-Open-Vocabulary

(TPAMI 2024) A Survey on Open Vocabulary Learning

Grounding-DINO-1.5-API

API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Language:PythonLicense:Apache-2.0Stargazers:715Issues:11Issues:38

mindnlp

Easy-to-use and high-performance NLP and LLM framework based on MindSpore, compatible with models and datasets of 🤗Huggingface.

Language:PythonLicense:Apache-2.0Stargazers:675Issues:10Issues:330

Crop-CLIP

Crop using CLIP

Language:Jupyter NotebookLicense:MITStargazers:334Issues:5Issues:1

fc-clip

[NeurIPS 2023] This repo contains the code for our paper Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP

Language:PythonLicense:Apache-2.0Stargazers:278Issues:16Issues:37

EVF-SAM

Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"

Language:PythonLicense:Apache-2.0Stargazers:244Issues:6Issues:23
Language:PythonLicense:MITStargazers:208Issues:10Issues:8

HUST_EIC_Intro

:label: 华中科技大学电信学院-电信专业 的课程分享与攻略

Language:PythonLicense:CC-BY-SA-4.0Stargazers:178Issues:3Issues:1

SED

[CVPR2024] Official Pytorch Implementation of SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation.

Language:PythonLicense:Apache-2.0Stargazers:112Issues:1Issues:24

DiG

DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention

Language:PythonLicense:MITStargazers:106Issues:4Issues:2

mllm-npu

mllm-npu: training multimodal large language models on Ascend NPUs

Language:PythonLicense:Apache-2.0Stargazers:77Issues:5Issues:3

BoxTeacher

[CVPR 2023] Exploring High-Quality Pseudo Masks for Weakly Supervised Instance Segmentation

Language:PythonLicense:MITStargazers:74Issues:7Issues:12

osp

[ECCV 2024] Occupancy as Set of Points

Language:PythonLicense:MITStargazers:63Issues:6Issues:4

WeakSAM

[ACM MM 2024] WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition

Language:PythonLicense:MITStargazers:32Issues:3Issues:2

Codecfake

This is the official repo of our work titled "The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio".

awesome-HUST-EIC

华中科技大学电信学院启明数理提高班 课程资料 总结/分享计划🔥🔥🔥awesome-HUST-EIC

Language:CStargazers:20Issues:1Issues:0

coframe-public

Coframe lets your website magically improve itself.

Language:PythonLicense:Apache-2.0Stargazers:2Issues:2Issues:0