Cat Yung (catyung)

Company: Super Cat Technology Limited

Location: Hong Kong

Home Page: www.super-cat.tech

Cat Yung's starred repositories

llama.cpp

LLM inference in C/C++

grok-1

Grok open release

Language: Python | License: Apache-2.0 | Stargazers: 49179 | Issues: 563 | Issues: 202

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language: Jupyter Notebook | License: MIT | Stargazers: 23666 | Issues: 317 | Issues: 386

MoneyPrinterTurbo

Generate high-definition short videos with one click using large AI models (LLMs).

Language: Python | License: MIT | Stargazers: 14989 | Issues: 130 | Issues: 341

Awesome-Multimodal-Large-Language-Models

✨✨ Latest Advances on Multimodal Large Language Models

GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language: Python | License: Apache-2.0 | Stargazers: 5705 | Issues: 37 | Issues: 285

LLocalSearch

LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progress of the agents and the final answer. No OpenAI or Google API keys are needed.

Language: Go | License: Apache-2.0 | Stargazers: 5421 | Issues: 28 | Issues: 90

LaVague

Large Action Model framework to develop AI Web Agents

Language: Python | License: Apache-2.0 | Stargazers: 4988 | Issues: 49 | Issues: 195

Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Language: Python | License: NOASSERTION | Stargazers: 4345 | Issues: 48 | Issues: 396

mergekit

Tools for merging pretrained large language models.

Language: Python | License: LGPL-3.0 | Stargazers: 4104 | Issues: 46 | Issues: 252

simpleui

A modern theme based on vue+element-ui for Django admin. Used by more than 20,000 websites worldwide! If you like it, give it a star ✨

Language: Python | License: MIT | Stargazers: 3384 | Issues: 66 | Issues: 354

MuseV

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

Language: Python | License: NOASSERTION | Stargazers: 2125 | Issues: 33 | Issues: 96

GLIP

Grounded Language-Image Pre-training

Language: Python | License: MIT | Stargazers: 2079 | Issues: 45 | Issues: 168

ctransformers

Python bindings for the Transformer models implemented in C/C++ using GGML library.

Monkey

[CVPR 2024 Highlight] Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Language: Python | License: MIT | Stargazers: 1556 | Issues: 20 | Issues: 95

mPLUG-DocOwl

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Language: Python | License: Apache-2.0 | Stargazers: 1152 | Issues: 27 | Issues: 91

LLaVA-Plus-Codebase

LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills

Language: Python | License: Apache-2.0 | Stargazers: 665 | Issues: 11 | Issues: 24

django-tutorial

Django basics tutorial, from scratch - Django-Beginners-Guide 📝

Language: Python | License: MIT | Stargazers: 478 | Issues: 30 | Issues: 1

ml-mobileclip

This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" CVPR 2024

Language: Python | License: NOASSERTION | Stargazers: 473 | Issues: 15 | Issues: 0

GP-VTON

Official Implementation for CVPR2023 paper "GP-VTON: Towards General Purpose Virtual Try-on via Collaborative Local-Flow Global-Parsing Learning"

repeng

A library for making RepE control vectors

Language: Jupyter Notebook | License: MIT | Stargazers: 419 | Issues: 6 | Issues: 19

Multi-LoRA-Composition

Repository for the Paper "Multi-LoRA Composition for Image Generation"

MoAI

Official PyTorch implementation code for realizing the technical part of Mixture of All Intelligence (MoAI) to improve performance of numerous zero-shot vision language tasks. (Under Review)

Language: Python | License: MIT | Stargazers: 297 | Issues: 10 | Issues: 20

transformer-heads

Toolkit for attaching, training, saving and loading of new heads for transformer models

Language: Jupyter Notebook | License: MIT | Stargazers: 209 | Issues: 5 | Issues: 1

MTL-TabNet

MTL-TabNet: Multi-task Learning based Model for Image-based Table Recognition

Language: Python | License: Apache-2.0 | Stargazers: 79 | Issues: 2 | Issues: 18

vllm-ra

vLLM with RelayAttention integration

Language: Python | License: Apache-2.0 | Stargazers: 24 | Issues: 1 | Issues: 2

blitz-embed

C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welcome.

Language: C++ | License: MIT | Stargazers: 21 | Issues: 0 | Issues: 0

orc

🧌 Parsing structured information from OCR outputs

Language: Jupyter Notebook | License: MIT | Stargazers: 17 | Issues: 6 | Issues: 2

Language: Jupyter Notebook | Stargazers: 15 | Issues: 1 | Issues: 0