Peter Yu (yukw777)

Peter Yu's starred repositories

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language: Python | License: Apache-2.0 | Stargazers: 35073 | Issues: 346 | Issues: 1685

haystack

LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

Language: Python | License: Apache-2.0 | Stargazers: 14172 | Issues: 127 | Issues: 3262

awesome-tunneling

List of ngrok/Cloudflare Tunnel alternatives and other tunneling software and services. Focus on self-hosting.

RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

Language: Python | License: Apache-2.0 | Stargazers: 11819 | Issues: 136 | Issues: 194

ffmpeg-libav-tutorial

FFmpeg libav tutorial - learn how media works from basic to transmuxing, transcoding and more. Translations: 🇺🇸 🇨🇳 🇰🇷 🇪🇸 🇻🇳 🇧🇷

Language: C | License: BSD-3-Clause | Stargazers: 9685 | Issues: 271 | Issues: 80

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 9002 | Issues: 95 | Issues: 616

DeepSpeedExamples

Example models using DeepSpeed

Language: Python | License: Apache-2.0 | Stargazers: 5768 | Issues: 75 | Issues: 522

open_flamingo

An open-source framework for training large multimodal models.

Language: Python | License: MIT | Stargazers: 3514 | Issues: 47 | Issues: 169

Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Language: Python | License: MIT | Stargazers: 3484 | Issues: 100 | Issues: 159

LLM-As-Chatbot

LLM as a Chatbot Service

Language: Python | License: Apache-2.0 | Stargazers: 3242 | Issues: 54 | Issues: 66

pytorchvideo

A deep learning library for video understanding research.

Language: Python | License: Apache-2.0 | Stargazers: 3208 | Issues: 161 | Issues: 178

Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Language: Python | License: BSD-3-Clause | Stargazers: 2513 | Issues: 30 | Issues: 148

fadblock

Friendly Adblock for YouTube: A fast, lightweight, and undetectable YouTube Ads Blocker for Chrome, Opera and Firefox.

decord

An efficient video loader for deep learning with smart shuffling that's super easy to digest

Language: C++ | License: Apache-2.0 | Stargazers: 1680 | Issues: 30 | Issues: 238

mmc4

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

Language: Python | License: MIT | Stargazers: 875 | Issues: 9 | Issues: 17

Image2Paragraph

[A toolbox for fun.] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.

Language: Python | License: Apache-2.0 | Stargazers: 767 | Issues: 11 | Issues: 28

LLaMA-VID

Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

Language: Python | License: Apache-2.0 | Stargazers: 604 | Issues: 11 | Issues: 94

pytorch-coviar

Compressed Video Action Recognition

Language: Python | License: LGPL-2.1 | Stargazers: 495 | Issues: 12 | Issues: 93

XPretrain

Multi-modality pre-training

Language: Python | License: NOASSERTION | Stargazers: 446 | Issues: 14 | Issues: 33

llama-docker-playground

Quick Start LLaMA models with multiple methods, and fine-tune 7B/65B with One-Click.

Language: Python | License: GPL-3.0 | Stargazers: 345 | Issues: 2 | Issues: 0

OpenGPT

A framework for creating grounded, instruction-based datasets and training conversational domain expert Large Language Models (LLMs).

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 322 | Issues: 9 | Issues: 6

chat-with-nerf

Chat with NeRF enables users to interact with a NeRF model by typing in natural language.

Language: Python | License: Apache-2.0 | Stargazers: 269 | Issues: 5 | Issues: 11

mv-extractor

Extract frames and motion vectors from H.264 and MPEG-4 encoded video.

Language: C | License: MIT | Stargazers: 252 | Issues: 4 | Issues: 28

cv_emulate

Academic CVs that you can emulate

Language: TeX | License: NOASSERTION | Stargazers: 251 | Issues: 4 | Issues: 0

funcX

Globus Compute: High Performance Function Serving for Science

Language: Python | License: Apache-2.0 | Stargazers: 134 | Issues: 17 | Issues: 318

RGB-no-more

An official code release of the paper RGB no more: Minimally Decoded JPEG Vision Transformers

Language: Shell | License: NOASSERTION | Stargazers: 48 | Issues: 3 | Issues: 2

CoCap

[ICCV 2023] Accurate and Fast Compressed Video Captioning

Language: Python | License: MIT | Stargazers: 30 | Issues: 3 | Issues: 8

Compressed-Video-Reader

A video reader for extracting motion vectors and residuals from encoded H.264 videos.

Language: C | License: MIT | Stargazers: 11 | Issues: 1 | Issues: 3

gufi-archive

Public repo of documentation and scripts showing how to use GUFI to generate reports that identify data suitable for archiving

trestles

Exporter for low-level components (e.g. DCT coefficients, MVDs) from the h.264 codec based on the reference implementation.

Language: C | License: MIT | Stargazers: 8 | Issues: 4 | Issues: 1