Yueqian Lin (linyueqian)

Company:Duke University

Location:Durham, NC

Home Page:yueqianlin.com

Twitter:@YueqianL

Yueqian Lin's starred repositories

VideoLISA

[NeurIPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos

Stargazers:6 | Issues:0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:Python | License:Apache-2.0 | Stargazers:29387 | Issues:0

LLaVA-PruMerge

LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models

Language:Python | License:Apache-2.0 | Stargazers:93 | Issues:0

CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Language:Python | License:Apache-2.0 | Stargazers:5187 | Issues:0
Language:Python | License:Apache-2.0 | Stargazers:5876 | Issues:0

Bay-CAT

[ECCV’24] Official Implementation for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios

Language:Python | License:Apache-2.0 | Stargazers:36 | Issues:0
Language:Python | License:Apache-2.0 | Stargazers:561 | Issues:0

NExT-GPT

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Language:Python | License:BSD-3-Clause | Stargazers:3223 | Issues:0

LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Language:Python | License:Apache-2.0 | Stargazers:2089 | Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:Python | License:Apache-2.0 | Stargazers:27681 | Issues:0

Call-for-Reviewers

This project aims to collect the latest "call for reviewers" links from various top CS/ML/AI conferences/journals

License:MIT | Stargazers:343 | Issues:0

LinFusion

Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"

Language:Python | License:Apache-2.0 | Stargazers:210 | Issues:0

VCD

[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

Language:Python | License:Apache-2.0 | Stargazers:183 | Issues:0

SpeculativeDecodingPapers

📰 Must-read papers and blogs on Speculative Decoding ⚡️

License:Apache-2.0 | Stargazers:374 | Issues:0

UTMOS22

UT-Sarulab MOS prediction system using SSL models

Language:Python | License:MIT | Stargazers:169 | Issues:0

WavTokenizer

SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling

Language:Python | License:MIT | Stargazers:681 | Issues:0

Show-o

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Language:Python | License:Apache-2.0 | Stargazers:878 | Issues:0

Awesome-Unified-Multimodal-Models

📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.

Stargazers:148 | Issues:0

RoFormer_pytorch

RoFormer V1 & V2 pytorch

Language:Python | License:Apache-2.0 | Stargazers:467 | Issues:0

llama-recipes

Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.

Language:Jupyter Notebook | Stargazers:11934 | Issues:0

llama-stack

Model components of the Llama Stack APIs

Language:Python | License:MIT | Stargazers:2672 | Issues:0

llama-models

Utilities intended for use with Llama models.

Language:Python | License:NOASSERTION | Stargazers:4254 | Issues:0

modin

Modin: Scale your Pandas workflows by changing a single line of code

Language:Python | License:Apache-2.0 | Stargazers:9806 | Issues:0

kedro

Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.

Language:Python | License:Apache-2.0 | Stargazers:9875 | Issues:0

AI-System-School

🚀 AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys, etc. 🗃️ Llama3, Mistral, etc. 🧑‍💻 Video Tutorials.

License:MIT | Stargazers:2659 | Issues:0

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language:Python | License:Apache-2.0 | Stargazers:31786 | Issues:0

homeassistant-smartrent

Home Assistant Custom Component for SmartRent Locks 🔐, Thermostats 🌡, Sensors 💧 and Switches💡

Language:Python | License:MIT | Stargazers:83 | Issues:0

icloud_photos_downloader

A command-line tool to download photos from iCloud

Language:Python | License:MIT | Stargazers:6676 | Issues:0

lmms-finetune

A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, qwen-vl, phi3-v etc.

Language:Python | License:Apache-2.0 | Stargazers:134 | Issues:0

Awesome-OOD-VLM

Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey [Miyai+, arXiv2024]

Stargazers:55 | Issues:0