clownrat6

Company: 🀑 School

Location: 🀑 Gotham

Home Page: xxx

Twitter: @xxx

clownrat6's starred repositories

contrastors

Train Models Contrastively in Pytorch

Language: Python | License: Apache-2.0 | Stargazers: 493 | Issues: 0

open_clip

An open source implementation of CLIP.

Language: Python | License: NOASSERTION | Stargazers: 9544 | Issues: 0

RLAIF-V

RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness

Language: Python | Stargazers: 173 | Issues: 0

SeaLLMs

[ACL 2024 Demo] SeaLLMs - Large Language Models for Southeast Asia

Language: JavaScript | Stargazers: 138 | Issues: 0

MT-LLaMA

Multi-Task instruction-tuned LLaMA

Language: Python | License: Apache-2.0 | Stargazers: 13 | Issues: 0

Stargazers: 4 | Issues: 0

Language: Python | Stargazers: 6 | Issues: 0

CGR

Code for "Exploiting Reasoning Chains for Multi-hop Science Question Answering"

Language: Python | Stargazers: 10 | Issues: 0

Language: Python | Stargazers: 21 | Issues: 0

CUT

Source code of "Reasons to Reject? Aligning Language Models with Judgments"

Language: Python | License: Apache-2.0 | Stargazers: 54 | Issues: 0

Language: Python | Stargazers: 150 | Issues: 0

LivePortrait

Bring portraits to life!

Language: Python | License: NOASSERTION | Stargazers: 10067 | Issues: 0

Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Language: Shell | Stargazers: 6899 | Issues: 0

AdaShield

[ECCV 2024] The official code for "AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shield Prompting."

Language: Python | Stargazers: 27 | Issues: 0

datacomp

DataComp: In search of the next generation of multimodal datasets

Language: Python | License: NOASSERTION | Stargazers: 627 | Issues: 0

ego4d-goalstep

Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)

Language: Python | License: MIT | Stargazers: 33 | Issues: 0

common_metrics_on_video_quality

You can easily calculate FVD, PSNR, SSIM, LPIPS for evaluating the quality of generated or predicted videos.

Language: Python | Stargazers: 177 | Issues: 0

textgrad

TextGrad: Automatic "Differentiation" via Text, using large language models to backpropagate textual gradients.

Language: Python | License: MIT | Stargazers: 1388 | Issues: 0

LongVA

Long Context Transfer from Language to Vision

Language: Python | License: Apache-2.0 | Stargazers: 266 | Issues: 0

VideoHallucer

VideoHallucer: the first comprehensive benchmark for hallucination detection in large video-language models (LVLMs)

Language: Python | License: MIT | Stargazers: 17 | Issues: 0

tiny-diffusion

A minimal PyTorch implementation of probabilistic diffusion models for 2D datasets.

Language: Jupyter Notebook | Stargazers: 609 | Issues: 0

LLaVA-RLHF

Aligning LMMs with Factually Augmented RLHF

Language: Python | License: GPL-3.0 | Stargazers: 291 | Issues: 0

videollm-online

VideoLLM-online: Online Video Large Language Model for Streaming Video (CVPR 2024)

Language: Python | License: Apache-2.0 | Stargazers: 150 | Issues: 0

Math-LLaVA

Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models

Language: Python | License: Apache-2.0 | Stargazers: 49 | Issues: 0

VoCo-LLaMA

VoCo-LLaMA: This repo is the official implementation of "VoCo-LLaMA: Towards Vision Compression with Large Language Models".

Language: Python | License: Apache-2.0 | Stargazers: 67 | Issues: 0

Mantis

Official code for Paper "Mantis: Multi-Image Instruction Tuning"

Language: Python | License: Apache-2.0 | Stargazers: 136 | Issues: 0

WebDesignAgent

WebDesignAgent: Towards Effortless Website Creation

Language: Python | License: Apache-2.0 | Stargazers: 225 | Issues: 0

Recap-DataComp-1B

This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3?"

Stargazers: 110 | Issues: 0

medmcqa

A large-scale (194k) Multiple-Choice Question Answering (MCQA) dataset designed to address real-world medical entrance exam questions.

Language: Jupyter Notebook | License: MIT | Stargazers: 160 | Issues: 0

OmniTokenizer

OmniTokenizer: one model and one weight for image-video joint tokenization.

Language: Python | License: MIT | Stargazers: 211 | Issues: 0