nahidalam's repositories

MobiLlama

MobiLlama : Small Language Model tailored for edge devices

License:Apache-2.0Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

latent-scope

A scientific instrument for investigating latent spaces

License:MITStargazers:0Issues:0Issues:0

jepa

PyTorch code and models for V-JEPA self-supervised learning from video.

License:NOASSERTIONStargazers:0Issues:0Issues:0
License:NOASSERTIONStargazers:0Issues:0Issues:0

Awesome-LLMs-for-Video-Understanding

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

Stargazers:0Issues:0Issues:0

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

torchdistill

A coding-free framework built on PyTorch for reproducible deep learning studies. 🏆22 knowledge distillation methods presented at CVPR, ICLR, ECCV, NeurIPS, ICCV, etc are implemented so far. 🎁 Trained models, training logs and configurations are available for ensuring the reproducibiliy and benchmark.

License:MITStargazers:0Issues:0Issues:0

LURE

[ICLR 2024] Analyzing and Mitigating Object Hallucination in Large Vision-Language Models

Stargazers:0Issues:0Issues:0

vstar

PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"

License:MITStargazers:0Issues:0Issues:0

awesome-ml

Curated list of useful LLM / Analytics / Datascience resources

License:MITStargazers:0Issues:0Issues:0

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

License:Apache-2.0Stargazers:0Issues:0Issues:0

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

License:NOASSERTIONStargazers:0Issues:0Issues:0

generative-ai-for-beginners

12 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

License:MITStargazers:1Issues:0Issues:0
License:GPL-3.0Stargazers:0Issues:0Issues:0

Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

gpt4-vision-plugin

Chat with your images using GPT-4 Vision!

Stargazers:0Issues:0Issues:0
License:AGPL-3.0Stargazers:1Issues:0Issues:0

Awesome-Foundation-Models

A curated list of foundation models for vision and language tasks

Stargazers:0Issues:0Issues:0

Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

License:NOASSERTIONStargazers:0Issues:0Issues:0

InstructDiffusion

PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions.

License:NOASSERTIONStargazers:0Issues:0Issues:0

Awesome-Optical-Flow

This is a list of awesome paper about optical flow and related work.

Stargazers:0Issues:0Issues:0

llm-finetune

LLM Finetune

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

WoodScape

The repository containing tools and information about the WoodScape dataset.

Language:PythonStargazers:0Issues:0Issues:0

meru

Code for the paper "Hyperbolic Image-Text Representations", Desai et al, ICML 2023

License:NOASSERTIONStargazers:0Issues:0Issues:0

DeepCamera

Open-Source AI Camera. Empower any camera/CCTV with state-of-the-art AI, including facial recognition, person recognition(RE-ID) car detection, fall detection and more

License:MITStargazers:0Issues:0Issues:0

heim

Holistic Evaluation of Text-to-Image Models (HEIM), a fork of HELM to evaluate to text-to-image models (paper coming soon).

License:Apache-2.0Stargazers:0Issues:0Issues:0

GIST-image-text-fine-grained

Generating Image-Specific Text for Fine-grained Object Classification

License:MITStargazers:0Issues:0Issues:0

lightly

A python library for self-supervised learning on images.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

awesome-self-supervised-multimodal-learning

A curated list of self-supervised multimodal learning resources.

Stargazers:0Issues:0Issues:0