Arka Sadhu (TheShadow29)

Company: University of Southern California

Location: Los Angeles, CA, USA

Home Page: https://theshadow29.github.io

Arka Sadhu's starred repositories

ollama

Get up and running with Llama 2, Mistral, and other large language models locally.

ComfyUI

The most powerful and modular stable diffusion GUI, API and backend with a graph/nodes interface.

Language: Python, License: GPL-3.0, Stargazers: 32399, Issues: 282, Issues: 2064

generative-models

Generative Models by Stability AI

Language: Python, License: MIT, Stargazers: 22141, Issues: 233, Issues: 249

llama2.c

Inference Llama 2 in one file of pure C

codellama

Inference code for CodeLlama models

Language: Python, License: NOASSERTION, Stargazers: 13860, Issues: 159, Issues: 169

AnimateDiff

Official implementation of AnimateDiff.

Language: Python, License: Apache-2.0, Stargazers: 8700, Issues: 96, Issues: 300

MemGPT

Building persistent LLM agents with long-term memory 📚🦙

Language: Python, License: Apache-2.0, Stargazers: 8564, Issues: 105, Issues: 516

nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Language: Python, License: MIT, Stargazers: 7980, Issues: 67, Issues: 184

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language: Python, License: MIT, Stargazers: 6173, Issues: 60, Issues: 72

StableSwarmUI

StableSwarmUI, A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.

zero123

Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)

Language: Python, License: MIT, Stargazers: 2470, Issues: 42, Issues: 116

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

recognize-anything

Open-source and strong foundation image recognition models.

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 2396, Issues: 25, Issues: 132

make-a-video-pytorch

Implementation of Make-A-Video, new SOTA text-to-video generator from Meta AI, in PyTorch

Language: Python, License: MIT, Stargazers: 1833, Issues: 72, Issues: 15

DPR

Dense Passage Retriever is a set of tools and models for open-domain Q&A tasks.

Language: Python, License: NOASSERTION, Stargazers: 1596, Issues: 23, Issues: 210

CLIP_prefix_caption

Simple image captioning model

Language: Jupyter Notebook, License: MIT, Stargazers: 1197, Issues: 7, Issues: 76

LLMStack

No-code platform to build LLM Agents, workflows and applications with your data

Language: Python, License: NOASSERTION, Stargazers: 1082, Issues: 16, Issues: 37

LLM-in-Vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

ov-seg

This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.

Language: Jupyter Notebook, License: NOASSERTION, Stargazers: 622, Issues: 11, Issues: 28

plotai

PlotAI - Your Ultimate Plotting Assistant! 📊🤖 Use ChatGPT-3.5 to create plots in Python and Matplotlib directly in your Python script or notebook.

Language: Python, License: Apache-2.0, Stargazers: 284, Issues: 4, Issues: 3

diffusion_reading_group

Diffusion Reading Group at EleutherAI

Language: Jupyter Notebook, Stargazers: 284, Issues: 23, Issues: 1

Pseudo-Q

[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding

Language: Python, License: Apache-2.0, Stargazers: 137, Issues: 3, Issues: 18

X2-VLM

All-In-One VLM: Image + Video + Transfer to Other Languages / Domains (TPAMI 2023)

Language: Python, License: BSD-3-Clause, Stargazers: 112, Issues: 6, Issues: 16

Playground

Text WebUI extension to add clever Notebooks to Chat mode

VL-PET

[ICCV2023] Official code for "VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity Control"

Language: Python, License: MIT, Stargazers: 47, Issues: 2, Issues: 4

economist_poll

Which Famous Economist Are You Most Similar To? Data from the IGM expert panel poll and code for extracting it.

Language: Jupyter Notebook, License: NOASSERTION, Stargazers: 30, Issues: 6, Issues: 0

eventful-transformer

Code for our paper "Eventful Transformers: Leveraging Temporal Redundancy in Vision Transformers"

Language: Python, License: MIT, Stargazers: 29, Issues: 4, Issues: 3

ORES

ORES: Open-vocabulary Responsible Visual Synthesis

Language: Python, License: MIT, Stargazers: 11, Issues: 0, Issues: 0