Talha Chafekar's starred repositories
supervision
We write your reusable computer vision tools. 💜
Machine-Learning-Interviews
This repo is meant to serve as a guide for Machine Learning/AI technical interviews.
multimodal-maestro
Streamline the fine-tuning process for multimodal models: PaliGemma, Florence-2, and Qwen2-VL
awesome-grounding
Awesome Grounding: a curated list of research papers in visual grounding
awesome-multi-modal-reinforcement-learning
A curated list of Multi-Modal Reinforcement Learning resources (continually updated)
Awesome-Embodied-AI
A curated list of awesome papers on Embodied AI and related research/industry-driven resources.
InstructGLM
Language is All a Graph Needs
ScienceWorld
ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.
knowledge-graph
Generate knowledge graph from unstructured text
text-games
This repository provides text game simulators for research purposes.
AltDiffusion
Source code for paper: "AltDiffusion: A multilingual Text-to-Image diffusion model"
knowledge-graphs
Building Knowledge Graphs from Unstructured Text
reddit-RL-simulator
This repository provides simulator code for predicting and tracking popular discussion threads on Reddit
EmbodiedAIxLLMPapers
Papers on integrating large language models with embodied AI
ContextualUnderstanding-ContrastiveDecoding
Enhancing contextual understanding in large language models through contrastive decoding
visage
Visage contains a dataset of images with human annotations on whether certain attributes are present or depicted in the image. The attribute may be either stereotypical or non-stereotypical w.r.t. the identity group in the image. It also contains a list of attributes in English along with annotations about whether they are visual.
python-project-template
Template for project development.