talha1503

Talha Chafekar's starred repositories

grok-1

Grok open release

Language:PythonApache-2.049458 562 209

supervision

We write your reusable computer vision tools. 💜

Language:PythonMIT22869 156 419

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonApache-2.021765 184 490

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges.

Language:PythonMIT13398 96 374

outlines

Structured Text Generation

Language:PythonApache-2.08187 47 553

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Language:PythonMIT5652 51 564

Machine-Learning-Interviews

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Language:Jupyter NotebookMIT4524 77 5

multimodal-maestro

streamline the fine-tuning process for multimodal models: PaliGemma, Florence-2, and Qwen2-VL

Language:PythonApache-2.01308 18 13

awesome-grounding

awesome grounding: A curated list of research papers in visual grounding

MIT1008 28 5

awesome-multi-modal-reinforcement-learning

A curated list of Multi-Modal Reinforcement Learning resources (continually updated)

Apache-2.0377 9 1

dragon

[NeurIPS 2022] DRAGON 🐲: Deep Bidirectional Language-Knowledge Graph Pretraining

Language:PythonApache-2.0311 90

Awesome-Embodied-AI

A curated list of awesome papers on Embodied AI and related research/industry-driven resources.

MIT257 100

InstructGLM

Language is All a Graph Needs

Language:PythonApache-2.0220 7 9

open-eqa

OpenEQA Embodied Question Answering in the Era of Foundation Models

Language:Jupyter NotebookMIT208 9 10

ScienceWorld

ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.

Language:ScalaApache-2.0205 9 33

KGen

Knowledge graphs generation from unstructured text.

Language:PythonNOASSERTION69 4 6

knowledge-graph

Generate knowledge graph from unstructured text

Language:Python67 1 2

text-games

This repository provides text game simulators for research purposes.

Language:PythonMIT42 50

HEMM

Holistic evaluation of multimodal foundation models

Language:PythonMIT39 7 4

AltDiffusion

Source code for paper: "AltDiffusion: A multilingual Text-to-Image diffusion model"

Language:Python32 2 4

knowledge-graphs

Building Knowledge Graphs from Unstructured Text

Language:Jupyter Notebook24 2 1

reddit-RL-simulator

This repository provides simulator codes for predicting and tracking popular discussion threads on Reddit

Language:Python20 20

EmbodiedAIxLLMPapers

Papers on integrating large language models with embodied AI

20 70

ContextualUnderstanding-ContrastiveDecoding

Enhancing contextual understanding in large language models through contrastive decoding

Language:PythonNOASSERTION15 4 1

Visage contains an image dataset of images with human annotations on whether or not certain attributes are present or depicted in the image. The attribute may either be stereotypical or non-stereotypical w.r.t. to the identity group in the image. It also contains a list of attributes in English along with annotations about whether they are visual.

Apache-2.07 30