DrishtiShrrrma

followers

following

stars

Drishti Sushma 's starred repositories

Unlearning_LLM

[ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models"

Language:PythonMIT4300

Multi-Agent_Pickup_and_Delivery

Implementations of various algorithms used to solve the problem of Multi-Agent Pickup and Delivery (a generalization of Multi-Agent Path Finding).

Language:PythonMIT5100

Qwen2-VL

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Language:PythonApache-2.0263400

WorkBench

WorkBench: a Benchmark Dataset for Agents in a Realistic Workplace Setting.

Language:PythonMIT2900

LlavaGuard

Language:PythonApache-2.02200

distilling-step-by-step

Language:PythonApache-2.041300

reward-bench

RewardBench: the first evaluation tool for reward models.

Language:PythonApache-2.038900

L1B3RT45

JAILBREAK PROMPTS FOR LIBERATING AI MODELS

AGPL-3.0320200

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonMIT670200

maya-dataset-creation

The Repository contains the code for dataset creation for the Training the Maya: Multilingual Aya Model

Language:PythonMIT100

mlmm-evaluation

Multilingual Large Language Models Evaluation Benchmark

Language:PythonApache-2.09700

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonApache-2.0600

DOSA

Dataset of of Social Artifacts from Different Indian Geographical Subcultures

Language:Jupyter NotebookMIT500

culture-llm

Language:PythonGPL-3.0200

nllb-serve

Meta's "No Language Left Behind" models served as web app and REST API

Language:Python17700

M4GT-Bench

Language:Jupyter Notebook700

NAACL-2024-SemEval-TASK-8C

Code for the paper : Black-Box Word-Level Text Boundary Detection in Partially Machine Generated Texts

Language:Jupyter NotebookMIT200

aya_rm_multilingual

Repository for Aya Expedition Project : Reward Model Multilingual

Language:PythonMIT700

GoldenFace

An Image Processing Library About Calculating Face Golden Ratio, Facial Cosine Similarity and More

Language:PythonMIT3200

face_rating

Face/Beauty Rating with both the traditional ML approaches and Convolutional Neural Network Approach

Language:Jupyter Notebook7200

Multi-LLM-Agent

Language:Python18700

Multi-Agents-Debate

MAD: The first work to explore Multi-Agent Debate with Large Language Models :D

Language:PythonGPL-3.025800

llm-compression-benchmark

LLM Compression Benchmark

Language:PythonApache-2.01600

massive

Tools and Modeling Code for the MASSIVE dataset

Language:PythonNOASSERTION53800

mgt-detection-benchmark

Multilingual machine-generated text detection benchmark

Language:Jupyter NotebookGPL-3.0600

llama-cpp-perplexity-scorecard

Run llama.cpp perplexity test and save results to a cloud datastore for analysis and comparison

Language:PythonMIT300

instant-ngp

Instant neural graphics primitives: lightning fast NeRF and more

Language:CudaNOASSERTION1590900

interpretability-starter

🧠 Starter templates for doing interpretability research

6100

PROMST

Automatic prompt optimization framework for multi-step agent tasks.

Language:PDDLMIT1800

promptfoo

Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.

Language:TypeScriptMIT444700