Drishti Sushma 's starred repositories

Unlearning_LLM

[ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models"

Language:PythonLicense:MITStargazers:43Issues:0Issues:0

Multi-Agent_Pickup_and_Delivery

Implementations of various algorithms used to solve the problem of Multi-Agent Pickup and Delivery (a generalization of Multi-Agent Path Finding).

Language:PythonLicense:MITStargazers:51Issues:0Issues:0

Qwen2-VL

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Language:PythonLicense:Apache-2.0Stargazers:2634Issues:0Issues:0

WorkBench

WorkBench: a Benchmark Dataset for Agents in a Realistic Workplace Setting.

Language:PythonLicense:MITStargazers:29Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:22Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:413Issues:0Issues:0

reward-bench

RewardBench: the first evaluation tool for reward models.

Language:PythonLicense:Apache-2.0Stargazers:389Issues:0Issues:0

L1B3RT45

JAILBREAK PROMPTS FOR LIBERATING AI MODELS

License:AGPL-3.0Stargazers:3202Issues:0Issues:0

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonLicense:MITStargazers:6702Issues:0Issues:0

maya-dataset-creation

The Repository contains the code for dataset creation for the Training the Maya: Multilingual Aya Model

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

mlmm-evaluation

Multilingual Large Language Models Evaluation Benchmark

Language:PythonLicense:Apache-2.0Stargazers:97Issues:0Issues:0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:6Issues:0Issues:0

DOSA

Dataset of of Social Artifacts from Different Indian Geographical Subcultures

Language:Jupyter NotebookLicense:MITStargazers:5Issues:0Issues:0
Language:PythonLicense:GPL-3.0Stargazers:2Issues:0Issues:0

nllb-serve

Meta's "No Language Left Behind" models served as web app and REST API

Language:PythonStargazers:177Issues:0Issues:0
Language:Jupyter NotebookStargazers:7Issues:0Issues:0

NAACL-2024-SemEval-TASK-8C

Code for the paper : Black-Box Word-Level Text Boundary Detection in Partially Machine Generated Texts

Language:Jupyter NotebookLicense:MITStargazers:2Issues:0Issues:0

aya_rm_multilingual

Repository for Aya Expedition Project : Reward Model Multilingual

Language:PythonLicense:MITStargazers:7Issues:0Issues:0

GoldenFace

An Image Processing Library About Calculating Face Golden Ratio, Facial Cosine Similarity and More

Language:PythonLicense:MITStargazers:32Issues:0Issues:0

face_rating

Face/Beauty Rating with both the traditional ML approaches and Convolutional Neural Network Approach

Language:Jupyter NotebookStargazers:72Issues:0Issues:0
Language:PythonStargazers:187Issues:0Issues:0

Multi-Agents-Debate

MAD: The first work to explore Multi-Agent Debate with Large Language Models :D

Language:PythonLicense:GPL-3.0Stargazers:258Issues:0Issues:0

llm-compression-benchmark

LLM Compression Benchmark

Language:PythonLicense:Apache-2.0Stargazers:16Issues:0Issues:0

massive

Tools and Modeling Code for the MASSIVE dataset

Language:PythonLicense:NOASSERTIONStargazers:538Issues:0Issues:0

mgt-detection-benchmark

Multilingual machine-generated text detection benchmark

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:6Issues:0Issues:0

llama-cpp-perplexity-scorecard

Run llama.cpp perplexity test and save results to a cloud datastore for analysis and comparison

Language:PythonLicense:MITStargazers:3Issues:0Issues:0

instant-ngp

Instant neural graphics primitives: lightning fast NeRF and more

Language:CudaLicense:NOASSERTIONStargazers:15909Issues:0Issues:0

interpretability-starter

šŸ§  Starter templates for doing interpretability research

Stargazers:61Issues:0Issues:0

PROMST

Automatic prompt optimization framework for multi-step agent tasks.

Language:PDDLLicense:MITStargazers:18Issues:0Issues:0

promptfoo

Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.

Language:TypeScriptLicense:MITStargazers:4447Issues:0Issues:0