Sagor Sarker (sagorbrur)

sagorbrur

Geek Repo

Company:@hishab-nlp

Location:Dhaka, Bangladesh

Home Page:https://sagorbrur.github.io

Twitter:@sagor_sarker

Github PK Tool:Github PK Tool

Sagor Sarker's starred repositories

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:28146Issues:187Issues:4440

ultralytics

NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite

Language:PythonLicense:AGPL-3.0Stargazers:26789Issues:154Issues:8119

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:25012Issues:206Issues:213

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language:PythonLicense:Apache-2.0Stargazers:12856Issues:99Issues:1031

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language:Jupyter NotebookLicense:MITStargazers:11615Issues:83Issues:14

text-generation-inference

Large Language Model Text Generation Inference

Language:PythonLicense:Apache-2.0Stargazers:8504Issues:99Issues:1237
Language:PythonLicense:Apache-2.0Stargazers:7039Issues:67Issues:69

skypilot

SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.

Language:PythonLicense:Apache-2.0Stargazers:6345Issues:71Issues:1654

bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Language:PythonLicense:MITStargazers:5756Issues:48Issues:968
Language:PythonLicense:Apache-2.0Stargazers:2575Issues:33Issues:27

datasketch

MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW

Language:PythonLicense:MITStargazers:2476Issues:48Issues:163

promptbench

A unified evaluation framework for large language models

Language:PythonLicense:MITStargazers:2296Issues:22Issues:45

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language:PythonLicense:Apache-2.0Stargazers:1819Issues:43Issues:105

helm

Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image models in Holistic Evaluation of Text-to-Image Models (HEIM) (https://arxiv.org/abs/2311.04287).

Language:PythonLicense:Apache-2.0Stargazers:1805Issues:36Issues:1051

agentops

Python SDK for agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks like CrewAI, Langchain, and Autogen

Language:PythonLicense:MITStargazers:1417Issues:20Issues:76

WikiChat

WikiChat stops the hallucination of large language models by retrieving data from Wikipedia.

Language:PythonLicense:Apache-2.0Stargazers:938Issues:15Issues:17

Awesome-LLMs-Datasets

Summarize existing representative LLMs text datasets.

bonito

A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.

Language:PythonLicense:BSD-3-ClauseStargazers:620Issues:13Issues:19

ringattention

Transformers with Arbitrarily Large Context

Language:PythonLicense:Apache-2.0Stargazers:571Issues:5Issues:15

fm-cheatsheet

Website for hosting the Open Foundation Models Cheat Sheet.

IndicLLMSuite

A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages

Language:PythonLicense:MITStargazers:78Issues:8Issues:3

LLMeBench

Benchmarking Large Language Models

LLMEvaluation

A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use cases, promote the adoption of best practices in LLM assessment, and critically assess the effectiveness of these evaluation methods.

Language:HTMLStargazers:42Issues:0Issues:0

MalayaLLM

A Continually LoRA PreTrained and FineTuned 7B Llama-2 Indic model for Malayalam Language.

Awesome_Bangla_Datasets

Awesome Bangla Datasets

Stargazers:15Issues:0Issues:0

CommunityLM

[COLING 2022]: CommunityLM: Probing Partisan Worldviews from Language Models

Language:Jupyter NotebookLicense:CC0-1.0Stargazers:13Issues:4Issues:0

Bangla-Vulgar-Lexicon

A list of Bengali vulgar words

Stargazers:1Issues:0Issues:0