Sagor Sarker (sagorbrur)

sagorbrur

Geek Repo

Company:@hishab-nlp

Location:Dhaka, Bangladesh

Home Page:https://sagorbrur.github.io

Twitter:@sagor_sarker

Github PK Tool:Github PK Tool

Sagor Sarker's starred repositories

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:31401Issues:198Issues:4870

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:27652Issues:302Issues:87

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:26280Issues:216Issues:240

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language:PythonLicense:Apache-2.0Stargazers:13509Issues:100Issues:1044

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language:Jupyter NotebookLicense:MITStargazers:13166Issues:92Issues:16

gorilla

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Language:PythonLicense:Apache-2.0Stargazers:11248Issues:97Issues:220

text-generation-inference

Large Language Model Text Generation Inference

Language:PythonLicense:Apache-2.0Stargazers:8818Issues:98Issues:1305

cudf

cuDF - GPU DataFrame Library

Language:C++License:Apache-2.0Stargazers:8294Issues:152Issues:6405

skypilot

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

Language:PythonLicense:Apache-2.0Stargazers:6579Issues:71Issues:1728

bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Language:PythonLicense:MITStargazers:6079Issues:50Issues:1011

torchtune

A Native-PyTorch Library for LLM Fine-tuning

Language:PythonLicense:BSD-3-ClauseStargazers:4010Issues:46Issues:560

Awesome-LLMOps

An awesome & curated list of best LLMOps tools for developers

Language:ShellLicense:CC0-1.0Stargazers:3777Issues:66Issues:8

torchchat

Run PyTorch LLMs locally on servers, desktop and mobile

Language:PythonLicense:BSD-3-ClauseStargazers:3158Issues:38Issues:278
Language:PythonLicense:Apache-2.0Stargazers:2671Issues:36Issues:37

datasketch

MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW

Language:PythonLicense:MITStargazers:2518Issues:48Issues:164

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language:PythonLicense:Apache-2.0Stargazers:1959Issues:44Issues:120

helm

Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image models in Holistic Evaluation of Text-to-Image Models (HEIM) (https://arxiv.org/abs/2311.04287).

Language:PythonLicense:Apache-2.0Stargazers:1870Issues:34Issues:1070

agentops

Python SDK for agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks like CrewAI, Langchain, and Autogen

Language:PythonLicense:MITStargazers:1724Issues:22Issues:93

Awesome-LLMs-Datasets

Summarize existing representative LLMs text datasets.

bonito

A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.

Language:PythonLicense:BSD-3-ClauseStargazers:660Issues:12Issues:24

biniou

a self-hosted webui for 30+ generative ai

Language:PythonLicense:GPL-3.0Stargazers:451Issues:11Issues:23

text-clustering

Easily embed, cluster and semantically label text datasets

Language:PythonLicense:Apache-2.0Stargazers:440Issues:33Issues:5

llm_distillation_playbook

Best practices for distilling large language models.

Language:Jupyter NotebookStargazers:376Issues:12Issues:0

IndicLLMSuite

A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages

Language:PythonLicense:MITStargazers:88Issues:8Issues:3

LLMEvaluation

A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use cases, promote the adoption of best practices in LLM assessment, and critically assess the effectiveness of these evaluation methods.

Language:HTMLStargazers:50Issues:0Issues:0

MalayaLLM

A Continually LoRA PreTrained and FineTuned 7B Llama-2 Indic model for Malayalam Language.

Awesome_Bangla_Datasets

Awesome Bangla Datasets

Stargazers:15Issues:0Issues:0