Sagor Sarker (sagorbrur)

sagorbrur

Geek Repo

Company:@hishab-nlp

Location:Dhaka, Bangladesh

Home Page:https://sagorbrur.github.io

Twitter:@sagor_sarker

Github PK Tool:Github PK Tool

Sagor Sarker's starred repositories

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:30289Issues:194Issues:4738

ultralytics

NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite

Language:PythonLicense:AGPL-3.0Stargazers:28089Issues:155Issues:8594

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:25965Issues:212Issues:234

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language:PythonLicense:Apache-2.0Stargazers:13221Issues:99Issues:1040

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language:Jupyter NotebookLicense:MITStargazers:12988Issues:94Issues:16

gorilla

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Language:PythonLicense:Apache-2.0Stargazers:11173Issues:97Issues:216

text-generation-inference

Large Language Model Text Generation Inference

Language:PythonLicense:Apache-2.0Stargazers:8730Issues:97Issues:1285

cudf

cuDF - GPU DataFrame Library

Language:C++License:Apache-2.0Stargazers:8226Issues:150Issues:6376

skypilot

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

Language:PythonLicense:Apache-2.0Stargazers:6473Issues:71Issues:1700

bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Language:PythonLicense:MITStargazers:5998Issues:49Issues:1004

torchtune

A Native-PyTorch Library for LLM Fine-tuning

Language:PythonLicense:BSD-3-ClauseStargazers:3905Issues:44Issues:496

Awesome-LLMOps

An awesome & curated list of best LLMOps tools for developers

Language:ShellLicense:CC0-1.0Stargazers:3690Issues:65Issues:8

torchchat

Run PyTorch LLMs locally on servers, desktop and mobile

Language:PythonLicense:BSD-3-ClauseStargazers:3083Issues:37Issues:264
Language:PythonLicense:Apache-2.0Stargazers:2642Issues:35Issues:33

datasketch

MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW

Language:PythonLicense:MITStargazers:2509Issues:48Issues:164

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language:PythonLicense:Apache-2.0Stargazers:1912Issues:45Issues:116

helm

Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image models in Holistic Evaluation of Text-to-Image Models (HEIM) (https://arxiv.org/abs/2311.04287).

Language:PythonLicense:Apache-2.0Stargazers:1849Issues:35Issues:1066

agentops

Python SDK for agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks like CrewAI, Langchain, and Autogen

Language:PythonLicense:MITStargazers:1658Issues:20Issues:85

Awesome-LLMs-Datasets

Summarize existing representative LLMs text datasets.

bonito

A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.

Language:PythonLicense:BSD-3-ClauseStargazers:652Issues:13Issues:21

text-clustering

Easily embed, cluster and semantically label text datasets

Language:PythonLicense:Apache-2.0Stargazers:424Issues:33Issues:5

fm-cheatsheet

Website for hosting the Open Foundation Models Cheat Sheet.

IndicLLMSuite

A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages

Language:PythonLicense:MITStargazers:88Issues:8Issues:3

LLMEvaluation

A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use cases, promote the adoption of best practices in LLM assessment, and critically assess the effectiveness of these evaluation methods.

Language:HTMLStargazers:44Issues:0Issues:0

MalayaLLM

A Continually LoRA PreTrained and FineTuned 7B Llama-2 Indic model for Malayalam Language.

Awesome_Bangla_Datasets

Awesome Bangla Datasets

Stargazers:15Issues:0Issues:0

CommunityLM

[COLING 2022]: CommunityLM: Probing Partisan Worldviews from Language Models

Language:Jupyter NotebookLicense:CC0-1.0Stargazers:13Issues:4Issues:0