Divebomb's starred repositories

KunQuant

A compiler, optimizer and executor for financial expressions and factors

Language: Python | License: Apache-2.0 | Stargazers: 74 | Issues: 0

MambaOut

MambaOut: Do We Really Need Mamba for Vision?

Language: Python | License: Apache-2.0 | Stargazers: 1899 | Issues: 0

Knowledge-Distillation-Zoo

PyTorch implementation of various Knowledge Distillation (KD) methods.

Language: Python | Stargazers: 1561 | Issues: 0

Awesome-Quantization-Papers

List of papers related to neural network quantization in recent AI conferences and journals.

License: MIT | Stargazers: 379 | Issues: 0

awesome-model-quantization

A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research and is continuously being improved. PRs adding works (papers, repositories) that the repo is missing are welcome.

Stargazers: 1741 | Issues: 0
Language: Python | Stargazers: 36 | Issues: 0

grok-1

Grok open release

Language: Python | License: Apache-2.0 | Stargazers: 49207 | Issues: 0

Efficient-LLMs-Survey

[TMLR 2024] Efficient Large Language Models: A Survey

Stargazers: 873 | Issues: 0

UER-py

Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo

Language: Python | License: Apache-2.0 | Stargazers: 2944 | Issues: 0

ET-BERT

The repository of ET-BERT, a network traffic classification model for encrypted traffic. The work was accepted as a paper at The Web Conference (WWW) 2022.

Language: Python | License: MIT | Stargazers: 317 | Issues: 0

line_profiler

Line-by-line profiling for Python

Language: Python | License: NOASSERTION | Stargazers: 2595 | Issues: 0

alpa

Training and serving large-scale neural networks with auto parallelization.

Language: Python | License: Apache-2.0 | Stargazers: 3020 | Issues: 0

Megatron-LM

Ongoing research training transformer models at scale

Language: Python | License: NOASSERTION | Stargazers: 9510 | Issues: 0

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Language: Python | License: Apache-2.0 | Stargazers: 32252 | Issues: 0

gemma

Open weights LLM from Google DeepMind.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2274 | Issues: 0

Made-With-ML

Learn how to design, develop, deploy and iterate on production-grade ML applications.

Language: Jupyter Notebook | License: MIT | Stargazers: 36692 | Issues: 0

SoraReview

The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models".

Stargazers: 480 | Issues: 0

BERT-LoRA-TensorRT

This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low-Rank Adaptation (LoRA). The models are optimized for high performance using NVIDIA's TensorRT.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 47 | Issues: 0

agents

An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents

Language: Python | License: Apache-2.0 | Stargazers: 4982 | Issues: 0

KG-MM-Survey

Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey

License: MIT | Stargazers: 255 | Issues: 0

data-centric-AI

A curated, but incomplete, list of data-centric AI resources.

Stargazers: 1012 | Issues: 0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python | License: Apache-2.0 | Stargazers: 23749 | Issues: 0

smoothquant

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Language: Python | License: MIT | Stargazers: 1121 | Issues: 0

llm-action

This project aims to share the technical principles behind large models along with hands-on practical experience.

Language: HTML | License: Apache-2.0 | Stargazers: 8124 | Issues: 0

ann-benchmarks

Benchmarks of approximate nearest neighbor libraries in Python

Language: Python | License: MIT | Stargazers: 4774 | Issues: 0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python | License: Apache-2.0 | Stargazers: 34035 | Issues: 0

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language: C++ | License: Apache-2.0 | Stargazers: 7691 | Issues: 0

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python | License: MIT | Stargazers: 35177 | Issues: 0

PocketFlow

An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.

Language: Python | License: NOASSERTION | Stargazers: 2787 | Issues: 0

Awesome-Multimodal-Large-Language-Models

✨✨ Latest Advances on Multimodal Large Language Models

Stargazers: 10875 | Issues: 0