Amanda-Barbara's repositories
autogen
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
blora-text-generation-inference
Batched LoRA + continuous batching
BLoRA-TGI-with-python-server
Batched LoRA + continuous batching
ComputeLibrary
The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
CPlusPlus-Tutorial
C++ Tutorial
CTC-loss-introduction
An introduction to the CTC algorithm, with a simple NumPy implementation
flash-attention
Fast and memory-efficient exact attention
generative-ai-for-beginners
12 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
gpu-profiling
GPU Profiling
kohya_ss
Train Stable Diffusion models
langchain
⚡ Building applications with LLMs through composability ⚡
Latte
The official implementation of Latte: Latent Diffusion Transformer for Video Generation.
leetcode-master
《代码随想录》LeetCode problem-solving guide: a recommended order for 200 classic problems, 600k words of detailed illustrated explanations, video walkthroughs of tricky points, 50+ mind maps, with solutions in C++, Java, Python, Go, JavaScript, and more. Never feel lost studying algorithms again! 🔥🔥 🚀
llm-benchmark-test
Includes benchmarks of open LLM inference frameworks
MetaGPT-agent
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
mistral-src
Reference implementation of Mistral AI 7B v0.1 model.
multimodal-ai-jina
☁️ Build multimodal AI applications with cloud-native stack
opencl-intercept-layer
Intercept Layer for Debugging and Analyzing OpenCL Applications
openmlsys-zh
《Machine Learning Systems: Design and Implementation》- Chinese Version
pocl
pocl - Portable Computing Language
stable-diffusion
A latent text-to-image diffusion model
text2video-generative-models
Generative Models by Stability AI
tgi-benchmarking
Benchmarking LLMs on GPUs
tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
tvm-mlc-llm
Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
yolov5-5.x-annotations
A Chinese-annotated version of YOLOv5 v5.0!