gmlwns2000

AinL's repositories

sharkshark-4k

Upscale Twitch stream and restream into Twitch or RTMP or File.

Language:PythonNOASSERTION15 2 3

sea-attention

Official Implementation of SEA: Sparse Linear Attention with Estimated Attention Mask (ICLR 2024)

Language:Python10 30

streaming-llm-triton

OpenAI Triton Implementation of Streaming LLM

Language:Python8 20

sttabt

[ICLR2023] Official code of Sparse Token Transformer with Attention Back-Tracking

Language:Jupyter Notebook6 30

hss001-latex-template

Language:TeXGPL-3.04 10

mlai-cli

Language:Python4 10

hip-ainl

Language:PythonNOASSERTION3 60

cs454-project

CS454 2023 F Team 4

Language:Jupyter Notebook1 10

InfiniGen

InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)

Language:PythonApache-2.0100

pypareto-native

Numba optimized version of `pypareto`. Sorting chains for pareto frontier extraction

Language:PythonMIT100

RULER-hip

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Language:PythonApache-2.0100

ai-fact-check-accuracy

Language:Jupyter Notebook010

cascading_kv_cache

000

EXAONE-3.5

Official repository for EXAONE 3.5 built by LG AI Research

NOASSERTION000

gmlwns2000

010

gmlwns2000.github.io

AcadHomepage: A Modern and Responsive Academic Personal Homepage

Language:SCSSMIT000

hip-attention

Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.

Language:Python000

hpc

Language:Python010

image-augmentation-server

Language:Python010

image-lm

Language:Jupyter Notebook010

InfiniteBench-hip

Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718

Language:PythonMIT000

llmperf-hip

Language:PythonApache-2.0010

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonMIT000

loft-hip

LOFT: A 1 Million+ Token Long-Context Benchmark

Language:PythonApache-2.0000

LongBench-hip

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding

Language:PythonMIT000

LongLM

LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Language:PythonMIT000

sglang-hip12

SGLang is a fast serving framework for large language models and vision language models. See hip12-offload-add-offload-cache

Language:PythonApache-2.0000

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonApache-2.0000

triton-fix-autotune

Development repository for the Triton language and compiler

Language:C++MIT000

vllm-timber

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonApache-2.0000