Xiangyu Hong's starred repositories
ChunkLlama
[ICML'24] Data and code for the paper "Training-Free Long-Context Scaling of Large Language Models"
attention_with_linear_biases
Code for the ALiBi method for transformer language models (ICLR 2022); a minimal sketch of the ALiBi bias follows this list.
landmark-attention
Landmark Attention: Random-Access Infinite Context Length for Transformers
streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
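The attention_with_linear_biases repository above implements ALiBi, which replaces positional embeddings with a fixed linear penalty on attention logits. As a rough illustration only (not the repository's own code), here is a minimal PyTorch sketch assuming a standard causal self-attention setup; the helper names alibi_slopes and alibi_bias are hypothetical, and the slope formula shown is the power-of-two head-count case from the paper.

```python
import torch

def alibi_slopes(n_heads: int) -> torch.Tensor:
    # Head-specific slopes. For a power-of-two head count the paper uses the
    # geometric sequence 2^(-8/n), 2^(-16/n), ..., 2^(-8).
    ratio = 2.0 ** (-8.0 / n_heads)
    return torch.tensor([ratio ** (i + 1) for i in range(n_heads)])

def alibi_bias(n_heads: int, seq_len: int) -> torch.Tensor:
    # Static bias added to attention logits: query i attending to key j (j <= i)
    # is penalized by slope * (i - j); entries above the diagonal are irrelevant
    # because the usual causal mask removes them.
    pos = torch.arange(seq_len)
    distance = (pos[:, None] - pos[None, :]).clamp(min=0)     # (seq, seq)
    return -alibi_slopes(n_heads)[:, None, None] * distance   # (heads, seq, seq)

# Hypothetical usage inside a causal self-attention layer:
#   scores = (q @ k.transpose(-2, -1)) / math.sqrt(head_dim)  # (batch, heads, seq, seq)
#   scores = scores + alibi_bias(n_heads, seq_len).to(scores.device)
#   ... then apply the causal mask and softmax as usual.
```

Because the bias depends only on relative distance, a model trained with ALiBi at short lengths can be run on longer inputs without retraining, which is why it sits alongside the other long-context repositories in this list.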