PrinceYen's starred repositories
arena-hard-auto
Arena-Hard-Auto: An automatic LLM benchmark.
llm-decontaminator
Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
awesome-llm-human-preference-datasets
A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.
Awesome-Code-LLM
A curated list of language modeling research for code and related datasets.
Video-LLaVA
PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models
Awesome-LLMs-for-Video-Understanding
🔥🔥🔥 Latest Papers, Code, and Datasets on Vid-LLMs.
DeepSeek-Coder
DeepSeek Coder: Let the Code Write Itself
direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
text-generation-inference
Large Language Model Text Generation Inference
Megatron-LM
Ongoing research training transformer models at scale
TimeSformer
The official PyTorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"