Junyang Lin's repositories
ACA4NMT
Code for a novel model for neural machine translation (NMT)
auto-round
SOTA Weight-only Quantization Algorithm for LLMs
AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
axolotl
Go ahead and axolotl questions
BERT-pytorch
PyTorch implementation of Google AI's 2018 BERT
BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Elevater_Toolkit_IC
Toolkit for Elevater Benchmark
KOBE
Towards Knowledge-Based Personalized Product Description Generation in E-commerce
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
LLaMA-Factory
Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)
llama.cpp
LLM inference in C/C++
mlx-examples
Examples in the MLX framework
python_for_linguists
Python for Linguists – a Gentle Introduction to Programming
pytorch
Tensors and dynamic neural networks in Python with strong GPU acceleration
sglang
SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.
SWE-bench
Enhanced fork of SWE-bench, tailored for OpenDevin's ecosystem.
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs