Khalid's repositories
alpaca-lora
Instruct-tune LLaMA on consumer hardware
ChatRWKV
ChatRWKV is like ChatGPT but powered by the RWKV (100% RNN) language model, and is open source.
CLIP_benchmark
CLIP-like model evaluation
lm-evaluation-harness
A framework for few-shot evaluation of language models.
promptsource
Toolkit for collecting and applying prompts
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
improved-diffusion
Text-writing denoising diffusion (and much more)
LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
LLaVA
Visual Instruction Tuning: Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.
llm2vec
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Multilingual-CLIP
OpenAI CLIP text encoders for multiple languages!
NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
NeMo-RL
Scalable toolkit for efficient model reinforcement
nmatheg
A simple strategy for training and finetuning NLP models for Arabic. Specify the parameters and just wait for the results. A simple design that makes use of the different tools in our NLP pipeline.
Open-Instruction-Generalist
Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks
open_clip
An open source implementation of CLIP.
pts
Pivotal Token Search
RWKV-Runner
An RWKV management and startup tool with full automation, only 8MB. It also provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for commercial use.
sentence-splitter
Text to sentence splitter using a heuristic algorithm by Philipp Koehn and Josh Schroeder.
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
ultravox
A fast multimodal LLM for real-time voice