Shamane Siri's repositories
llm-autoeval
Automatically evaluate your LLMs in Google Colab
sementic-search-with-PEFT
Semantic Search with PEFT and Transformers
LLM-Continual-Learning-Papers
Must-read Papers on Large Language Model (LLM) Continual Learning
Minikube-tutorial
Just wanted to explore minikube to learn k8
ADAS
Automated Design of Agentic Systems
Agentic---Gen-AI
All projects related to Agentic and Gen AI
alignment-handbook
Robust recipes to align language models with human and AI preferences
appworld
🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Paper.
autogen-upstream
A programming framework for agentic AI 🤖
Awesome-LLM-Compression
Awesome LLM compression research papers and tools.
Awesome-LLMs-Pruning
Awesome LLM pruning papers all-in-one repository with integrating all useful resources and insights.
awsome-distributed-training
Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.
BlockMerge_Gradient
Merge Transformers language models by use of gradient parameters.
checkpoint-upload
This is to upload-small-checkpoints
deepscaler
Democratizing Reinforcement Learning for LLMs
e17-4yp-Large-Language-Models-in-Education
The project targets to explore the use of Large Language models in education and develop an intelligent tutor.
examples
repository of example scripts, notebooks, projects
judges
A small library of LLM judges
llm4rec-awesome-papers
A list of awesome papers and resources of recommender system on large language model (LLM).
Marco-o1
An Open Large Reasoning Model for Real-World Solutions
Megatron-LM
Ongoing research training transformer models at scale
optillm
Optimizing inference proxy for LLMs
peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
RAGEN
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
ReST-MCTS
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search
TinyZero
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
verifiers
Verifiers for LLM Reinforcement Learning
verl
veRL: Volcano Engine Reinforcement Learning for LLM
VQ-Rec
[WWW'23] PyTorch implementation for "Learning Vector-Quantized Item Representation for Transferable Sequential Recommenders".