ColdFusion2001

ColdFusion2001's starred repositories

RAG_Techniques

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.

Language: Jupyter Notebook · License: Apache-2.0 · 216900

awesome-kan

A comprehensive collection of KAN (Kolmogorov-Arnold Network) resources, including libraries, projects, tutorials, and papers, for researchers and developers working with Kolmogorov-Arnold Networks.

206900

doremi

PyTorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets

Language: HTML · License: MIT · 28200

RetrievalTutorials

Language: Jupyter Notebook · 63000

gpt-prompt-engineer

Language: Jupyter Notebook · License: MIT · 925100

ZeroEval

A simple unified framework for evaluating LLMs

Language: Python · License: Apache-2.0 · 11100

MoDS

Language: Python · 10400

Intra-Fusion

Towards Meta-Pruning via Optimal Transport, ICLR 2024 (Spotlight)

Language: Python · License: MIT · 1000

meta-weight-net

NeurIPS'19: Meta-Weight-Net: Learning an Explicit Mapping For Sample Weighting (PyTorch implementation for noisy labels).

Language: Python · License: MIT · 28000

distilabel

Distilabel is a framework for synthetic data and AI feedback, aimed at engineers who need fast, reliable, and scalable pipelines based on verified research papers.

Language: Python · License: Apache-2.0 · 129400

matmulfreellm

Implementation for MatMul-free LM.

Language: Python · License: Apache-2.0 · 282500

llmtools

Finetuning Large Language Models on One Consumer GPU in Under 4 Bits

Language: Python · 69100

complexity-scaling

gzip Predicts Data-dependent Scaling Laws

Language: Python · License: MIT · 3100

FlashRAG

⚡FlashRAG: A Python Toolkit for Efficient RAG Research

Language: Python · License: MIT · 102900

modula

Scalable neural net training via automatic normalization in the modular norm.

Language: Jupyter Notebook · License: MIT · 10100

logix

AI Logging for Interpretability and Explainability🔬

Language: Python · License: Apache-2.0 · 6900

LOMO

LOMO: LOw-Memory Optimization

Language: Python · License: MIT · 96400

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Language: Python · License: Apache-2.0 · 191500

model-explorer

A modern model graph visualizer and debugger

Language: JavaScript · License: Apache-2.0 · 96300

zett

Code for Zero-Shot Tokenizer Transfer

Language: Python · 10700

arc-dsl

Domain Specific Language for the Abstraction and Reasoning Corpus

Language: Python · License: MIT · 13100

KAN-GPT-2

Training small GPT-2 style models using Kolmogorov-Arnold networks.

Language: Python · 10000

kan-gpt

The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling

Language: Python · License: MIT · 67800

matryoshka-representation-learning

PyTorch implementation for MRL

Language: Python · 1600

energy-transformer-torch

Official Implementation of Energy Transformer in PyTorch for Mask Image Reconstruction

Language: Python · License: MIT · 1700

NeMo-Aligner

Scalable toolkit for efficient model alignment

Language: Python · License: Apache-2.0 · 48400

pyreft

ReFT: Representation Finetuning for Language Models

Language: Python · License: Apache-2.0 · 103100

pykan

Kolmogorov Arnold Networks

Language: Jupyter Notebook · License: MIT · 1418500

MiniMoE

Code for ACL 2023 paper titled "Lifting the Curse of Capacity Gap in Distilling Language Models"

Language: Python · License: Apache-2.0 · 2800

llamaduo

This project showcases an LLMOps pipeline that fine-tunes a small LLM so it can take over if the service LLM suffers an outage. The project initially used Gemini 1.0 Pro as the service LLM and Gemma 2B/7B as the small LLM. It now supports other service LLMs such as GPT4 and Claude3.

Language: Jupyter Notebook · License: Apache-2.0 · 19000