ColdFusion2001

ColdFusion2001

Geek Repo

0

followers

0

following

Github PK Tool:Github PK Tool

ColdFusion2001's starred repositories

RAG_Techniques

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2169Issues:0Issues:0

awesome-kan

A comprehensive collection of KAN(Kolmogorov-Arnold Network)-related resources, including libraries, projects, tutorials, papers, and more, for researchers and developers in the Kolmogorov-Arnold Network field.

Stargazers:2069Issues:0Issues:0

doremi

Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets

Language:HTMLLicense:MITStargazers:282Issues:0Issues:0
Language:Jupyter NotebookStargazers:630Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:9251Issues:0Issues:0

ZeroEval

A simple unified framework for evaluating LLMs

Language:PythonLicense:Apache-2.0Stargazers:111Issues:0Issues:0
Language:PythonStargazers:104Issues:0Issues:0

Intra-Fusion

Towards Meta-Pruning via Optimal Transport, ICLR 2024 (Spotlight)

Language:PythonLicense:MITStargazers:10Issues:0Issues:0

meta-weight-net

NeurIPS'19: Meta-Weight-Net: Learning an Explicit Mapping For Sample Weighting (Pytorch implementation for noisy labels).

Language:PythonLicense:MITStargazers:280Issues:0Issues:0

distilabel

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Language:PythonLicense:Apache-2.0Stargazers:1294Issues:0Issues:0

matmulfreellm

Implementation for MatMul-free LM.

Language:PythonLicense:Apache-2.0Stargazers:2825Issues:0Issues:0

llmtools

Finetuning Large Language Models on One Consumer GPU in Under 4 Bits

Language:PythonStargazers:691Issues:0Issues:0

complexity-scaling

gzip Predicts Data-dependent Scaling Laws

Language:PythonLicense:MITStargazers:31Issues:0Issues:0

FlashRAG

⚡FlashRAG: A Python Toolkit for Efficient RAG Research

Language:PythonLicense:MITStargazers:1029Issues:0Issues:0

modula

Scalable neural net training via automatic normalization in the modular norm.

Language:Jupyter NotebookLicense:MITStargazers:101Issues:0Issues:0

logix

AI Logging for Interpretability and Explainability🔬

Language:PythonLicense:Apache-2.0Stargazers:69Issues:0Issues:0

LOMO

LOMO: LOw-Memory Optimization

Language:PythonLicense:MITStargazers:964Issues:0Issues:0

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Language:PythonLicense:Apache-2.0Stargazers:1915Issues:0Issues:0

model-explorer

A modern model graph visualizer and debugger

Language:JavaScriptLicense:Apache-2.0Stargazers:963Issues:0Issues:0

zett

Code for Zero-Shot Tokenizer Transfer

Language:PythonStargazers:107Issues:0Issues:0

arc-dsl

Domain Specific Language for the Abstraction and Reasoning Corpus

Language:PythonLicense:MITStargazers:131Issues:0Issues:0

KAN-GPT-2

Training small GPT-2 style models using Kolmogorov-Arnold networks.

Language:PythonStargazers:100Issues:0Issues:0

kan-gpt

The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling

Language:PythonLicense:MITStargazers:678Issues:0Issues:0

matryoshka-representation-learning

PyTorch implementation for MRL

Language:PythonStargazers:16Issues:0Issues:0

energy-transformer-torch

Official Implementation of Energy Transformer in PyTorch for Mask Image Reconstruction

Language:PythonLicense:MITStargazers:17Issues:0Issues:0

NeMo-Aligner

Scalable toolkit for efficient model alignment

Language:PythonLicense:Apache-2.0Stargazers:484Issues:0Issues:0

pyreft

ReFT: Representation Finetuning for Language Models

Language:PythonLicense:Apache-2.0Stargazers:1031Issues:0Issues:0

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookLicense:MITStargazers:14185Issues:0Issues:0

MiniMoE

Code for ACL 2023 paper titled "Lifting the Curse of Capacity Gap in Distilling Language Models"

Language:PythonLicense:Apache-2.0Stargazers:28Issues:0Issues:0

llamaduo

This project showcases an LLMOps pipeline that fine-tunes a small-size LLM model to prepare for the outage of the service LLM. For this project, we have initially chosen Gemini 1.0 Pro for service type LLM and Gemma 2B/7B for small sized LLM model. It now supports other service LLMs such as GPT4 and Claude3.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:190Issues:0Issues:0