Peter Morgan's starred repositories
DeepSeek-MoE
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Look-into-MoEs
A Closer Look into Mixture-of-Experts in Large Language Models
refusal_direction
Code and results accompanying the paper "Refusal in Language Models Is Mediated by a Single Direction".
DeepSeek-Coder-V2
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
HiddenMambaAttn
Official PyTorch Implementation of "The Hidden Attention of Mamba Models"
samplernn-pytorch
PyTorch implementation of SampleRNN: An Unconditional End-to-End Neural Audio Generation Model
flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
scaling-with-vocab
📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies
distributed-kge-poplar
An end-user training and evaluation system for standard knowledge graph embedding models, developed to optimise for the WikiKG90Mv2 dataset
ThoughtSource
A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research group: https://samwald.info/
amazon-sagemaker-generativeai
Repository for training and deploying generative AI models, including text-to-text and text-to-image generation and a prompt engineering playground, using SageMaker Studio.
guidance-for-a-multi-tenant-generative-ai-gateway-with-cost-and-usage-tracking-on-aws
This Guidance demonstrates how to build an internal Software-as-a-Service (SaaS) platform that provides access to foundation models, like those available through Amazon Bedrock, to different business units or teams within your organization
RydbergGPT
Our LLM for Rydberg atom physics
zeroshot-classifier
Notebooks for training universal zero-shot classifiers on many different tasks
summer-school-transformers-2023
Course repository for the session "Hands-on Transformers: Fine-Tune your own BERT and GPT" of the Data Science Summer School 2023
NeMo-Skills
A pipeline to improve the skills of large language models