Peter Morgan (m-pedro)

Company: AI, Machine Learning, Quantum Computing

Location: London

Home Page: www.deeplp.com

Twitter: @PMZepto

Peter Morgan's starred repositories

DeepSeek-MoE

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Language: Python | License: MIT | Stargazers: 928 | Issues: 0

Look-into-MoEs

A Closer Look into Mixture-of-Experts in Large Language Models

Language: Python | License: MIT | Stargazers: 29 | Issues: 0

Step-DPO

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

Language: Python | Stargazers: 169 | Issues: 0

lbt

Can LLMs Learn by Teaching? A Preliminary Study

Language: Python | License: MIT | Stargazers: 19 | Issues: 0

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 137 | Issues: 0

mergekit

Tools for merging pretrained large language models.

License: LGPL-3.0 | Stargazers: 19 | Issues: 0

refusal_direction

Code and results accompanying the paper "Refusal in Language Models Is Mediated by a Single Direction".

Language: Python | License: Apache-2.0 | Stargazers: 50 | Issues: 0

DeepSeek-Coder-V2

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

License: MIT | Stargazers: 1429 | Issues: 0

dclm

DataComp for Language Models

Language: HTML | License: MIT | Stargazers: 552 | Issues: 0

HiddenMambaAttn

Official PyTorch Implementation of "The Hidden Attention of Mamba Models"

Language: Python | Stargazers: 175 | Issues: 0

samplernn-pytorch

PyTorch implementation of SampleRNN: An Unconditional End-to-End Neural Audio Generation Model

Language: Python | License: MIT | Stargazers: 284 | Issues: 0

flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton

Language: Python | License: MIT | Stargazers: 767 | Issues: 0

Samba

Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"

Language: Python | License: MIT | Stargazers: 713 | Issues: 0

mdlm

Simplified Masked Diffusion Language Model

Language: Python | License: Apache-2.0 | Stargazers: 121 | Issues: 0

scaling-with-vocab

📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

Language: Python | Stargazers: 30 | Issues: 0

unsloth

Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language: Python | License: Apache-2.0 | Stargazers: 12933 | Issues: 0

Language: HTML | Stargazers: 39 | Issues: 0

magpie

Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!

Language: Python | License: MIT | Stargazers: 233 | Issues: 0

Language: Python | License: MIT-0 | Stargazers: 205 | Issues: 0

distributed-kge-poplar

The application is an end-user training and evaluation system for standard knowledge graph embedding models. It was developed to optimise the WikiKG90Mv2 dataset.

Language: C++ | License: MIT | Stargazers: 13 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 4409 | Issues: 0

ThoughtSource

A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research group: https://samwald.info/

Language: Jupyter Notebook | License: MIT | Stargazers: 856 | Issues: 0

amazon-sagemaker-generativeai

Repository for training and deploying Generative AI models, including text-text, text-to-image generation and prompt engineering playground using SageMaker Studio.

Language: Jupyter Notebook | License: MIT-0 | Stargazers: 121 | Issues: 0

guidance-for-a-multi-tenant-generative-ai-gateway-with-cost-and-usage-tracking-on-aws

This Guidance demonstrates how to build an internal Software-as-a-Service (SaaS) platform that provides access to foundation models, like those available through Amazon Bedrock, to different business units or teams within your organization.

Language: Jupyter Notebook | License: MIT-0 | Stargazers: 50 | Issues: 0

Language: Python | Stargazers: 15 | Issues: 0

RydbergGPT

Our LLM for Rydberg atom physics

Language: Python | License: Apache-2.0 | Stargazers: 30 | Issues: 0

zeroshot-classifier

Notebooks for training universal 0-shot classifiers on many different tasks

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 98 | Issues: 0

summer-school-transformers-2023

Course repository for the session "Hands-on Transformers: Fine-Tune your own BERT and GPT" of the Data Science Summer School 2023

Language: Jupyter Notebook | Stargazers: 82 | Issues: 0

Language: Python | Stargazers: 948 | Issues: 0

NeMo-Skills

A pipeline to improve skills of large language models

Language: Python | License: Apache-2.0 | Stargazers: 131 | Issues: 0