Hao Zhao (MarcellusZhao)

Company: École Polytechnique Fédérale de Lausanne

Location: Lausanne, Switzerland

Home Page: https://marcelluszhao.github.io/

Hao Zhao's starred repositories

just-eval

A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.

Language: Python | License: MIT | Stargazers: 74 | Issues: 0

GPT-Fathom

GPT-Fathom is an open-source and reproducible LLM evaluation suite, benchmarking 10+ leading open-source and closed-source LLMs as well as OpenAI's earlier models on 20+ curated benchmarks under aligned settings.

Language: Python | License: MIT | Stargazers: 351 | Issues: 0

open_llama

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

License: Apache-2.0 | Stargazers: 7360 | Issues: 0

NEFTune

Official repository of NEFTune: Noisy Embeddings Improve Instruction Finetuning

Language: Python | License: MIT | Stargazers: 375 | Issues: 0

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Language: Python | License: Apache-2.0 | Stargazers: 133155 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 1750 | Issues: 0

BIG-bench

Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

Language: Python | License: Apache-2.0 | Stargazers: 2839 | Issues: 0

llm-foundry

LLM training code for Databricks foundation models

Language: Python | License: Apache-2.0 | Stargazers: 4000 | Issues: 0

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language: Python | License: Apache-2.0 | Stargazers: 2073 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 1218 | Issues: 0

mistral-inference

Official inference library for Mistral models

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 9593 | Issues: 0

why-weight-decay

Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024]

Language: Python | License: NOASSERTION | Stargazers: 48 | Issues: 0

AdaLoRA

AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023).

Language: Python | License: MIT | Stargazers: 261 | Issues: 0

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Language: Python | License: NOASSERTION | Stargazers: 8460 | Issues: 0

sam-low-rank-features

Sharpness-Aware Minimization Leads to Low-Rank Features [NeurIPS 2023]

Language: Jupyter Notebook | Stargazers: 24 | Issues: 0

instagraph

Converts text input or a URL into a knowledge graph and displays it

Language: Python | License: MIT | Stargazers: 3456 | Issues: 0

Flash-Attention-Softmax-N

CUDA and Triton implementations of Flash Attention with SoftmaxN.

Language: Python | License: GPL-3.0 | Stargazers: 66 | Issues: 0

DomainBed

DomainBed is a suite to test domain generalization algorithms

Language: Python | License: MIT | Stargazers: 1388 | Issues: 0

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language: Python | License: Apache-2.0 | Stargazers: 7732 | Issues: 0

open-interpreter

A natural language interface for computers

Language: Python | License: AGPL-3.0 | Stargazers: 52539 | Issues: 0

Language: Python | License: CC0-1.0 | Stargazers: 25 | Issues: 0

adapters

A Unified Library for Parameter-Efficient and Modular Transfer Learning

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2545 | Issues: 0

awesome_lists

Awesome Lists for Tenure-Track Assistant Professors and PhD students. (Survival guide for assistant professors and PhD students)

Language: Python | License: MIT | Stargazers: 1424 | Issues: 0

awesome-obsidian

🕶️ Awesome stuff for Obsidian

Language: CSS | License: CC0-1.0 | Stargazers: 6714 | Issues: 0

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python | License: MIT | Stargazers: 36621 | Issues: 0

loss-landscapes

Approximating neural network loss landscapes in low-dimensional parameter subspaces for PyTorch

Language: Python | License: MIT | Stargazers: 296 | Issues: 0

AlpacaDataCleaned

Alpaca dataset from Stanford, cleaned and curated

Language: Python | License: Apache-2.0 | Stargazers: 1503 | Issues: 0

awesome-source-free-test-time-adaptation

A curated list of papers in Test-time Adaptation, Test-time Training and Source-free Domain Adaptation

Stargazers: 458 | Issues: 0

Language: Jupyter Notebook | License: MIT | Stargazers: 3927 | Issues: 0

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language: Python | License: MIT | Stargazers: 6648 | Issues: 0