Zhenmei Shi (zhmeishi)

zhmeishi

Geek Repo

Company:Google

Location:Mountain View, CA

Home Page:zhmeishi.github.io

Github PK Tool:Github PK Tool

Zhenmei Shi's starred repositories

stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language:PythonLicense:MITStargazers:36888Issues:437Issues:288

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:18297Issues:155Issues:467

llama2.c

Inference Llama 2 in one file of pure C

mae

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

Language:PythonLicense:NOASSERTIONStargazers:6876Issues:58Issues:184

OLMo

Modeling, training, eval, and inference code for OLMo

Language:PythonLicense:Apache-2.0Stargazers:4107Issues:41Issues:157
Language:PythonLicense:MITStargazers:4050Issues:149Issues:33

LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Language:PythonLicense:Apache-2.0Stargazers:2504Issues:13Issues:167

helm

Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image models in Holistic Evaluation of Text-to-Image Models (HEIM) (https://arxiv.org/abs/2311.04287).

Language:PythonLicense:Apache-2.0Stargazers:1717Issues:33Issues:1027

requests-ip-rotator

A Python library to utilize AWS API Gateway's large IP pool as a proxy to generate pseudo-infinite IPs for web scraping and brute forcing.

Language:PythonLicense:GPL-3.0Stargazers:1247Issues:17Issues:59

LLMTest_NeedleInAHaystack

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:1167Issues:12Issues:25

CRATE

Code for CRATE (Coding RAte reduction TransformEr).

Language:PythonLicense:MITStargazers:1069Issues:20Issues:17

test

Measuring Massive Multitask Language Understanding | ICLR 2021

Language:PythonLicense:MITStargazers:1007Issues:20Issues:19

LongLM

[ICML'24] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Language:PythonLicense:MITStargazers:523Issues:9Issues:33

Long-Context-Data-Engineering

Implementation of paper Data Engineering for Scaling Language Models to 128K Context

ChunkLlama

[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"

Language:PythonLicense:Apache-2.0Stargazers:232Issues:8Issues:14

InfiniteBench

Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718

Language:PythonLicense:MITStargazers:200Issues:9Issues:14

clone-anonymous-github

Easily download anonymous Github repositories from https://anonymous.4open.science/ with a GUI interface

amago

a simple and scalable agent for training adaptive policies with sequence-based RL

Language:PythonLicense:MITStargazers:70Issues:1Issues:4
Language:PythonLicense:MITStargazers:56Issues:1Issues:2

grokking

unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"

Language:PythonLicense:MITStargazers:47Issues:4Issues:4
Language:Jupyter NotebookStargazers:36Issues:1Issues:0

decision-pretrained-transformer

Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learning.

grokking

Implementation of OpenAI's 'Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets' paper.

Language:PythonLicense:MITStargazers:18Issues:1Issues:2

NRA_tax_filing

最新税季 UW-Madison报税指南

Language:RLicense:MITStargazers:14Issues:1Issues:0

CoBSAT

Implementation and dataset for paper "Can MLLMs Perform Text-to-Image In-Context Learning?"

Language:Jupyter NotebookStargazers:13Issues:1Issues:0

k_conv_basis

Llama3 for Conv Basis

Language:PythonStargazers:1Issues:0Issues:0
Language:PythonStargazers:1Issues:1Issues:0
Language:Jupyter NotebookStargazers:1Issues:1Issues:0
Language:PythonStargazers:1Issues:1Issues:0