Yiheng Shu (yhshu)

Company: The Ohio State University

Location: Columbus, OH

Home Page: https://yihengshu.github.io

Twitter: @YihengShu


Organizations
NEUP-Net-Depart

Yiheng Shu's starred repositories

factoid-wiki

Dense X Retrieval: What Retrieval Granularity Should We Use?

License: Apache-2.0 | Stargazers: 107 | Issues: 0
Language: Python | Stargazers: 6 | Issues: 0

gemma.cpp

lightweight, standalone C++ inference engine for Google's Gemma models.

Language: C++ | License: Apache-2.0 | Stargazers: 5668 | Issues: 0

LLM4Chem

Official code repo for the paper "LlaSMol: Advancing Large Language Models for Chemistry with a Large-Scale, Comprehensive, High-Quality Instruction Tuning Dataset"

Language: Python | License: MIT | Stargazers: 47 | Issues: 0

ircot

Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23

Language: Jsonnet | License: Apache-2.0 | Stargazers: 129 | Issues: 0

ret-robust

Implementation of the paper: "Making Retrieval-Augmented Language Models Robust to Irrelevant Context"

Language: Python | License: MIT | Stargazers: 49 | Issues: 0

self-ask

Code and data for "Measuring and Narrowing the Compositionality Gap in Language Models"

Language: Jupyter Notebook | License: MIT | Stargazers: 286 | Issues: 0
Language: Python | Stargazers: 2 | Issues: 0

SeeAct

[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

Language: Python | License: NOASSERTION | Stargazers: 521 | Issues: 0

Semantic-Retrieval-Models

A curated list of awesome papers for Semantic Retrieval (TOIS Accepted: Semantic Models for the First-stage Retrieval: A Comprehensive Review).

Stargazers: 308 | Issues: 0

FuncReAct

A ReAct type thought framework written using OpenAI function calling. Has the ability to BYOA (bring your own actions)

Language: Python | Stargazers: 14 | Issues: 0

parallelformers

Parallelformers: An Efficient Model Parallelization Toolkit for Deployment

Language: Python | License: Apache-2.0 | Stargazers: 763 | Issues: 0
Language: Python | License: MIT | Stargazers: 148 | Issues: 0

LongMem

Official implementation of our NeurIPS 2023 paper "Augmenting Language Models with Long-Term Memory".

Language: Python | License: Apache-2.0 | Stargazers: 739 | Issues: 0

InfiniteBench

Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718

Language: Python | License: MIT | Stargazers: 206 | Issues: 0

ragas

Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

Language: Python | License: Apache-2.0 | Stargazers: 5481 | Issues: 0

RESDSQL

The PyTorch implementation of RESDSQL (AAAI 2023).

Language: Python | License: MIT | Stargazers: 228 | Issues: 0

danker

Compute PageRank on >3 billion Wikipedia links on off-the-shelf hardware.

Language: Python | License: GPL-3.0 | Stargazers: 53 | Issues: 0

stanford-openie-python

Stanford Open Information Extraction made simple!

Language: Python | License: ISC | Stargazers: 623 | Issues: 0

thefuzz

Fuzzy String Matching in Python

Language: Python | License: MIT | Stargazers: 2596 | Issues: 0

FriendsDontLetFriends

Friends don't let friends make certain types of data visualization - What are they and why are they bad.

Language: R | License: MIT | Stargazers: 6155 | Issues: 0

PPODtottl

A script to convert a Google sheet containing PPOD data into an RDF turtle file

Language: Jupyter Notebook | Stargazers: 2 | Issues: 0

TableLlama

[NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables".

Language: Python | License: MIT | Stargazers: 94 | Issues: 0

sub-sentence-encoder

The official code repo for "Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations".

Language: Python | Stargazers: 72 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 355 | Issues: 0

ppod

An ontology describing the relationships between persons, projects, organizations, and datasets.

Stargazers: 1 | Issues: 0

OpenBookQA

Code for experiments on OpenBookQA from the EMNLP 2018 paper "Can a Suit of Armor Conduct Electricity? A New Dataset for Open Book Question Answering"

Language: Python | License: Apache-2.0 | Stargazers: 115 | Issues: 0

generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

License: Apache-2.0 | Stargazers: 15689 | Issues: 0

awesome-openai-vision-api-experiments

Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥

Language: Python | Stargazers: 1600 | Issues: 0