smj0's starred repositories

TimeDial

Temporal Commonsense Reasoning in Dialog

Stargazers: 69 | Issues: 0

TRAM-Benchmark

TRAM: Benchmarking Temporal Reasoning for Large Language Models (Findings of ACL 2024)

Language: Jupyter Notebook | License: MIT | Stargazers: 15 | Issues: 0

ToolTalk

Evaluating tool-augmented LLMs in conversation settings

Language: Python | License: MIT | Stargazers: 68 | Issues: 0

iBook

A collection of e-books

Stargazers: 2872 | Issues: 0

FollowBench

Code for "FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models (ACL 2024)"

Language: Python | License: Apache-2.0 | Stargazers: 57 | Issues: 0

CELLO

Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?" (AAAI 2024)

Language: Python | Stargazers: 33 | Issues: 0

gorilla

Gorilla: An API store for LLMs

Language: Python | License: Apache-2.0 | Stargazers: 10803 | Issues: 0

trl

Train transformer language models with reinforcement learning.

Language: Python | License: Apache-2.0 | Stargazers: 8731 | Issues: 0

awesome-large-audio-models

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

Stargazers: 458 | Issues: 0

Qwen-Audio

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Language: Python | License: NOASSERTION | Stargazers: 1241 | Issues: 0

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language: Python | License: Apache-2.0 | Stargazers: 1852 | Issues: 0

SFT_function_learning

Reference implementation for DPO (Direct Preference Optimization)

Language: Python | License: Apache-2.0 | Stargazers: 9 | Issues: 0

AIMasterDevelopers

find the masters, know the masters behind the major project

Stargazers: 18 | Issues: 0

arxiv2bib

Get a BibTeX entry from an arXiv id number, using the arxiv.org API.

Language: Python | Stargazers: 49 | Issues: 0

arxiv-ai-analysis

A visualization experience of AI/ML academic papers hosted on arXiv, for project work at the University of California, Berkeley MIDS program (W209, Data Visualization).

Language: HTML | License: MIT | Stargazers: 10 | Issues: 0

arxiv-public-datasets

A set of scripts to grab public datasets from resources related to arXiv

Language: Python | License: MIT | Stargazers: 380 | Issues: 0

arxiv-tools

Tools to bulk download arXiv data

Language: Python | License: Apache-2.0 | Stargazers: 115 | Issues: 0

SuperCLUE-Math6

SuperCLUE-Math6: Exploring a new generation of native Chinese multi-turn, multi-step mathematical reasoning datasets

Language: Python | Stargazers: 32 | Issues: 0

Math_Word_Problem_Collection

A collection of math word problem (MWP) works, including datasets, algorithms, and more.

Language: Python | Stargazers: 22 | Issues: 0

JARVIS

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Language: Python | License: MIT | Stargazers: 23341 | Issues: 0

temporal-llms

Materials for paper "Are Large Language Models Temporally Grounded?"

Language: Python | License: MIT | Stargazers: 8 | Issues: 0
Language: Python | Stargazers: 25 | Issues: 0

TimeLlama

The official repo of TimeLlama, an instruction-finetuned Llama2 series that improves complex temporal reasoning ability.

Language: Python | License: MIT | Stargazers: 30 | Issues: 0

flash-attention

Fast and memory-efficient exact attention

Language: Python | License: BSD-3-Clause | Stargazers: 11957 | Issues: 0

UltraFeedback

A large-scale, fine-grained, diverse preference dataset (and models).

Language: Python | License: MIT | Stargazers: 280 | Issues: 0

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language: Python | License: Apache-2.0 | Stargazers: 4196 | Issues: 0

protoqa-data

Dataset for protoqa ("family feud") data

License: CC-BY-4.0 | Stargazers: 30 | Issues: 0

auto-cot

Official implementation for "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned & more will be updated)

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1302 | Issues: 0

Awesome-LLMs-Evaluation-Papers

The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.

Stargazers: 641 | Issues: 0

data_tooling

Tools for managing datasets for governance and training.

Language: HTML | License: Apache-2.0 | Stargazers: 74 | Issues: 0