Beast code in Giters

A visualization experience of AI/ML academic papers hosted on ArXiV - for project work at the University of California, Berkeley MIDS program (W209, Data Visualization).

Language:HTMLMIT1000

arxiv-public-datasets

A set of scripts to grab public datasets from resources related to arXiv

Language:PythonMIT38000

arxiv-tools

Tools to bulk download arxiv data

Language:PythonApache-2.011500

SuperCLUE-Math6

SuperCLUE-Math6：新一代中文原生多轮多步数学推理数据集的探索之旅

Language:Python3200

Math_Word_Problem_Collection

A collection for math word problem (MWP) works, including datasets, algorithms and so on.

Language:Python2200

JARVIS

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Language:PythonMIT2334100

temporal-llms

Materials for paper "Are Large Language Models Temporally Grounded?"

Language:PythonMIT800

TempReason

Language:Python2500

TimeLlama

The official repo of TimeLlama, an instruction-finetuned Llama2 series that improve complex temporal reasoning ability.

Language:PythonMIT3000

flash-attention

Fast and memory-efficient exact attention

Language:PythonBSD-3-Clause1195700

UltraFeedback

A large-scale, fine-grained, diverse preference dataset (and models).

Language:PythonMIT28000

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language:PythonApache-2.0419600

protoqa-data

Dataset for protoqa ("family feud") data

CC-BY-4.03000

auto-cot

Official implementation for "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned & more will be updated)

Language:Jupyter NotebookApache-2.0130200

Awesome-LLMs-Evaluation-Papers

The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.

64100

data_tooling

Tools for managing datasets for governance and training.

Language:HTMLApache-2.07400