XuexII

XuexII

Geek Repo

0

followers

0

following

Github PK Tool:Github PK Tool

XuexII's starred repositories

promptsource

Toolkit for creating, sharing and using natural language prompts.

Language:PythonLicense:Apache-2.0Stargazers:2628Issues:0Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:30040Issues:0Issues:0

sglang

SGLang is yet another fast serving framework for large language models and vision language models.

Language:PythonLicense:Apache-2.0Stargazers:4260Issues:0Issues:0

text-to-text-transfer-transformer

Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

Language:PythonLicense:Apache-2.0Stargazers:6077Issues:0Issues:0

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language:PythonLicense:Apache-2.0Stargazers:1872Issues:0Issues:0

dclm

DataComp for Language Models

Language:HTMLLicense:MITStargazers:1064Issues:0Issues:0

mem0

The memory layer for Personalized AI

Language:PythonLicense:Apache-2.0Stargazers:19903Issues:0Issues:0

LESS

[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning

Language:Jupyter NotebookLicense:MITStargazers:316Issues:0Issues:0

reasoning-teacher

Official code for "Large Language Models Are Reasoning Teachers", ACL 2023

Language:Jupyter NotebookLicense:MITStargazers:304Issues:0Issues:0

LMOps

General technology for enabling AI capabilities w/ LLMs and MLLMs

Language:PythonLicense:MITStargazers:3506Issues:0Issues:0
Language:PythonStargazers:35Issues:0Issues:0

sqllineage

SQL Lineage Analysis Tool powered by Python

Language:PythonLicense:MITStargazers:1258Issues:0Issues:0
Language:PythonLicense:MITStargazers:108Issues:0Issues:0

mm-cot

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

Language:PythonLicense:Apache-2.0Stargazers:3732Issues:0Issues:0

conformer

[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

Language:PythonLicense:Apache-2.0Stargazers:929Issues:0Issues:0

GenAI-System-2-Attention-S2A-by-Meta

datasets from the paper "Towards Understanding Sycophancy in Language Models"

Stargazers:1Issues:0Issues:0

auto-cot

Official implementation for "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned & more will be updated)

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1376Issues:0Issues:0

SymbCoT

Codes and Data for ACL 2024 Paper "Faithful Logical Reasoning via Symbolic Chain-of-Thought".

Language:PythonLicense:MITStargazers:137Issues:0Issues:0
Language:PythonLicense:MITStargazers:4418Issues:0Issues:0
Language:RustLicense:Apache-2.0Stargazers:1078Issues:0Issues:0

Platypus

Code for fine-tuning Platypus fam LLMs using LoRA

Language:PythonStargazers:625Issues:0Issues:0

cc_net

Tools to download and cleanup Common Crawl data

Language:PythonLicense:MITStargazers:950Issues:0Issues:0

alpagasus

Unofficial implementation of AlpaGasus

Language:PythonStargazers:82Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1145Issues:0Issues:0

SimPO

SimPO: Simple Preference Optimization with a Reference-Free Reward

Language:PythonLicense:MITStargazers:606Issues:0Issues:0

perspectiveapi

Perspective is an API that uses machine learning models to score the perceived impact a comment might have on a conversation. See https://developers.perspectiveapi.com for more information.

License:Apache-2.0Stargazers:881Issues:0Issues:0

AlpacaDataCleaned

Alpaca dataset from Stanford, cleaned and curated

Language:PythonLicense:Apache-2.0Stargazers:1477Issues:0Issues:0

DPR

Dense Passage Retriever - is a set of tools and models for open domain Q&A task.

Language:PythonLicense:NOASSERTIONStargazers:1683Issues:0Issues:0

orpo

Official repository for ORPO

Language:PythonLicense:Apache-2.0Stargazers:396Issues:0Issues:0

AdvancedLiterateMachinery

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

Language:C++License:Apache-2.0Stargazers:1279Issues:0Issues:0