Fangkai Jiao (SparkJiao)

Company: NTU-NLP & I2R, A*STAR, Singapore

Location: Singapore

Home Page: jiaofangkai.com

Fangkai Jiao's starred repositories

LLMSanitize

An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs).

Language: Python, Stargazers: 33, Issues: 0

SWE-bench

[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world Github Issues?

Language: Python, License: MIT, Stargazers: 1398, Issues: 0

OpenDevin

šŸš OpenDevin: Code Less, Make More

Language: Python, License: MIT, Stargazers: 27717, Issues: 0

Language: C++, License: Apache-2.0, Stargazers: 2033, Issues: 0

ao

Create and integrate high-performance custom data types, layouts and kernels with up to 2x speedups with 65% less VRAM for inference and support for training

Language: Python, License: BSD-3-Clause, Stargazers: 318, Issues: 0

apps

APPS: Automated Programming Progress Standard (NeurIPS 2021)

Language: Python, License: MIT, Stargazers: 370, Issues: 0

grok-1

Grok open release

Language: Python, License: Apache-2.0, Stargazers: 49102, Issues: 0

Language: Python, License: MIT, Stargazers: 3950, Issues: 0

DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Language: Python, License: MIT, Stargazers: 1826, Issues: 0

DenseSSM

A repository for DenseSSMs

Language: Python, Stargazers: 84, Issues: 0

Inflection-Benchmarks

Public Inflection Benchmarks

License: MIT, Stargazers: 67, Issues: 0

GaLore

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Language: Python, License: Apache-2.0, Stargazers: 1215, Issues: 0

llm-planning-eval

Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"

Language: Python, Stargazers: 41, Issues: 0

UNK-VQA

A VQA dataset that includes unanswerable questions.

License: Apache-2.0, Stargazers: 1, Issues: 0

RLMEC

The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"

Language: Python, Stargazers: 8, Issues: 0

ChunkLlama

[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"

Language: Python, License: Apache-2.0, Stargazers: 250, Issues: 0

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language: Python, License: BSD-3-Clause, Stargazers: 5302, Issues: 0

zero-bubble-pipeline-parallelism

Zero Bubble Pipeline Parallelism

Language: Python, License: NOASSERTION, Stargazers: 209, Issues: 0

dpo-trajectory-reasoning

Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".

Language: Python, Stargazers: 12, Issues: 0

nanotron

Minimalistic large language model 3D-parallelism training

Language: Python, License: Apache-2.0, Stargazers: 921, Issues: 0

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language: Python, License: Apache-2.0, Stargazers: 5092, Issues: 0

Language: Python, License: Apache-2.0, Stargazers: 81, Issues: 0

Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Language: Shell, Stargazers: 5392, Issues: 0

dspy

DSPy: The framework for programming, not prompting, foundation models

Language: Python, License: MIT, Stargazers: 13420, Issues: 0

SeaEval

NAACL 2024: SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning

Language: Python, License: NOASSERTION, Stargazers: 17, Issues: 0

OLMo

Modeling, training, eval, and inference code for OLMo

Language: Python, License: Apache-2.0, Stargazers: 4154, Issues: 0

mergekit

Tools for merging pretrained large language models.

Language: Python, License: LGPL-3.0, Stargazers: 3911, Issues: 0

HallusionBench

[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

Language: Python, License: BSD-3-Clause, Stargazers: 194, Issues: 0

SimulateBench

GPT as Human

Language: Python, Stargazers: 17, Issues: 0

DeepSeek-MoE

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Language: Python, License: MIT, Stargazers: 907, Issues: 0