iwangjian

followers

following

stars

PolyU

Hong Kong

https://iwangjian.github.io

Organizations

polyunlp

Jian Wang's starred repositories

AutoAct

[ACL 2024] AUTOACT: Automatic Agent Learning from Scratch for QA via Self-Planning

Language:PythonApache-2.012300

SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Language:PythonApache-2.083500

psi

Platform for Situated Intelligence

Language:C#NOASSERTION52300

lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Language:PythonApache-2.0189900

SALMON

Self-Alignment with Principle-Following Reward Models

Language:PythonGPL-3.012700

S-LoRA

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Language:PythonApache-2.0153200

HiFT

memory-efficient fine-tuning; support 24G GPU memory fine-tuning 7B

Language:PythonApache-2.01300

repoqa

RepoQA: Evaluating Long-Context Code Understanding

Language:PythonApache-2.08100

llm-transparency-tool

LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing internal workings of Transformer-based language models. *Check out demo at* https://huggingface.co/spaces/facebook/llm-transparency-tool-demo

Language:PythonNOASSERTION65000

pyreft

ReFT: Representation Finetuning for Language Models

Language:PythonApache-2.077100

EvoCodeBench

An Evolving Code Generation Benchmark Aligned with Real-world Code Repositories

Language:PythonApache-2.02200

MineLand

Simulating Large-Scale Multi-Agent Interactions with Limited Multimodal Senses and Physical Needs

Language:PythonMIT3700

gpt-prompt-engineer

Language:Jupyter NotebookMIT814100

sotopia

Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)

Language:PythonMIT11400

sotopia-pi

Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)

Language:PythonApache-2.03800

transformer-debugger

Language:PythonMIT387600

navchat

Code for ICRA24 paper "Think, Act, and Ask: Open-World Interactive Personalized Robot Navigation" https://arxiv.org/abs/2310.07968

Language:PythonMIT1400

ExpeL

Language:PythonApache-2.04600

ETO

Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)

Language:Python5900

NLP-Movie_Scripts

Trying to predict a movie's success based on the script (before filming)

Language:Jupyter Notebook2800

trainable-agents

Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"

Language:PythonApache-2.035800

RoleLLM-public

RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models

ArCHer

Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"

Language:Python6700

LWM

Language:PythonApache-2.0689900

Awesome-LLM-Interpretability

A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..

6300

navchat

Code for ICRA24 paper "Think, Act, and Ask: Open-World Interactive Personalized Robot Navigation"

Language:PythonMIT200

AgentBoard

An Analytical Evaluation Board of Multi-turn LLM Agents

Language:SAS19400

unsloth

Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonApache-2.01031600

SpeculativeDecodingPapers

📰 Must-read papers and blogs on Speculative Decoding ⚡️

Apache-2.018000

mistral-inference

Official inference library for Mistral models

Language:Jupyter NotebookApache-2.0880800