Jian Wang (iwangjian)

iwangjian

Geek Repo

Company:PolyU

Location:Hong Kong

Home Page:https://iwangjian.github.io

Github PK Tool:Github PK Tool


Organizations
polyunlp

Jian Wang's starred repositories

AutoAct

[ACL 2024] AUTOACT: Automatic Agent Learning from Scratch for QA via Self-Planning

Language:PythonLicense:Apache-2.0Stargazers:123Issues:0Issues:0

SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Language:PythonLicense:Apache-2.0Stargazers:835Issues:0Issues:0

psi

Platform for Situated Intelligence

Language:C#License:NOASSERTIONStargazers:523Issues:0Issues:0

lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Language:PythonLicense:Apache-2.0Stargazers:1899Issues:0Issues:0

SALMON

Self-Alignment with Principle-Following Reward Models

Language:PythonLicense:GPL-3.0Stargazers:127Issues:0Issues:0

S-LoRA

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Language:PythonLicense:Apache-2.0Stargazers:1532Issues:0Issues:0

HiFT

memory-efficient fine-tuning; support 24G GPU memory fine-tuning 7B

Language:PythonLicense:Apache-2.0Stargazers:13Issues:0Issues:0

repoqa

RepoQA: Evaluating Long-Context Code Understanding

Language:PythonLicense:Apache-2.0Stargazers:81Issues:0Issues:0

llm-transparency-tool

LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing internal workings of Transformer-based language models. *Check out demo at* https://huggingface.co/spaces/facebook/llm-transparency-tool-demo

Language:PythonLicense:NOASSERTIONStargazers:650Issues:0Issues:0

pyreft

ReFT: Representation Finetuning for Language Models

Language:PythonLicense:Apache-2.0Stargazers:771Issues:0Issues:0

EvoCodeBench

An Evolving Code Generation Benchmark Aligned with Real-world Code Repositories

Language:PythonLicense:Apache-2.0Stargazers:22Issues:0Issues:0

MineLand

Simulating Large-Scale Multi-Agent Interactions with Limited Multimodal Senses and Physical Needs

Language:PythonLicense:MITStargazers:37Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:8141Issues:0Issues:0

sotopia

Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)

Language:PythonLicense:MITStargazers:114Issues:0Issues:0

sotopia-pi

Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:38Issues:0Issues:0
Language:PythonLicense:MITStargazers:3876Issues:0Issues:0

navchat

Code for ICRA24 paper "Think, Act, and Ask: Open-World Interactive Personalized Robot Navigation" https://arxiv.org/abs/2310.07968

Language:PythonLicense:MITStargazers:14Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:46Issues:0Issues:0

ETO

Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)

Language:PythonStargazers:59Issues:0Issues:0

NLP-Movie_Scripts

Trying to predict a movie's success based on the script (before filming)

Language:Jupyter NotebookStargazers:28Issues:0Issues:0

trainable-agents

Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"

Language:PythonLicense:Apache-2.0Stargazers:358Issues:0Issues:0

RoleLLM-public

RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models

Stargazers:405Issues:0Issues:0

ArCHer

Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"

Language:PythonStargazers:67Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:6899Issues:0Issues:0

Awesome-LLM-Interpretability

A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..

Stargazers:63Issues:0Issues:0

navchat

Code for ICRA24 paper "Think, Act, and Ask: Open-World Interactive Personalized Robot Navigation"

Language:PythonLicense:MITStargazers:2Issues:0Issues:0

AgentBoard

An Analytical Evaluation Board of Multi-turn LLM Agents

Language:SASStargazers:194Issues:0Issues:0

unsloth

Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonLicense:Apache-2.0Stargazers:10316Issues:0Issues:0

SpeculativeDecodingPapers

📰 Must-read papers and blogs on Speculative Decoding ⚡️

License:Apache-2.0Stargazers:180Issues:0Issues:0

mistral-inference

Official inference library for Mistral models

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:8808Issues:0Issues:0