Men Tianyi (Quester-one)

Company: XDU CASIA

Location: Beijing, China

Home Page: https://quester-one.github.io/

Men Tianyi's repositories

abstract-state-seqmodel

Code for EMNLP 2023 paper "Emergence of Abstract State Representations in Embodied Sequence Modeling"

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

agent-attack

[arXiv 2024] Adversarial Attacks on Multimodal Agents

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

Agent-Smith

[ICML 2024] Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0
Language: HTML | Stargazers: 0 | Issues: 0

AutoDroid

Source code for the paper "Empowering LLM to use Smartphone for Intelligent Task Automation"

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

babyai

BabyAI platform. A testbed for training agents to understand and execute language commands.

Language: Python | License: BSD-3-Clause | Stargazers: 0 | Issues: 0

CogVLM

A state-of-the-art open visual language model | multimodal pretrained model

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0

gpt_academic

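Provides a graphical interactive interface for ChatGPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins, project analysis and self-explanation for Python, C++, and other projects, PDF/LaTeX paper translation and summarization, parallel queries to multiple LLM models, and local models such as Tsinghua's chatglm2. Compatible with Fudan MOSS, llama, rwkv, newbing, claude, claude2, and others.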

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 0

gym-cooking

gym-cooking: Code for "Too many cooks: Bayesian inference for coordinating multi-agent collaboration", Winner of the CogSci 2020 Computational Modeling Prize in High Cognition, and a NeurIPS 2020 CoopAI Workshop Best Paper.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 0

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

R-Judge

R-Judge: Benchmarking Safety Risk Awareness for LLM Agents

Stargazers: 0 | Issues: 0

SmartPlay

SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. SmartPlay is designed to be easy to use, and to support future development of LLMs.

Language: Python | License: CC-BY-4.0 | Stargazers: 0 | Issues: 0

Synapse

Trajectory-as-Exemplar Prompting with Memory for Computer Control

Language: HTML | License: MIT | Stargazers: 0 | Issues: 0

gym

A toolkit for developing and comparing reinforcement learning algorithms.

License: NOASSERTION | Stargazers: 0 | Issues: 0

gym-minigrid

Minimalistic gridworld package for OpenAI Gym

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

label-words-are-anchors

Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning

License: MIT | Stargazers: 0 | Issues: 0

llm-reasoners

A library for advanced large language model reasoning

License: Apache-2.0 | Stargazers: 0 | Issues: 0

llm-transparency-tool

LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing the internal workings of Transformer-based language models. Check out the demo at https://huggingface.co/spaces/facebook/llm-transparency-tool-demo

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0

lm-arithmetic

Code for the paper "A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis"

License: MIT | Stargazers: 0 | Issues: 0

othello_world

Emergent world representations: Exploring a sequence model trained on a synthetic task

License: MIT | Stargazers: 0 | Issues: 0

pyvene

Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

toolbench

ToolBench, an evaluation suite for LLM tool manipulation capabilities.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

ToRA

ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0
License: MIT | Stargazers: 0 | Issues: 0

tree-of-thought-llm

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

webarena

Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

WebShop

[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents

Language: Python | License: MIT | Stargazers: 0 | Issues: 0