Jiaxin Wen (Jiaxin-Wen)

Jiaxin-Wen

Geek Repo

Company:Tsinghua University

Location:Beijing, China

Home Page:https://jiaxin-wen.github.io/

Github PK Tool:Github PK Tool


Organizations
thu-coai

Jiaxin Wen's starred repositories

ollama

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

leetcode-master

《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀

OpenHands

🙌 OpenHands: Code Less, Make More

Language:PythonLicense:MITStargazers:32162Issues:288Issues:1376

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:26332Issues:215Issues:242

aider

aider is AI pair programming in your terminal

Language:PythonLicense:Apache-2.0Stargazers:19289Issues:137Issues:1517

gitleaks

Protect and discover secrets using Gitleaks 🔑

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

llm-attacks

Universal and Transferable Attacks on Aligned Language Models

Language:PythonLicense:MITStargazers:3315Issues:34Issues:94

chinese-llm-benchmark

中文大模型能力评测榜单:目前已囊括115个大模型,覆盖chatgpt、gpt4o、百度文心一言、阿里通义千问、讯飞星火、商汤senseChat、minimax等商用模型, 以及百川、qwen2、glm4、yi、书生internLM2、llama3等开源大模型,多维度能力评测。不仅提供能力评分排行榜,也提供所有模型的原始输出结果!

Minigrid

Simple and easily configurable grid world environments for reinforcement learning

Language:PythonLicense:NOASSERTIONStargazers:2086Issues:39Issues:188

DeepSeek-Coder-V2

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

magicoder

[ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct

Language:PythonLicense:MITStargazers:1965Issues:26Issues:40

code2prompt

A CLI tool to convert your codebase into a single LLM prompt with source tree, prompt templating, and token counting.

Language:RustLicense:MITStargazers:1665Issues:11Issues:29

DeepLearing-Interview-Awesome-2024

AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目

alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1456Issues:7Issues:142

HALOs

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Language:PythonLicense:Apache-2.0Stargazers:707Issues:7Issues:20

babyai

BabyAI platform. A testbed for training agents to understand and execute language commands.

Language:PythonLicense:BSD-3-ClauseStargazers:689Issues:36Issues:46

quiet-star

Code for Quiet-STaR

Language:PythonLicense:Apache-2.0Stargazers:540Issues:13Issues:8

HarmBench

HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal

Language:Jupyter NotebookLicense:MITStargazers:285Issues:5Issues:47

Thought-Cloning

[NeurIPS '23 Spotlight] Thought Cloning: Learning to Think while Acting by Imitating Human Thinking

Language:PythonLicense:MITStargazers:249Issues:2Issues:0
Language:PythonLicense:Apache-2.0Stargazers:248Issues:8Issues:13

openlogprobs

Extract full next-token probabilities via language model APIs

bigcodebench

BigCodeBench: Benchmarking Code Generation Towards AGI

Language:PythonLicense:Apache-2.0Stargazers:187Issues:5Issues:35

intercode

[NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898

Language:PythonLicense:MITStargazers:184Issues:7Issues:17

Humback

🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.

Language:PythonLicense:Apache-2.0Stargazers:130Issues:3Issues:9

PPOCoder

Code for the TMLR 2023 paper "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning"

Language:PythonLicense:MITStargazers:94Issues:3Issues:10

llm_debate

Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"

Language:PythonLicense:MITStargazers:74Issues:4Issues:2

fneval

Functional Benchmarks and the Reasoning Gap

Language:TeXLicense:GPL-3.0Stargazers:73Issues:1Issues:8

CiteME

CiteME is a benchmark designed to test the abilities of language models in finding papers that are cited in scientific texts.

Language:PythonLicense:NOASSERTIONStargazers:35Issues:10Issues:0