Yingfei (Jeremy) Xiang (SuperXiang)

SuperXiang


Company: Baidu, Inc.

Location: Shenzhen, Guangdong, China

Home Page: https://scholar.google.com/citations?user=7n2td58AAAAJ

Twitter: @YingfeiX


Yingfei (Jeremy) Xiang's repositories


Cherry_LLM

[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models

Language: Python · Stargazers: 1 · Issues: 0

CS-Eval

CS-Eval is a comprehensive evaluation suite for assessing the cybersecurity capabilities of foundation models and large language models.

License: MIT · Stargazers: 1 · Issues: 0

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ 🍸 🍹 🍷

License: Apache-2.0 · Stargazers: 1 · Issues: 0

dclm

DataComp for Language Models

Language: HTML · License: MIT · Stargazers: 1 · Issues: 0

deepeval

The LLM Evaluation Framework

Language: Python · License: Apache-2.0 · Stargazers: 1 · Issues: 0

devika

Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.

Language: Python · License: MIT · Stargazers: 1 · Issues: 0

distillm

Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)

Stargazers: 1 · Issues: 0

dsir

DSIR: a large-scale data selection framework for language model training

Language: Python · License: MIT · Stargazers: 1 · Issues: 0

EasyContext

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

License: Apache-2.0 · Stargazers: 1 · Issues: 0

GPU-Benchmarks-on-LLM-Inference

Multiple NVIDIA GPUs or Apple Silicon for Large Language Model Inference?

Language: Jupyter Notebook · Stargazers: 1 · Issues: 0

MambaInLlama

Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models

Language: Python · License: Apache-2.0 · Stargazers: 1 · Issues: 0

Minitron

A family of compressed models obtained via pruning and knowledge distillation

Stargazers: 1 · Issues: 0

ollama

Get up and running with Llama 2, Mistral, and other large language models.

Language: Go · License: MIT · Stargazers: 1 · Issues: 0

OpenDevin

🐚 OpenDevin: Code Less, Make More

License: MIT · Stargazers: 1 · Issues: 0

opro

Official code for "Large Language Models as Optimizers"

Language: Python · License: Apache-2.0 · Stargazers: 1 · Issues: 0

orpo

Official repository for ORPO

License: Apache-2.0 · Stargazers: 1 · Issues: 0

PentestGPT

A GPT-empowered penetration testing tool

Language: Python · License: MIT · Stargazers: 1 · Issues: 0

persona-hub

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Language: Python · Stargazers: 1 · Issues: 0

Qwen1.5

Qwen1.5 is the improved version of Qwen, the large language model series developed by the Qwen team at Alibaba Cloud.

Language: Shell · Stargazers: 1 · Issues: 0

simple-one-api

An OpenAI-compatible API adapter supporting Baidu Qianfan, iFlytek Spark, Tencent Hunyuan, MiniMax, DeepSeek, and other OpenAI-compatible interfaces. Ships as a single executable with extremely simple configuration, one-click deployment, and out-of-the-box usability.

Stargazers: 1 · Issues: 0

small-LMs-Task-Planning

Can only LLMs do Reasoning?: Potential of Small Language Models in Task Planning

Language: Jupyter Notebook · Stargazers: 1 · Issues: 0

InternLM

Official release of InternLM2.5 7B base and chat models, with 1M-token context support.

License: Apache-2.0 · Stargazers: 0 · Issues: 0