Xuechen Li (lxuechen)

Company: Stanford University

Home Page: www.lxuechen.com

Twitter: @lxuechen


Organizations
lxuechen-org
stanford-crfm
stanfordnlp
VectorInstitute

Xuechen Li's starred repositories

tesseract

Tesseract Open Source OCR Engine (main repository)

Language: C++, License: Apache-2.0, Stargazers: 58506, Issues: 0

Inflection-Benchmarks

Public Inflection Benchmarks

License: MIT, Stargazers: 66, Issues: 0

showdown

A bidirectional Markdown to HTML to Markdown converter written in Javascript

Language: JavaScript, License: MIT, Stargazers: 13959, Issues: 0

LLMSys-PaperList

Large Language Model (LLM) Systems Paper List

Stargazers: 394, Issues: 0

promptbase

All things prompt engineering

Language: Python, License: MIT, Stargazers: 5110, Issues: 0

label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

Language: JavaScript, License: Apache-2.0, Stargazers: 16755, Issues: 0

GPTFast

Accelerate your Hugging Face Transformers 6-8.5x. Native to Hugging Face and PyTorch.

Language: Python, License: Apache-2.0, Stargazers: 636, Issues: 0

python-ftfy

Fixes mojibake and other glitches in Unicode text, after the fact.

Language: Python, License: NOASSERTION, Stargazers: 3725, Issues: 0

RingAttention

Transformers with Arbitrarily Large Context

Language: Python, License: Apache-2.0, Stargazers: 532, Issues: 0

vscode-chatgpt

An unofficial Visual Studio Code - OpenAI ChatGPT integration

Language: TypeScript, License: ISC, Stargazers: 3466, Issues: 0

playwright

Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.

Language: TypeScript, License: Apache-2.0, Stargazers: 62283, Issues: 0

old

A Codemirror-based editor with many modern need-to-haves (e.g. LSP, Copilot, Vim, Remote SSH)

Language: TypeScript, License: MIT, Stargazers: 195, Issues: 0

ALLaVA

Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model

Language: Python, License: Apache-2.0, Stargazers: 188, Issues: 0

ts-results

A typescript implementation of Rust's Result object.

Language: TypeScript, License: MIT, Stargazers: 1074, Issues: 0

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language: Python, License: Apache-2.0, Stargazers: 5066, Issues: 0

commander.js

node.js command-line interfaces made easy

Language: JavaScript, License: MIT, Stargazers: 26180, Issues: 0

haystack

LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

Language: Python, License: Apache-2.0, Stargazers: 13918, Issues: 0

TaskWeaver

A code-first agent framework for seamlessly planning and executing data analytics tasks.

Language: Python, License: MIT, Stargazers: 4806, Issues: 0

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language: Python, License: NOASSERTION, Stargazers: 5254, Issues: 0

RWKV-Runner

A RWKV management and startup tool, full automation, only 8MB. And provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for commercial use.

Language: TypeScript, License: MIT, Stargazers: 4609, Issues: 0

beir

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Language: Python, License: Apache-2.0, Stargazers: 1411, Issues: 0

Language: Python, Stargazers: 137, Issues: 0

wimbd

What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets

Language: Python, License: Apache-2.0, Stargazers: 136, Issues: 0

dspy

DSPy: The framework for programming—not prompting—foundation models

Language: Python, License: MIT, Stargazers: 11602, Issues: 0

gharchive.org

GH Archive is a project to record the public GitHub timeline, archive it, and make it easily accessible for further analysis.

Language: Ruby, License: MIT, Stargazers: 2579, Issues: 0

notion-sdk-js

Official Notion JavaScript Client

Language: TypeScript, License: MIT, Stargazers: 4617, Issues: 0

mteb

MTEB: Massive Text Embedding Benchmark

Language: Python, License: Apache-2.0, Stargazers: 1467, Issues: 0

pkl

A configuration as code language with rich validation and tooling.

Language: Java, License: Apache-2.0, Stargazers: 9659, Issues: 0

openai-openapi

OpenAPI specification for the OpenAI API

License: MIT, Stargazers: 999, Issues: 0

vscodium

binary releases of VS Code without MS branding/telemetry/licensing

Language: Shell, License: MIT, Stargazers: 23888, Issues: 0