WangYuqi's starred repositories

OpenResearchAssistant

An automated tool for discovering insights from research paper corpora

Language: HTML | License: Apache-2.0 | Stargazers: 102 | Issues: 0

surya

OCR, layout analysis, reading order, line detection in 90+ languages

Language: Python | License: GPL-3.0 | Stargazers: 8542 | Issues: 0

E2B

Secure cloud runtime for AI apps & AI agents. Fully open-source.

Language: TypeScript | License: Apache-2.0 | Stargazers: 6252 | Issues: 0

latexify_py

A library to generate LaTeX expressions from Python code.

Language: Python | License: Apache-2.0 | Stargazers: 7083 | Issues: 0

leetcode-cli

A CLI tool to enjoy LeetCode!

Language: JavaScript | License: MIT | Stargazers: 3607 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 504 | Issues: 0

cleanvision

Automatically find issues in image datasets and practice data-centric computer vision.

Language: Python | License: AGPL-3.0 | Stargazers: 938 | Issues: 0

synthtiger

Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021

Language: Python | License: MIT | Stargazers: 440 | Issues: 0

SynthText

Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.

Language: Python | License: Apache-2.0 | Stargazers: 1982 | Issues: 0

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment on server, mobile, embedded, and IoT devices)

Language: Python | License: Apache-2.0 | Stargazers: 39632 | Issues: 0

BrowserGym

BrowserGym, a gym environment for web task automation in the Chromium browser.

Language: Python | License: NOASSERTION | Stargazers: 189 | Issues: 0

cover-agent

CodiumAI Cover-Agent: An AI-Powered Tool for Automated Test Generation and Code Coverage Enhancement! 💻🤖🧪🐞

Language: Python | License: AGPL-3.0 | Stargazers: 3811 | Issues: 0

LEGENT

Open Platform for Embodied Agents

Language: Python | License: Apache-2.0 | Stargazers: 142 | Issues: 0

WorkArena

WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?

Language: Python | License: NOASSERTION | Stargazers: 86 | Issues: 0

AlphaCodium

Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering"

Language: Python | License: AGPL-3.0 | Stargazers: 3232 | Issues: 0

LiveCodeBench

Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"

Language: Python | License: MIT | Stargazers: 124 | Issues: 0
Language: Python | Stargazers: 74 | Issues: 0

webvid

Large-scale text-video dataset. 10 million captioned short videos.

Language: Python | Stargazers: 528 | Issues: 0

gpt-tokens

What is learned in tiktoken?

Language: Python | Stargazers: 60 | Issues: 0
Language: Python | License: NOASSERTION | Stargazers: 21 | Issues: 0

OneChart

Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"

Language: Python | License: Apache-2.0 | Stargazers: 101 | Issues: 0

awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

License: Apache-2.0 | Stargazers: 2901 | Issues: 0
Language: Python | Stargazers: 670 | Issues: 0

sailcraft

Data Toolkit for Sailor Language Models

Language: Python | Stargazers: 65 | Issues: 0

reka-vibe-eval

Multimodal language model benchmark, featuring challenging examples

Language: Python | License: Apache-2.0 | Stargazers: 136 | Issues: 0
Language: Python | License: MIT | Stargazers: 56 | Issues: 0

Awesome-MLLM-Hallucination

📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).

Stargazers: 223 | Issues: 0

Long-Context-Data-Engineering

Implementation of the paper "Data Engineering for Scaling Language Models to 128K Context"

Language: Python | Stargazers: 368 | Issues: 0
Language: HTML | Stargazers: 1 | Issues: 0

Blender-3D-document-rendering-pipeline

Render documents on virtual paper with folds and other types of damage using Blender geometry nodes.

Language: Python | License: MIT | Stargazers: 11 | Issues: 0