Run Huang (itsrun)

Location: Los Angeles, USA

Home Page: https://iamhuang.run


Organizations
ARamsay118

Run Huang's starred repositories

nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Language: Python | License: MIT | Stargazers: 8842 | Issues: 63 | Issues: 213

html-to-image

✂️ Generates an image from a DOM node using HTML5 canvas and SVG.

Language: TypeScript | License: MIT | Stargazers: 5704 | Issues: 31 | Issues: 323

cloudscraper

A Python module to bypass Cloudflare's anti-bot page.

Language: Python | License: MIT | Stargazers: 4337 | Issues: 151 | Issues: 0

pdf2htmlEX

Convert PDF to HTML without losing text or format.

Language: HTML | License: NOASSERTION | Stargazers: 3752 | Issues: 56 | Issues: 136

fake-useragent

Up-to-date simple useragent faker with real world database

Language: Python | License: Apache-2.0 | Stargazers: 3650 | Issues: 61 | Issues: 138

KeyBERT

Minimal keyword extraction with BERT

Language: Python | License: MIT | Stargazers: 3480 | Issues: 32 | Issues: 201

grobid

A machine learning software for extracting information from scholarly documents

Language: Java | License: Apache-2.0 | Stargazers: 3470 | Issues: 96 | Issues: 868

BrowserBox

🌀 Browse the web from a web page. Remote browser isolation. For security, privacy and more! By https://dosyago.com

Language: JavaScript | License: NOASSERTION | Stargazers: 3398 | Issues: 30 | Issues: 297

puppeteer-cluster

Puppeteer Pool, run a cluster of instances in parallel

Language: TypeScript | License: MIT | Stargazers: 3217 | Issues: 48 | Issues: 255

internetarchive

A Python and Command-Line Interface to Archive.org

Language: Python | License: AGPL-3.0 | Stargazers: 1592 | Issues: 56 | Issues: 386

wayback

IA's public Wayback Machine (moved from SourceForge)

megadesk

Open-source IKEA Bekant controller board

Language: HTML | License: GPL-3.0 | Stargazers: 721 | Issues: 22 | Issues: 90

papermage

Library supporting NLP and CV research on scientific papers

Language: Python | License: Apache-2.0 | Stargazers: 684 | Issues: 9 | Issues: 33

py-lmdb

Universal Python binding for the LMDB 'Lightning' Database

Language: C | License: NOASSERTION | Stargazers: 643 | Issues: 26 | Issues: 286

mini-react

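A hand-written implementation of the main flow of the react, react-dom, and react reconciler source code, meant to deepen understanding of the React source. Covers fiber, synthetic events, how hooks are implemented, DOM diff, reconciliation, the scheduler, and more.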

pdf.js-hypothes.is

PDF.js + Hypothesis viewer / annotator

scipdf_parser

Python PDF parser for scientific publications: content and figures

Language: Python | License: MIT | Stargazers: 330 | Issues: 9 | Issues: 18

grobid_client_python

Python client for GROBID Web services

Language: Python | License: Apache-2.0 | Stargazers: 280 | Issues: 6 | Issues: 54

pyscisci

Science of Science

Language: Python | License: MIT | Stargazers: 155 | Issues: 11 | Issues: 15

pyalex

A Python library for OpenAlex (openalex.org)

Language: Python | License: MIT | Stargazers: 153 | Issues: 4 | Issues: 13

papers-ux-ai-programming

List of research papers investigating the user experience of AI-powered programming assistants (e.g., Copilot).

License: MIT | Stargazers: 79 | Issues: 4 | Issues: 0

wayback

A Python API to the Internet Archive Wayback Machine

Language: Python | License: BSD-3-Clause | Stargazers: 63 | Issues: 8 | Issues: 59

article_dataset_builder

Open Access PDF harvester, metadata aggregator and full-text ingester

Language: Python | License: Apache-2.0 | Stargazers: 54 | Issues: 5 | Issues: 4

openalex-concept-tagging

Scripts used to make and evaluate OpenAlex's concept tagging model

Language: Jupyter Notebook | License: MIT | Stargazers: 48 | Issues: 7 | Issues: 5

FDU-CS-ClassMaterials

Study Materials

Language: Java | Stargazers: 43 | Issues: 1 | Issues: 0

adscraper

A web crawler for scraping online ad content

Language: TypeScript | License: MIT | Stargazers: 20 | Issues: 9 | Issues: 4

usc-csci-670-advanced-algos-notes

My LaTeX notes from the Ph.D. level Advanced Algorithms course at the University of Southern California

Language: TeX | Stargazers: 11 | Issues: 1 | Issues: 0

PAM2023-CDNPassword

Quantifying User Password Exposure to Third-Party

Language: JavaScript | Stargazers: 2 | Issues: 1 | Issues: 0