scwp's starred repositories

public-apis

A collective list of free APIs

Language:PythonLicense:MITStargazers:306504Issues:4164Issues:615

coding-interview-university

A complete computer science study plan to become a software engineer.

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:129757Issues:1120Issues:15310

hiring-without-whiteboards

⭐️ Companies that don't have a broken hiring process

Language:JavaScriptLicense:MITStargazers:43273Issues:823Issues:0

data-engineering-zoomcamp

Free Data Engineering course!

Language:Jupyter NotebookStargazers:23887Issues:438Issues:124

EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Language:PythonLicense:Apache-2.0Stargazers:23011Issues:309Issues:972

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonLicense:MITStargazers:20292Issues:197Issues:366

Cookbook

The Data Engineering Cookbook

PySimpleGUI

Python GUIs for Humans! PySimpleGUI is the top-rated Python application development environment. Launched in 2018 and actively developed, maintained, and supported in 2024. Transforms tkinter, Qt, WxPython, and Remi into a simple, intuitive, and fun experience for both hobbyists and expert users.

Language:PythonLicense:NOASSERTIONStargazers:13286Issues:230Issues:3627

miller

Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON

Language:GoLicense:NOASSERTIONStargazers:8735Issues:71Issues:642

awesome-gis

😎Awesome GIS is a collection of geospatial related sources, including cartographic tools, geoanalysis tools, developer tools, data, conference & communities, news, massive open online course, some amazing map sites, and more.

tablesaw

Java dataframe and visualization library

Language:JavaLicense:Apache-2.0Stargazers:3492Issues:142Issues:724

public-apis

A collective list of free APIs

Language:PythonLicense:MITStargazers:1970Issues:36Issues:0

python-zeep

A Python SOAP client

Language:PythonLicense:NOASSERTIONStargazers:1876Issues:65Issues:1054

linkedin-api

👨‍💼Linkedin API for Python

Language:PythonLicense:MITStargazers:1760Issues:45Issues:252

manga-ocr

Optical character recognition for Japanese text, with the main focus being Japanese manga

Language:PythonLicense:Apache-2.0Stargazers:1526Issues:15Issues:59

BlueRetro

Multiplayer Bluetooth controllers adapter for retro video game consoles

Language:CLicense:Apache-2.0Stargazers:1238Issues:49Issues:364

awesome-geospatial-companies

:globe_with_meridians: List & Map of 700+ companies for geospatial jobs (GIS, Earth Observation, UAV, Satellite, Digital Farming, ..)

Language:PythonLicense:MITStargazers:701Issues:31Issues:42

joinery

Data frames for Java

Language:JavaLicense:GPL-3.0Stargazers:692Issues:43Issues:83
Language:PythonLicense:MITStargazers:497Issues:8Issues:81

email-to-pdf-converter

Converts email files (eml, msg) to pdf

Language:JavaLicense:Apache-2.0Stargazers:264Issues:14Issues:57

eml_parser

python eml parser module

Language:PythonLicense:AGPL-3.0Stargazers:206Issues:14Issues:60

WebCrawlerForOnlineInflation

Price Crawler - Tracking Price Inflation

Language:PythonStargazers:178Issues:7Issues:0

datasets

Various interesting datasets, mostly data from The University of Illinois

Language:Jupyter NotebookStargazers:173Issues:17Issues:7

EO-jobs

🛰️ List of earth observation companies and job sites

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:121Issues:9Issues:1

reddit-streaming-pipeline

A real-time reddit data streaming pipeline for sentiment analysis of various subreddits

Language:HCLLicense:MITStargazers:87Issues:4Issues:1
Language:JavaScriptLicense:MITStargazers:74Issues:3Issues:9

sec_employee_information_extraction

NSS Capstone project to use natural language modeling, classification, and information extraction to get the exact employee count values for companies from SEC filings.

Language:Jupyter NotebookStargazers:15Issues:1Issues:0

enrich-linkedin-companies-in-bulk

This script takes in a CSV file of Linkedin companies, enriches it, and returns the following fields: * Linkedin URL * Website * Company name * Employee count (range) * Follower count

Language:PythonStargazers:9Issues:0Issues:0

PSU-2019FALL-GEOG365-GISIntroR

This repository contains the materials for GEOG 365, Intruduction to GIS Programming with R, during the 2019 Fall semester at Penn State.

Language:RLicense:MITStargazers:3Issues:3Issues:1