Peter (pszemraj)



Location: Zurich, Switzerland

Home Page: pszemraj.carrd.co/


Peter's repositories

vid2cleantxt

Python API & command-line tool to easily transcribe speech-based video files into clean text

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 157 | Issues: 3 | Issues: 10

textsum

CLI & Python API to easily summarize text-based files with transformers

Language: Python | License: Apache-2.0 | Stargazers: 110 | Issues: 5 | Issues: 5

ai-msgbot

Training & Implementation of chatbots leveraging GPT-like architecture with the aitextgen package to enable dynamic conversations.

Language: Jupyter Notebook | Stargazers: 46 | Issues: 3 | Issues: 0

BoulderAreaDetector

An app that uses a CNN to classify whether a satellite image shows an area that would be a good rock climbing spot. On Streamlit.

Language: Python | License: Apache-2.0 | Stargazers: 17 | Issues: 2 | Issues: 0

confectionary

a tool to quickly create sweet PDF files from text files :cupcake:

Language: Python | License: Apache-2.0 | Stargazers: 3 | Issues: 1 | Issues: 2

lm-api

Efficiently query multiple prompts with ease: a command-line tool for batch querying large language models.

Language: Python | License: Apache-2.0 | Stargazers: 2 | Issues: 2 | Issues: 2

ml4hc-s22-project01

An investigation into tabular classification with deep NNs for ETHZ Machine Learning for Healthcare on the MIT-BIH arrhythmia dataset.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2 | Issues: 1 | Issues: 0

scrape-viz-jobs

A tool for scraping and clustering job postings from ch.indeed.com; visualization uses various clustering and dimensionality reduction techniques.

Language: Jupyter Notebook | License: MIT | Stargazers: 2 | Issues: 0 | Issues: 0

pubmed-text-classification

ETHZ Machine Learning for Healthcare Problem 2: classification of PubMed paper sentences or text into document sections.

Language: Jupyter Notebook | License: MIT | Stargazers: 1 | Issues: 1 | Issues: 0

rpunct-cpu

📝 An easy-to-use package to restore punctuation in text (+ CPU)

Language: Python | Stargazers: 1 | Issues: 0 | Issues: 0

Slack-Export-JSON-to-CSV

Convert Slack messages exported in their complicated JSON format to simple CSV format, by channel or entire exported workspace

Language: Python | License: Unlicense | Stargazers: 1 | Issues: 0 | Issues: 0

SummComparer

compiles and parses the summarization gauntlet and results from various models into a dataset-like format

Language: Python | License: MIT | Stargazers: 1 | Issues: 1 | Issues: 0

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

autoEDA-resources

A list of software and papers related to automatic and fast Exploratory Data Analysis

Language: HTML | License: CC-BY-4.0 | Stargazers: 0 | Issues: 0 | Issues: 0

AutoGPTQ

An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

autolabel

Label, clean and enrich text datasets with LLMs.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

CASSINI_geo

for the CASSINI hackathon

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

contrastors

Train Models Contrastively in PyTorch

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

DailyDialogue-Parser

Parser for DailyDialogue Dataset, updated with some conventions and additional cleaning for text-generation

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

deepcluster

Custom PyTorch model (VGG-16 Auto-Encoder) and custom criterion (Local Aggregation) for image clustering. The repo includes detailed creation of fungi image data using the factory method.

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

inbox_cleaner

A Python script to help manage a Gmail inbox by filtering out promotional emails using GPT-3 or GPT-4.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language: Python | License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

mteb

MTEB: Massive Text Embedding Benchmark

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

optimum

🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy-to-use hardware optimization tools

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

pythoncode-tutorials

The Python Code Tutorials

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

unlimiformer

Public repo for the preprint "Unlimiformer: Long-Range Transformers with Unlimited Length Input"

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0