Lars Nieradzik's repositories

kmeans-anchor-boxes

k-means clustering with the Intersection over Union (IoU) metric as described in the YOLO9000 paper

Language:PythonLicense:MITStargazers:534Issues:10Issues:21

object-localization

Object localization in images using simple CNNs and Keras

Language:PythonLicense:MITStargazers:137Issues:5Issues:20

chinese-subtitle-ocr

Optical character recognition for Chinese subtitles using SSD and CNN

Language:PythonLicense:MITStargazers:108Issues:8Issues:4

forced-alignment-chinese

Mandarin Chinese audio datasets aligned with Montreal Forced Aligner

Language:PythonLicense:MITStargazers:9Issues:1Issues:0

fastspeech2-clean

Clean and modernized implementation of FastSpeech2/LightSpeech using IPA

Language:PythonLicense:MITStargazers:6Issues:2Issues:3

bigvgan-mirror

A mirror of BigVGAN and HiFi-GAN for access via PyTorch Hub.

Language:PythonLicense:MITStargazers:2Issues:2Issues:0

segmentation_activations

Code for the paper "Effect of the output activation function on the probabilities and errors in medical image segmentation"

Language:PythonLicense:MITStargazers:2Issues:2Issues:0

pysais-utf8

Python C module for creating suffix, LCP and BWT arrays with UTF-8 text.

Language:CLicense:MITStargazers:1Issues:0Issues:0

BigVGAN

Official PyTorch implementation of BigVGAN (ICLR 2023)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:JavaLicense:Apache-2.0Stargazers:0Issues:1Issues:0

config

Simple config library in C

Language:CLicense:MITStargazers:0Issues:1Issues:0

helloworld

helloworld program using JSF, Maven, Glassfish, Java EE.

Language:HTMLLicense:MITStargazers:0Issues:1Issues:0

llm-cn-en-dict

Using LLMs to generate a synthetic Chinese-English dictionary

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

pitch-benchmark

Comprehensive benchmark suite comparing pitch detection algorithms across NSynth, PTDB, and MDB-STEM-Synth datasets.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:1Issues:0

story-evaluation-llm

LLM-generated story dataset with quality evaluations across 15 models for training and benchmarking creative writing capabilities.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

g2pW

Chinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language:PythonStargazers:0Issues:0Issues:0

woodyolo

A specialized object detection model originally designed for microscopic wood vessel identification but applicable to any high-recall detection task.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0