Lars Nieradzik's repositories
kmeans-anchor-boxes
k-means clustering with the Intersection over Union (IoU) metric as described in the YOLO9000 paper
object-localization
Object localization in images using simple CNNs and Keras
chinese-subtitle-ocr
Optical character recognition for Chinese subtitles using SSD and CNN
forced-alignment-chinese
Mandarin Chinese audio datasets aligned with Montreal Forced Aligner
fastspeech2-clean
Clean and modernized implementation of FastSpeech2/LightSpeech using IPA
bigvgan-mirror
A mirror of BigVGAN and HiFi-GAN for access via PyTorch Hub.
segmentation_activations
Code for the paper "Effect of the output activation function on the probabilities and errors in medical image segmentation"
pysais-utf8
Python C module for creating suffix, LCP and BWT arrays with UTF-8 text.
BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
helloworld
helloworld program using JSF, Maven, Glassfish, Java EE.
llm-cn-en-dict
Using LLMs to generate a synthetic Chinese-English dictionary
pitch-benchmark
Comprehensive benchmark suite comparing pitch detection algorithms across NSynth, PTDB, and MDB-STEM-Synth datasets.
story-evaluation-llm
LLM-generated story dataset with quality evaluations across 15 models for training and benchmarking creative writing capabilities.
g2pW
Chinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)
hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
woodyolo
A specialized object detection model originally designed for microscopic wood vessel identification but applicable to any high-recall detection task.