Lars Nieradzik's repositories
kmeans-anchor-boxes
k-means clustering with the Intersection over Union (IoU) metric as described in the YOLO9000 paper
object-localization
Object localization in images using simple CNNs and Keras
chinese-subtitle-ocr
Optical character recognition for Chinese subtitles using SSD and CNN
pitch-benchmark
Comprehensive benchmark suite comparing pitch detection algorithms across multiple datasets.
fastspeech2-clean
Clean and modernized implementation of FastSpeech2/LightSpeech using IPA
forced-alignment-chinese
Mandarin Chinese audio datasets aligned with Montreal Forced Aligner
bigvgan-mirror
A mirror of BigVGAN and HiFi-GAN for access via PyTorch Hub.
segmentation_activations
Code for the paper "Effect of the output activation function on the probabilities and errors in medical image segmentation"
pysais-utf8
Python C module for creating suffix, LCP and BWT arrays with UTF-8 text.
story-evaluation-llm
LLM-generated story dataset with quality evaluations across 15 models for training and benchmarking creative writing capabilities.
BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
helloworld
helloworld program using JSF, Maven, Glassfish, Java EE.
llm-cn-en-dict
Using LLMs to generate a synthetic Chinese-English dictionary
g2pW
Chinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)
hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
woodyolo
A specialized object detection model originally designed for microscopic wood vessel identification but applicable to any high-recall detection task.