ConstantinFoe's starred repositories

Macaw-LLM

Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration

Language: Python · License: Apache-2.0 · Stargazers: 1490 · Issues: 0

VALOR

Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset

Language: Python · License: MIT · Stargazers: 252 · Issues: 0

Language: Jupyter Notebook · Stargazers: 416 · Issues: 0

spacy-llm

🦙 Integrating LLMs into structured NLP pipelines

Language: Python · License: MIT · Stargazers: 1037 · Issues: 0

chatgpt-retrieval-plugin

The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

Language: Python · License: MIT · Stargazers: 20991 · Issues: 0

speechbrain

A PyTorch-based Speech Toolkit

Language: Python · License: Apache-2.0 · Stargazers: 8352 · Issues: 0

NDRData-Corona-Liveticker

Text corpus of the Corona live-ticker reports from NDR.de, 2020-2023

Stargazers: 4 · Issues: 0

clip-as-service

🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP

Language: Python · License: NOASSERTION · Stargazers: 12325 · Issues: 0

Language: Jupyter Notebook · License: MIT · Stargazers: 438 · Issues: 0

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language: Jupyter Notebook · License: MIT · Stargazers: 5650 · Issues: 0

dalai

The simplest way to run LLaMA on your local machine

Language: CSS · Stargazers: 13100 · Issues: 0

llama.cpp

LLM inference in C/C++

Language: C++ · License: MIT · Stargazers: 63082 · Issues: 0

minimal

Minimal is a Jekyll theme for GitHub Pages

Language: SCSS · License: CC0-1.0 · Stargazers: 1528 · Issues: 0

inaSpeechSegmenter

CNN-based audio segmentation toolkit. Detects speech, music, noise and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.

Language: Python · License: MIT · Stargazers: 727 · Issues: 0

automated-election-reporting-de

automated-election-reporting-de automates election reporting by generating articles from election data

Language: Python · License: MIT · Stargazers: 3 · Issues: 0

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language: Python · License: Apache-2.0 · Stargazers: 11165 · Issues: 0

whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Language: Jupyter Notebook · License: BSD-2-Clause · Stargazers: 2560 · Issues: 0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Python · License: MIT · Stargazers: 65626 · Issues: 0

recurring-content-detector

Unsupervised detection of opening / closing credits, recaps, and previews in video files 🎥🍿🎬

Language: Python · License: MIT · Stargazers: 90 · Issues: 0

wit

WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.

License: NOASSERTION · Stargazers: 981 · Issues: 0

Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Language: Python · License: Apache-2.0 · Stargazers: 36881 · Issues: 0

cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Language: Python · License: AGPL-3.0 · Stargazers: 9207 · Issues: 0

Made-With-ML

Learn how to design, develop, deploy and iterate on production-grade ML applications.

Language: Jupyter Notebook · License: MIT · Stargazers: 36735 · Issues: 0

Language: Python · License: Apache-2.0 · Stargazers: 26 · Issues: 0

german-asr-lm-tools

Crawling and creating a German language model resource

Language: Python · License: Apache-2.0 · Stargazers: 18 · Issues: 0

scf

Subtitling Conversion Framework

Language: XSLT · License: Apache-2.0 · Stargazers: 52 · Issues: 0

pruefstelle

Prototype for quality management of automatic indexing (text mining) in the archive

Language: Python · License: GPL-3.0 · Stargazers: 2 · Issues: 0

benchmarkstt

Open-source AI toolkit for benchmarking speech-to-text services

Language: Python · License: MIT · Stargazers: 53 · Issues: 0

berts

DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models

License: MIT · Stargazers: 154 · Issues: 0

opencv

Open Source Computer Vision Library

Language: C++ · License: Apache-2.0 · Stargazers: 77322 · Issues: 0