Wilfredo Martel's repositories
awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
awesome-domain-adaptation
A collection of AWESOME things about domain adaptation
awesome-semantic-search
A curated list of awesome resources related to Semantic Search🔎 and Semantic Similarity tasks.
BERT-base-Turkish-QA
A custom Turkish question answering system made by fine-tuning BERTurk.
blog
Public repo for HF Semantic Search Explication
ChatGPT
Lightweight package for interacting with ChatGPT's API by OpenAI. Uses the reverse-engineered official API.
code_summarization
Experiments with LLM-based code summarization using few-shot learning for use in technical audits.
EmbeddingService
REST API microservice for handling Universal Sentence Encoder
feqa
Data and code for "A Question Answering Evaluation Framework for Faithfulness Assessment in Abstractive Summarization" (ACL 2020)
lightweight-spanish-language-models
ALBETO and DistilBETO are versions of ALBERT and DistilBERT pre-trained exclusively on Spanish corpora.
LiteratureReviewBot
Experiment to use GPT-3 to help write grant proposals.
LiveWhisper
A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.
open_robot_actuator_hardware
robot dog
OpenAIAuth
OpenAI Authentication Library for ChatGPT
OpenPrompt
An Open-Source Framework for Prompt-Learning.
Questgen.ai
Question generation using state-of-the-art Natural Language Processing algorithms
rabbitmq-advanced-spring-boot-starter
A generic library for messaging with RabbitMQ, built as an extension of Spring Boot AMQP
sgpt
SGPT: GPT Sentence Embeddings for Semantic Search
spring-webdav
WebDAV mapping for Spring Boot - Use an API like a network drive, open data as files, edit and save them.
SQuaD-Question-Answering-using-BERT
Successfully leveraged a pretrained BERT Transformer model for developing a question answering system.
start_with_bloom
Bloom is a new multi-lingual LLM (Large Language Model) from BigScience, a Hugging Face-hosted open collaboration with hundreds of researchers and institutions around the world. This repo contains a notebook and configuration scripts to get started with the basics of text generation using Bloom's 1.3B parameter pre-trained model.
sts_eval
Tools to apply Semantic Textual Similarity (STS) Evaluation to Language Models from Tensorflow Hub, Huggingface, etc.
transcribe-video-audio
A full-stack project based on OpenAI's Whisper for transcribing audio and video files, built with React & Django.
VoiceAsistant
A voice assistant with WhisperAI speech recognition
whisper
Robust Speech Recognition via Large-Scale Weak Supervision
whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
whisper_real_time
Real time transcription with OpenAI Whisper.
whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
xlm-v-experiments
Experiments for XLM-V Transformers Integration