rogervaas's repositories
BitDistiller
A novel QAT framework with self-distillation to enhance ultra-low-bit LLMs.
Cerberus
A few simple, but solid patterns for responsive HTML email templates and newsletters. Even in Outlook and Gmail.
desktop-live-caption
Transcribe desktop (computer) audio locally and in real time (streaming ASR), using TorchAudio and the Emformer-RNNT model for inference, PyAudio for reading the stream, and Tkinter for the GUI.
distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
ELLA
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
emotion2vec
Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
FinGPT
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
gptscript
Develop LLM Apps in Natural Language
keycloak-magic-link
Magic Link Authentication for Keycloak
LaVi-Bridge
Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation
llama-moe
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training
Long-Context-Data-Engineering
Implementation of the paper "Data Engineering for Scaling Language Models to 128K Context"
Lumos
A RAG LLM co-pilot for browsing the web, powered by local LLMs
MeloTTS
High-quality multilingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese, and Korean.
OpenCodeInterpreter
OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophisticated proprietary systems like the GPT-4 Code Interpreter. It significantly enhances code generation capabilities by integrating execution and iterative refinement functionalities.
openlogprobs
Extract full next-token probabilities via language model APIs
pg_activity
pg_activity is a top-like application for PostgreSQL server activity monitoring.
PIXIU
This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artificial intelligence (AI).
presidio
Context-aware, pluggable, and customizable data protection and anonymization service for text and images
prometheus-vision
An Evaluator VLM that is open-source, offers reproducible evaluation, and is inexpensive to use. Specifically designed for fine-grained evaluation on customized score rubrics, Prometheus-Vision is a good alternative to human evaluation and GPT-4V evaluation.
search_with_lepton
Building a quick conversation-based search demo with Lepton AI.
Self-Rewarding-Language-Models
This is work done by the Oxen.ai Community to reproduce the Self-Rewarding Language Models paper from Meta AI.
Sensei
Generate Synthetic Data Using OpenAI or MistralAI
stract
web search done right
WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.