Thomas Boquet's starred repositories
llama_index
LlamaIndex is a data framework for your LLM applications
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
text-generation-inference
Large Language Model Text Generation Inference
streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
alignment-handbook
Robust recipes to align language models with human and AI preferences
CTranslate2
Fast inference engine for Transformer models
Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
prompttools
Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).
alibi-detect
Algorithms for outlier, adversarial and drift detection
consistencydecoder
Consistency Distilled Diff VAE
LLM-Blender
[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cut the weaknesses through ranking and integrate the strengths through fusing generation to enhance the capability of LLMs.
torchsynth
A GPU-optional modular synthesizer in pytorch, 16200x faster than realtime, for audio ML researchers.
legal-ml-datasets
A collection of datasets and tasks for legal machine learning
tasksource
Datasets collection and preprocessings framework for NLP extreme multitask learning
twitter-reddit-agent
Scrape Tweets or Reddit submissions and chat with them using Langchain
fca_bulk_data
Bulk Access to Federal Court of Appeal (Canada) Decisions