Thomas Boquet (tboquet)

Company: @Mistplay

Location: Montréal

Thomas Boquet's starred repositories

llama_index

LlamaIndex is a data framework for your LLM applications

Language: Python | License: MIT | Stargazers: 32491 | Issues: 232 | Issues: 4135

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language: Python | License: Apache-2.0 | Stargazers: 23268 | Issues: 194 | Issues: 3634

loguru

Python logging made (stupidly) simple

Language: Python | License: MIT | Stargazers: 18486 | Issues: 140 | Issues: 960

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 10408 | Issues: 139 | Issues: 310

text-generation-inference

Large Language Model Text Generation Inference

Language: Python | License: Apache-2.0 | Stargazers: 8191 | Issues: 99 | Issues: 1138

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language: Python | License: Apache-2.0 | Stargazers: 7085 | Issues: 111 | Issues: 146

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language: Python | License: MIT | Stargazers: 6299 | Issues: 61 | Issues: 76

chainlit

Build Conversational AI in minutes ⚡️

Language: TypeScript | License: Apache-2.0 | Stargazers: 5829 | Issues: 48 | Issues: 563

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language: Python | License: Apache-2.0 | Stargazers: 4075 | Issues: 112 | Issues: 118

CTranslate2

Fast inference engine for Transformer models

Language: C++ | License: MIT | Stargazers: 2948 | Issues: 56 | Issues: 646

Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Language: Python | License: BSD-3-Clause | Stargazers: 2525 | Issues: 30 | Issues: 148

prompttools

Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).

Language: Python | License: Apache-2.0 | Stargazers: 2499 | Issues: 29 | Issues: 59

alibi-detect

Algorithms for outlier, adversarial and drift detection

Language: Python | License: NOASSERTION | Stargazers: 2118 | Issues: 37 | Issues: 356

consistencydecoder

Consistency Distilled Diff VAE

Language: Python | License: MIT | Stargazers: 2082 | Issues: 23 | Issues: 19

DiskANN

Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search

Language: C++ | License: NOASSERTION | Stargazers: 906 | Issues: 24 | Issues: 184

LLM-Blender

[ACL 2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cuts the weaknesses through ranking and integrates the strengths through fusing generation to enhance the capability of LLMs.

Language: Python | License: Apache-2.0 | Stargazers: 806 | Issues: 13 | Issues: 22

MRL

Code repository for the paper - "Matryoshka Representation Learning"

Language: Jupyter Notebook | License: MIT | Stargazers: 347 | Issues: 7 | Issues: 6
Language: Jupyter Notebook | License: MIT | Stargazers: 340 | Issues: 7 | Issues: 0
Language: Python | License: BSD-3-Clause | Stargazers: 330 | Issues: 18 | Issues: 16

torchsynth

A GPU-optional modular synthesizer in PyTorch, 16200x faster than realtime, for audio ML researchers.

Language: Python | License: Apache-2.0 | Stargazers: 318 | Issues: 12 | Issues: 165

cords

Reduce end-to-end training time from days to hours (or hours to minutes), and energy requirements/costs by an order of magnitude, using coresets and data selection.

Language: Jupyter Notebook | License: MIT | Stargazers: 313 | Issues: 13 | Issues: 47

legal-ml-datasets

A collection of datasets and tasks for legal machine learning

tasksource

Dataset collection and preprocessing framework for NLP extreme multitask learning

Language: Python | License: Apache-2.0 | Stargazers: 131 | Issues: 4 | Issues: 8

TACTiS

TACTiS-2: Better, Faster, Simpler Attentional Copulas for Multivariate Time Series, from ServiceNow Research

Language: Python | License: Apache-2.0 | Stargazers: 106 | Issues: 10 | Issues: 7

tasknet

Easy multi-task learning with HuggingFace Datasets and Trainer

Language: Python | License: GPL-3.0 | Stargazers: 37 | Issues: 3 | Issues: 6

STTD

Uncertainty Quantification via Spatial-Temporal Tweedie Model for Zero-inflated and Long-tail Travel Demand Prediction

Language: Python | License: MIT | Stargazers: 12 | Issues: 0 | Issues: 0

twitter-reddit-agent

Scrape tweets or Reddit submissions and chat with them using LangChain

Language: Python | Stargazers: 4 | Issues: 0 | Issues: 0
Language: Python | License: MIT | Stargazers: 4 | Issues: 0 | Issues: 0

fca_bulk_data

Bulk Access to Federal Court of Appeal (Canada) Decisions

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 2 | Issues: 0 | Issues: 0
Language: Julia | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0