Clément Doumouro's starred repositories
eng-practices
Google's Engineering Practices documentation
parserator
:bookmark: A toolkit for making domain-specific probabilistic parsers
mPLUG-DocOwl
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
recordlinkage
A powerful and modular toolkit for record linkage and duplicate detection in Python
llm-movieagent
Semantic layer on top of a graph database to provide an LLM with a set of robust tools to interact with the database
unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
instructor
structured outputs for llms
Awesome-Prompt-Engineering
This repository contains a hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM etc
nlm-ingestor
This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.
textract-cli
CLI for running files through AWS Textract
llm-graph-builder
Neo4j graph construction from unstructured data
git-filter-repo
Quickly rewrite git repository history (filter-branch replacement)
llama_parse
Parse files for optimal RAG