Luca Foppiano's repositories
grobid-quantities
GROBID extension for identifying and normalizing physical quantities.
streamlit-pdf-viewer
Streamlit PDF viewer
structure-vision
Viewer for the structure extracted by Grobid on PDF documents
document-qa
Scientific Document Insight Q/A
grobid-superconductors
Grobid module for superconductor material and properties extraction
PhD-Thesis
PhD Dissertation "Automated Extraction and Curation of Materials Information from Scientific Literature"
material-parsers
Material parsers and other tools, scripts Initially developed for Grobid Superconductor
grobid-quantities-python-client
Python client for Grobid Quantities
MatSci-LumEn
MatSci-LumEn: Materials Science Large Language Models Evaluation for text and data mining
stackexchange-dataset
Python tools for processing the stackexchange data dumps into a text dataset for Language Models
MatTPUSciBERT
Material SciBERT trained on TPU
entity-fishing
A machine learning tool for fishing entities
langchain
🦜🔗 Build context-aware reasoning applications
prompt-layer-library
🍰 PromptLayer - Maintain a log of your prompts and OpenAI API requests. Track, debug, and replay old completions.
Pub2TEI
Service for converting and enhancing heterogeneous publisher XML formats into TEI
scipdf_parser
Python PDF parser for scientific publications: content and figures
supercon2-paper
Repository of the article "Semi-automatic staging area for high-quality structured data extraction from scientific literature"
tika
The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.