Jorge Iranzo's starred repositories
llama-recipes
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.
simul_whisper
Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection
ccextractor
CCExtractor - Official version maintained by the core team
whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
VideoLLaMA2
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
paella-core
Paella Player core library
eamt24-linguistic-mt
A repo for resources for our EAMT 2024 tutorial
llm-foundry
LLM training code for Databricks foundation models
CroCoAlign
A Cross-Lingual, Context-Aware and Fully-Neural Sentence Alignment System for Long Texts.
compare-mt
A tool for holistic analysis of language generations systems
tensorrt_backend
The Triton backend for TensorRT.