xaviviro

xaviviro

Geek Repo

Location:Barcelona

Github PK Tool:Github PK Tool

xaviviro's starred repositories

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:23261Issues:0Issues:0

TheBigPromptLibrary

A collection of prompts, system prompts and LLM instructions

Language:HTMLLicense:MITStargazers:730Issues:0Issues:0

LivePortrait

Bring portraits to life!

Language:PythonLicense:MITStargazers:7286Issues:0Issues:0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:13Issues:0Issues:0

news-please

news-please - an integrated web crawler and information extractor for news that just works

Language:PythonLicense:Apache-2.0Stargazers:1999Issues:0Issues:0

faiss

A library for efficient similarity search and clustering of dense vectors.

Language:C++License:MITStargazers:29602Issues:0Issues:0

StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

Language:PythonLicense:MITStargazers:754Issues:0Issues:0

MimicBrush

Official implementations for paper: Zero-shot Image Editing with Reference Imitation

Language:PythonLicense:Apache-2.0Stargazers:911Issues:0Issues:0

hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Language:PythonLicense:MITStargazers:7259Issues:0Issues:0

langchain-swift

🚀 LangChain for Swift. Optimized for iOS, macOS, watchOS (part) and visionOS.(beta)

Language:SwiftLicense:Apache-2.0Stargazers:269Issues:0Issues:0

whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Language:Jupyter NotebookLicense:BSD-2-ClauseStargazers:2476Issues:0Issues:0

Perplexica

Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI

Language:TypeScriptLicense:MITStargazers:11237Issues:0Issues:0

crewAI

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

Language:PythonLicense:MITStargazers:17469Issues:0Issues:0

pandas-ai

Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.

Language:PythonLicense:NOASSERTIONStargazers:12112Issues:0Issues:0

Python

All Algorithms implemented in Python

Language:PythonLicense:MITStargazers:182340Issues:0Issues:0

deepface

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

Language:PythonLicense:MITStargazers:11046Issues:0Issues:0
Language:PythonStargazers:113Issues:0Issues:0

awesome-llm-apps

Collection of awesome LLM apps with RAG using OpenAI, Anthropic, Gemini and opensource models.

Language:PythonLicense:CC0-1.0Stargazers:2564Issues:0Issues:0

spaCy

💫 Industrial-strength Natural Language Processing (NLP) in Python

Language:PythonLicense:MITStargazers:29328Issues:0Issues:0

PoseEstimationForMobile

:dancer: Real-time single person pose estimation for Android and iOS.

Language:C++License:Apache-2.0Stargazers:1000Issues:0Issues:0

ragapp

The easiest way to use Agentic RAG in any enterprise

Language:TypeScriptLicense:Apache-2.0Stargazers:2916Issues:0Issues:0

V-Express

V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.

Language:PythonStargazers:2077Issues:0Issues:0

WhisperKit

On-device Speech Recognition for Apple Silicon

Language:SwiftLicense:MITStargazers:2962Issues:0Issues:0

yolov10

YOLOv10: Real-Time End-to-End Object Detection

Language:PythonLicense:AGPL-3.0Stargazers:8409Issues:0Issues:0

faster-whisper

Faster Whisper transcription with CTranslate2

Language:PythonLicense:MITStargazers:10397Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:109Issues:0Issues:0

dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

Language:TypeScriptLicense:NOASSERTIONStargazers:38304Issues:0Issues:0

quivr

Open-source RAG Framework for building GenAI Second Brains 🧠 Build productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, Groq that you can share with users ! Efficient retrieval augmented generation framework

Language:PythonLicense:NOASSERTIONStargazers:34268Issues:0Issues:0

FunClip

Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.

Language:PythonLicense:MITStargazers:2807Issues:0Issues:0

Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

Language:PythonLicense:MITStargazers:1910Issues:0Issues:0