Emerson Pedroso's repositories
xphoneBR
XphoneBR is a Brazilian portuguese transformer base grapheme-to-phoneme and normalization tool modeling library that leverages recent deep learning technology and is optimized for usage in production systems such as TTS. In particular, the library should be accurate, fast, easy to use
AFFiNE
There can be more than Notion and Miro. AFFiNE(pronounced [ə‘fain]) is a next-gen knowledge base that brings planning, sorting and creating all together. Privacy first, open-source, customizable and ready to use.
alexa-gpt
A tutorial on how to use ChatGPT in Alexa
AuxiliaryASR
Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)
CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
consumo-api-datajud
Projeto em Angular para consumir a API do Datajud, possibilitando a visualização e gestão de dados judiciais de forma eficiente. Inclui integração com o serviço de backend e manipulação de requisições HTTP, facilitando o acesso a informações através de endpoints fornecidos pela API.
docker
The Docker configuration for Cal.com is an effort powered by people within the community. Cal.com, Inc. does not provide official support for Docker, but we will accept fixes and documentation. Use at your own risk.
facefusion
Next generation face swapper and enhancer
faucet
Bitcoinnano Faucet
gemini-openai-nextjs
OpenAI to Google Gemini https://gemini-openai-proxy.deno.dev
gemini-to-openai-proxy
Call Gemini (https://ai.google.dev) embedding models with OpenAI-compatible endpoints
Grouple
lms complete
gruut-ipa
Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)
langfuse
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
Perplexica
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
PitchExtractor
Deep Neural Pitch Extractor for Voice Conversion and TTS Training
Pyramid-Flow
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
Research-Engine
A web app to help you in your research!
resemble-enhance
AI powered speech denoising and enhancement
searxng-docker
The docker-compose files for setting up a SearXNG instance with docker.
segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
shiva
Shiva library: Implementation in Rust of a parser and generator for documents of any type
StyleTTS-ZS
StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion
SuperAGI
<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
unstract
No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents
Video-Creator
This project is to automate the video creation.
whisper-web
ML-powered speech recognition directly in your browser