traderpedroso

Emerson Pedroso's repositories

xphoneBR

XphoneBR is a Brazilian portuguese transformer base grapheme-to-phoneme and normalization tool modeling library that leverages recent deep learning technology and is optimized for usage in production systems such as TTS. In particular, the library should be accurate, fast, easy to use

Language:PythonMIT3 2 1

AFFiNE

There can be more than Notion and Miro. AFFiNE(pronounced [ə‘fain]) is a next-gen knowledge base that brings planning, sorting and creating all together. Privacy first, open-source, customizable and ready to use.

Language:TypeScriptNOASSERTION000

alexa-gpt

A tutorial on how to use ChatGPT in Alexa

Language:PythonMIT000

AuxiliaryASR

Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)

Language:PythonMIT000

CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Language:PythonApache-2.0000

consumo-api-datajud

Projeto em Angular para consumir a API do Datajud, possibilitando a visualização e gestão de dados judiciais de forma eficiente. Inclui integração com o serviço de backend e manipulação de requisições HTTP, facilitando o acesso a informações através de endpoints fornecidos pela API.

MIT000

docker

The Docker configuration for Cal.com is an effort powered by people within the community. Cal.com, Inc. does not provide official support for Docker, but we will accept fixes and documentation. Use at your own risk.

MIT000

facefusion

Next generation face swapper and enhancer

NOASSERTION000

faucet

Bitcoinnano Faucet

Language:TypeScriptGPL-3.0000

faucetbuilded

000

gemini-openai-nextjs

OpenAI to Google Gemini https://gemini-openai-proxy.deno.dev

MIT000

gemini-to-openai-proxy

Call Gemini (https://ai.google.dev) embedding models with OpenAI-compatible endpoints

Language:GoMIT000

Grouple

lms complete

000

gruut-ipa

Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)

Language:PythonMIT000

langfuse

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

NOASSERTION000

Perplexica

Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI

MIT000

PitchExtractor

Deep Neural Pitch Extractor for Voice Conversion and TTS Training

Language:PythonMIT000

Pyramid-Flow

Code of Pyramidal Flow Matching for Efficient Video Generative Modeling

MIT000

Research-Engine

A web app to help you in your research!

AGPL-3.0000

resemble-enhance

AI powered speech denoising and enhancement

Language:PythonMIT000

searxng-docker

The docker-compose files for setting up a SearXNG instance with docker.

AGPL-3.0000

segment-anything-2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Apache-2.0000

shiva

Shiva library: Implementation in Rust of a parser and generator for documents of any type

Language:RustGPL-3.0000

StyleTTS-ZS

StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion

000

SuperAGI

<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.

MIT000

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonMPL-2.0000

unstract

No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents

AGPL-3.0000

Video-Creator

This project is to automate the video creation.

AGPL-3.0000

video_creator_frontend

000

whisper-web

ML-powered speech recognition directly in your browser

MIT000