MovLab2's repositories

Agent-0

This project is a **proof of concept** that aims to replicate the reasoning capabilities of OpenAI's newly released O1 model.

Language:PythonStargazers:0Issues:0Issues:0

agent-zero

Agent Zero AI framework

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

agents

Build real-time multimodal AI applications 🤖🎙️📹

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

amica

Amica is an open source interface for interactive communication with 3D characters with voice synthesis and speech recognition.

License:MITStargazers:0Issues:0Issues:0

Aria

Codebase for Aria - an Open Multimodal Native MoE

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

cognitive-services-speech-sdk

Sample code for the Microsoft Cognitive Services Speech SDK

License:MITStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

DingoQuadruped

Base code for the Dingo quadruped; modified from Stanford Pupper and Notspot repositories. Includes integration with ROS Noetic and a simulation of the Dingo

License:MITStargazers:0Issues:0Issues:0

EMO

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

Stargazers:0Issues:0Issues:0

F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

License:MITStargazers:0Issues:0Issues:0
License:NOASSERTIONStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

local-talking-llm

A talking LLM that runs on your own computer without needing the internet.

License:MITStargazers:0Issues:0Issues:0

MemGPT

Create LLM agents with long-term memory and custom tools 📚🦙

License:Apache-2.0Stargazers:0Issues:0Issues:0

Microsoft-Activation-Scripts

Open-source Windows and Office activator featuring HWID, Ohook, KMS38, and Online KMS activation methods, along with advanced troubleshooting.

License:GPL-3.0Stargazers:0Issues:0Issues:0

mobile-aloha

Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation

License:MITStargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

open-webui

User-friendly WebUI for LLMs (Formerly Ollama WebUI)

License:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

openedai-speech

An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.

License:AGPL-3.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
License:AGPL-3.0Stargazers:0Issues:0Issues:0

text-to-audio2face

Web interface to convert text to speech and route it to an Audio2Face streaming player.

License:MITStargazers:0Issues:0Issues:0

UEVR

Universal Unreal Engine VR Mod (4.8 - 5.3)

Stargazers:0Issues:0Issues:0

voice-changer

リアルタイムボイスチェンジャー Realtime Voice Changer

License:NOASSERTIONStargazers:0Issues:0Issues:0

WindowsAgentArena

Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.

License:MITStargazers:0Issues:0Issues:0