ntt720

ドーム's repositories

Abdominal-Trauma-Detection-code

MIT000

bytetrack_cpp

This project uses yolov8 combined with bytetrack to achieve multi-target tracking

000

DiffPPO

Combining Diffusion Models with PPO to Improve Sample Efficiency and Exploration in Reinforcement Learning

MIT000

EAGLE

EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

Apache-2.0000

engy

Engy is an AI-powered development tool that generates fully functional web applications from natural language, streamlining the process from idea to working prototype.

MIT000

Everlyn-1

The first open autoregressive foundational video AI model.

000

gaio

High performance minimalism async-io(proactor) networking for Golang.

MIT000

Gengine

Unleashing the Power of Distributed Content Management and Transformation

LGPL-3.0000

GitHub-Stats-SVG

A highly customizable GitHub stats SVG generator: Most readme card projects on GitHub look B-O-R-I-N-G, so I made a cool one myself. Cyberpunk style :)

MIT000

hallo2

Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation

MIT000

im-server

A high-performance IM server.

Apache-2.0000

Kaggle-4th-Place-Solution-LMSYS-Chatbot-Arena-Human-Preference-Predictions

4th Place Solution for the Kaggle Competition: LMSYS - Chatbot Arena Human Preference Predictions

MIT000

Effortlessly solve LeetCode problems with the power of automation! LeetCode Solver Bot automates fetching problems, generating solutions, debugging, and submission. No more manual coding or debugging—just sit back and let the bot handle the heavy lifting.

000

LightRAG

"LightRAG: Simple and Fast Retrieval-Augmented Generation"

MIT000

Magic-BI

One-stop data intelligence agent, providing insights from all mainstream data formats in a single dialogue box, including documents, databases, business systems, and images.一站式数据智能体，一个对话框提供所有主流格式数据的见解，包括文档、数据库、业务系统和图像。

AGPL-3.0000

mini-omni2

Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。

MIT000

nexa-sdk

Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities.

Apache-2.0000

On-Device-FinLLM

OD-FinLLM is a refined model derived from the LLaMA series, with specific enhancements for Chinese financial knowledge. This model is built by fine-tuning LLaMA using a specialized instruction dataset created from publicly available Chinese financial Q&A data and additional web-scraped financial information.

Apache-2.0000

petereport-zh

PeTeReport中文版，辅助渗透测试过程，让渗透测试报告一键生成，守护网络安全！

BSD-3-Clause000

PUDM

[A Conditional Denoising Diffusion Probabilistic Model for Point Cloud Upsampling, 2024, CVPR]

MIT000

qust

000

rag-men

A Contextual RAG Bot Framework

MIT000

raycast2d-draw-server

000

RoboTwin

RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins

MIT000

Sutra_QAS

A system demo based on Retrival Argument Generation to answer buddism question

MIT000

threestudio-dreambeast

🐱🐶🐲🐮🐷Implementation of DreamBeast: Distilling 3D Fantastical Animals with Part-Aware Knowledge Transfer

000

tx-parser

A powerful library for parsing on-chain transactions into clear, human-readable actions, streamlining blockchain data analysis and interpretation. 🐋

MPL-2.0000

virtual_human_stream

The "virtual_human_stream" project is a real-time digital human system supporting audio-video dialogue. It integrates models like ernerf, musetalk, and wav2lip for voice cloning, video stitching, and streaming via RTMP/WebRTC. It’s optimized for high performance and easy customization, with support for ChatGPT dialogue integration.

Apache-2.0000

vllm-mixed-precision

Support mixed-precsion inference with vllm

000

ntt720

ドーム's repositories

Abdominal-Trauma-Detection-code

bytetrack_cpp

DiffPPO

dynamicPDB

EAGLE

engy

Everlyn-1

gaio

Gengine

GitHub-Stats-SVG

hallo2

im-server

Kaggle-4th-Place-Solution-LMSYS-Chatbot-Arena-Human-Preference-Predictions

LeetCode-Solver-Bot

LightRAG

Magic-BI

mini-omni2

nexa-sdk

On-Device-FinLLM

petereport-zh

PUDM

qust

rag-men

raycast2d-draw-server

RoboTwin

Sutra_QAS

threestudio-dreambeast

tx-parser

virtual_human_stream

vllm-mixed-precision