hsaigroup's repositories

cube-studio

云原生一站式机器学习平台,多租户,数据资产,notebook在线开发,拖拉拽任务流编排,多机多卡分布式训练,超参搜索,推理服务,多集群调度,多项目组资源组,边缘计算,大模型实时训练, ai应用商店

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:1Issues:1Issues:0

Fay

Fay是一个完整的开源项目,包含Fay控制器及数字人模型,可灵活组合出不同的应用场景:虚拟主播、现场推销货、商品导购、语音助理、远程语音助理、数字人互动、数字人面试官及心理测评、贾维斯、Her。 开源项目,非产品试用!!!

Language:JavaScriptLicense:GPL-3.0Stargazers:1Issues:0Issues:0

video_pipe_c

a plugin-oriented framework for video structured.

Language:C++Stargazers:1Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

AI-Vtuber

AI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain(本地/llm)/chatglm/text-generation-webui/闻达】 驱动的虚拟主播【Live2D】,可以在 【Bilibili/抖音/快手】 直播中与观众实时互动 或 直接在本地进行聊天。它使用自然语言处理和文本转语音技术【edge-tts/VITS/elevenlabs/bark-gui】生成对观众问题的回答并可以选择【so-vits-svc/DDSP-SVC】变声;通过特定指令协同Stable Diffusion进行画图展示。并且可以自定义文案进行循环播放。

Language:JavaScriptLicense:GPL-3.0Stargazers:0Issues:0Issues:0

ai_code_reader

AI项目阅读器 by渡码

Language:PythonStargazers:0Issues:0Issues:0

Bark-Voice-Cloning

Bark Voice Cloning and Voice Cloning for Chinese Speech

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

Chat-UniVi

Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

chatglm3_bertvits2

结合了chatglm3和bert_vits2,使得glm3的输出可以通过tts转换成语音

Language:PythonStargazers:0Issues:0Issues:0

ChatGPT-AccessToken-Web

本项目基于使用accesstoken的方式实现了网页版 ChatGPT 的前端,是用ChatGPT-Next-Web项目进行修改而得,默认Main分支对接gpt3.5的模型,gpt4分支对接gpt4模型。另外本项目需要的后端服务是pandora项目。项目是站在ChatGPT-Next-Web和pandora项目的作者肩膀上,感谢他们!

Language:TypeScriptLicense:NOASSERTIONStargazers:0Issues:0Issues:0

ChatGPT-Next-Web-1

One-Click to deploy well-designed ChatGPT web UI on Vercel. 一键拥有你自己的 ChatGPT 网页服务。

Language:TypeScriptLicense:MITStargazers:0Issues:0Issues:0

chatgpt-plus

AI 助手全套开源解决方案,自带运营管理后台,开箱即用。集成了 ChatGPT, Azure, ChatGLM,讯飞星火,文心一言等多个平台的大语言模型。支持 MJ AI 绘画,Stable Diffusion AI 绘画,微博热搜等插件工具。采用 Go + Vue3 + element-plus 实现。

Language:VueLicense:MITStargazers:0Issues:0Issues:0

CogVLM

a state-of-the-art-level open visual language model

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

deepface

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

fastapi

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

GLEE

GLEE: General Object Foundation Model for Images and Videos at Scale

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

gpt_academic

为ChatGPT/GLM提供图形交互界面,特别优化论文阅读润色体验,模块化设计支持自定义快捷按钮&函数插件,支持代码块表格显示,Tex公式双显示,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持清华chatglm等本地模型。兼容复旦MOSS, llama, rwkv, 盘古, newbing, claude等

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

joytag

The JoyTag Image Tagging Model

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

langchain

⚡ Building applications with LLMs through composability ⚡

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Linly-Talker

Digital Avatar Conversational System - Linly-Talker

Language:PythonStargazers:0Issues:0Issues:0

LLaMA-Factory

Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

LLaVA-Plus-Codebase

LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

LLMFarm

llama and other large language models on iOS and MacOS offline using GGML library.

Language:SwiftLicense:MITStargazers:0Issues:0Issues:0

lobe-chat

🤖 Lobe Chat - an open-source, high-performance chatbot framework that supports speech synthesis, multimodal, and extensible Function Call plugin system. Supports one-click free deployment of your private ChatGPT/LLM web application.

Language:TypeScriptLicense:MITStargazers:0Issues:0Issues:0

lobe-chat-plugins

🧩 / 🏪 Plugin Index - This is the plugin index for LobeChat. It accesses index.json from this repository to display a list of available plugins for LobeChat to the user.

Language:TypeScriptStargazers:0Issues:0Issues:0

modelscope-agent

ModelScope-Agent: An agent framework connecting models in ModelScope with the world

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Omni-VideoAssistant

Video QA Assistant based on LLMs

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

PandoraNext-TokensTool

支持脚本一键部署和更新pandoraNext和tokensTool双服务,实现openai账号密码获取pool_token、share_token和access_token,且实现接口调用PandoraNext的API并支持接入到多种开源ChatGpt网页,针对于PandoraNext最新版本4.4管理tokens.json和config.json的可视化网页,可以实现通过网页自定义接口实现批量更改刷新Token,每隔五天自动刷新share_token和pool_token,并实现一键开启暂停重启PandoraNext,支持全部PandoraNext部署方法,并支持接口调用获得share_token和pool_token,且支持热部署,已打包好docker镜像,后续将扩展更多功能!

Language:VueLicense:MITStargazers:0Issues:0Issues:0

Umi-OCR

OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/粘贴/批量导入图片,段落排版/排除水印,扫描/生成二维码。内置多国语言库。

Language:QMLLicense:MITStargazers:0Issues:0Issues:0

Video-LLaVA

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0