杨夕 (km1994)

km1994

User data from Github https://github.com/km1994

Company:某某螺丝钉加工厂

GitHub:@km1994

杨夕's repositories

LLMs_interview_notes

该仓库主要记录 大模型(LLMs) 算法工程师相关的面试题

AwesomeMultiModel

【AIGC 实战入门笔记 —— AIGC 摩天大楼】分享 大语言模型(LLMs),大模型高效微调(SFT),检索增强生成(RAG),智能体(Agent),PPT自动生成, 角色扮演,文生图(Stable Diffusion) ,图像文字识别(OCR),语音识别(ASR),语音合成(TTS),人像分割(SA),多模态(VLM),Ai 换脸(Face Swapping), 文生视频(VD),图生视频(SVD),Ai 动作迁移,Ai 虚拟试衣,数字人,全模态理解(Omni),Ai音乐生成 干货学习 等 实战与经验。

Awesome-Tabular-LLMs

We collect papers about "large language models (LLM) for table-related tasks", e.g., using LLM for Table QA task. “表格+LLM”相关论文整理

Stargazers:4Issues:0Issues:0

Douyin_TikTok_Download_API

🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音、快手、TikTok、Bilibili数据爬取工具,支持API调用,在线批量解析及下载。

License:Apache-2.0Stargazers:3Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:3Issues:0Issues:0

awesome-mcp-servers

A collection of MCP servers.

License:MITStargazers:2Issues:0Issues:0

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

Stargazers:2Issues:0Issues:0

cv_note

记录cv算法工程师的成长之路,分享计算机视觉和模型压缩部署技术栈笔记。https://harleyszhang.github.io/cv_note/

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

llm-paper-daily

Daily updated LLM papers. 每日更新 LLM 相关的论文,欢迎订阅 👏 喜欢的话动动你的小手 🌟 一个

Stargazers:1Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

OpenManus

No fortress, purely open ground. OpenManus is Coming.

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

agents-course

This repository contains the Hugging Face Agents Course.

Language:MDXLicense:Apache-2.0Stargazers:0Issues:0Issues:0

awesome-rag

Awesome-RAG: Collect typical RAG papers and systems.

License:Apache-2.0Stargazers:0Issues:0Issues:0

Controllable-RAG-Agent

This repository provides an advanced Retrieval-Augmented Generation (RAG) solution for complex question answering. It uses sophisticated graph based algorithm to handle the tasks.

License:Apache-2.0Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

deer-flow

DeerFlow is a community-driven framework for deep research, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

License:MITStargazers:0Issues:0Issues:0

DiffRhythm

Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion

License:Apache-2.0Stargazers:0Issues:0Issues:0

flowgram.ai

FlowGram is a node-based flow building engine that helps developers quickly create workflows in either fixed layout or free connection layout modes

License:MITStargazers:0Issues:0Issues:0
License:MIT-0Stargazers:0Issues:0Issues:0

marp-cli

A CLI interface for Marp and Marpit based converters

License:MITStargazers:0Issues:0Issues:0

open-r1

Fully open reproduction of DeepSeek-R1

License:Apache-2.0Stargazers:0Issues:0Issues:0

Prompt_Engineering

This repository offers a comprehensive collection of tutorials and implementations for Prompt Engineering techniques, ranging from fundamental concepts to advanced strategies. It serves as an essential resource for mastering the art of effectively communicating with and leveraging large language models in AI applications.

License:NOASSERTIONStargazers:0Issues:0Issues:0

RAG_Techniques

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.

License:NOASSERTIONStargazers:0Issues:0Issues:0

roop

one-click face swap

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

UniAnimate

Code for Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".

Language:PythonStargazers:0Issues:0Issues:0

Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

WeClone

🚀从聊天记录创造数字分身的一站式解决方案💡 使用微信聊天记录微调大语言模型,让大模型有“那味儿”,并绑定到聊天机器人,实现自己的数字分身。 数字克隆/数字分身/数字永生/声音克隆/LLM/大语言模型/微信聊天机器人/LoRA

License:AGPL-3.0Stargazers:0Issues:0Issues:0