mathpopo's repositories

Llama2-Chinese

Llama中文社区,最好的中文Llama大模型,完全开源可商用

Language:PythonStargazers:6Issues:0Issues:0

AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

Awesome-Linux-Software-zh_CN

🐧 一个 Linux 上超赞的应用,软件,工具以及其它资源的集中地。

Stargazers:0Issues:0Issues:0

codellama

Inference code for CodeLlama models

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

cuda-samples

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

License:NOASSERTIONStargazers:0Issues:0Issues:0

CV-CUDA

CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

CVCUDA_FaceStoreHelper-release

Psyche AI Inc release source "CVCUDA_FaceStoreHelper"

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

DTLN

Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

EfficientSAM

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

efficientvit

EfficientViT is a new family of vision models for efficient high-resolution vision.

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

Experiments-with-Gemma-2B

I’ll be testing different Gemma models and sharing the results here and on my Hugging Face space. Stay tuned for updates!

License:MITStargazers:0Issues:0Issues:0

gemma.cpp

lightweight, standalone C++ inference engine for Google's Gemma models.

License:Apache-2.0Stargazers:0Issues:0Issues:0

gpt-engineer

Specify what you want it to build, the AI asks for clarification, and then builds it.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

infinigen

Infinite Photorealistic Worlds using Procedural Generation

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

llama

Inference code for LLaMA models

License:NOASSERTIONStargazers:0Issues:0Issues:0

magic-avatar

MagicAvatar: Multimodal Avatar Generation and Animation

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

MetaTransformer

Meta-Transformer for Unified Multimodal Learning

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

nvm

Node Version Manager - POSIX-compliant bash script to manage multiple active node.js versions

Language:ShellLicense:MITStargazers:0Issues:0Issues:0

pandas-llm

Pandas-LLM

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

project-based-learning

Curated list of project-based tutorials

License:MITStargazers:0Issues:0Issues:0

python-docs-samples

Code samples used on cloud.google.com

License:Apache-2.0Stargazers:0Issues:0Issues:0

Real-ESRGAN

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

Real-Gemini

Real-time video understanding and interaction through text,audio,image and video with large multi-modal model. 利用多模态大模型的实时视频理解和交互框架,通过文本、语音、图像和视频和这是世界进行问答和交流。

License:Apache-2.0Stargazers:0Issues:0Issues:0

recognize-anything

Code for the Recognize Anything Model (RAM) and Tag2Text Model

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Retrieval-based-Voice-Conversion-WebUI

Voice data <= 10 mins can also be used to train a good VC model!

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0

torchexplorer

Interactively inspect module inputs, outputs, parameters, and gradients.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

Waifu2x-Extension-GUI

Video, Image and GIF upscale/enlarge(Super-Resolution) and Video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IFRNet, CAIN, DAIN, and ACNet.

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

yolo-world-with-efficientvit-sam

YOLO-World + EfficientViT SAM

License:Apache-2.0Stargazers:0Issues:0Issues:0