mathpopo

followers

following

stars

mathpopo's repositories

Llama2-Chinese

Llama中文社区，最好的中文Llama大模型，完全开源可商用

Language:Python600

AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Language:PythonNOASSERTION000

Awesome-Linux-Software-zh_CN

🐧 一个 Linux 上超赞的应用，软件，工具以及其它资源的集中地。

000

codellama

Inference code for CodeLlama models

Language:PythonNOASSERTION000

cuda-samples

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

NOASSERTION000

CV-CUDA

CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.

Language:C++Apache-2.0000

CVCUDA_FaceStoreHelper-release

Psyche AI Inc release source "CVCUDA_FaceStoreHelper"

Language:PythonNOASSERTION000

DTLN

Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.

Language:PythonMIT000

EfficientSAM

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Language:Jupyter NotebookApache-2.0000

efficientvit

EfficientViT is a new family of vision models for efficient high-resolution vision.

Apache-2.0000

EMO

000

Experiments-with-Gemma-2B

I’ll be testing different Gemma models and sharing the results here and on my Hugging Face space. Stay tuned for updates!

MIT000

gemma.cpp

lightweight, standalone C++ inference engine for Google's Gemma models.

Apache-2.0000

gpt-engineer

Specify what you want it to build, the AI asks for clarification, and then builds it.

Language:PythonMIT000

infinigen

Infinite Photorealistic Worlds using Procedural Generation

Language:PythonBSD-3-Clause000

llama

Inference code for LLaMA models

NOASSERTION000

magic-avatar

MagicAvatar: Multimodal Avatar Generation and Animation

BSD-3-Clause000

MetaTransformer

Meta-Transformer for Unified Multimodal Learning

Language:PythonApache-2.0000

nvm

Node Version Manager - POSIX-compliant bash script to manage multiple active node.js versions

Language:ShellMIT000

pandas-llm

Pandas-LLM

Language:PythonMIT000

project-based-learning

Curated list of project-based tutorials

MIT000

python-docs-samples

Code samples used on cloud.google.com

Apache-2.0000

Real-ESRGAN

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Language:PythonBSD-3-Clause000

Real-Gemini

Real-time video understanding and interaction through text,audio,image and video with large multi-modal model. 利用多模态大模型的实时视频理解和交互框架，通过文本、语音、图像和视频和这是世界进行问答和交流。

Apache-2.0000

recognize-anything

Code for the Recognize Anything Model (RAM) and Tag2Text Model

Language:Jupyter NotebookApache-2.0000

Retrieval-based-Voice-Conversion-WebUI

Voice data <= 10 mins can also be used to train a good VC model!

Language:PythonMIT000

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:Jupyter NotebookNOASSERTION000

torchexplorer

Interactively inspect module inputs, outputs, parameters, and gradients.

Language:PythonNOASSERTION000

Waifu2x-Extension-GUI

Video, Image and GIF upscale/enlarge(Super-Resolution) and Video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IFRNet, CAIN, DAIN, and ACNet.

Language:C++NOASSERTION000

yolo-world-with-efficientvit-sam

YOLO-World + EfficientViT SAM

Apache-2.0000