Slice (xuanhan863)

xuanhan863

Geek Repo

Location:Los Angeles, USA

Github PK Tool:Github PK Tool

Slice's starred repositories

marker

Convert PDF to markdown quickly with high accuracy

Language:PythonLicense:GPL-3.0Stargazers:14707Issues:62Issues:174

timesfm

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.

Language:PythonLicense:Apache-2.0Stargazers:3248Issues:28Issues:67

2d-gaussian-splatting

[SIGGRAPH'24] 2D Gaussian Splatting for Geometrically Accurate Radiance Fields

Language:PythonLicense:NOASSERTIONStargazers:1699Issues:40Issues:108

ja4

JA4+ is a suite of network fingerprinting standards

Language:RustLicense:NOASSERTIONStargazers:809Issues:20Issues:64

markdowner

A fast tool to convert any website into LLM-ready markdown data. Built by https://supermemory.ai

Language:TypeScriptLicense:MITStargazers:634Issues:5Issues:4

scribe

Renders music in HTML.

RADIO

Official repository for "AM-RADIO: Reduce All Domains Into One"

Language:PythonLicense:NOASSERTIONStargazers:521Issues:21Issues:22

paddler

Stateful load balancer custom-tailored for llama.cpp

Language:GoLicense:MITStargazers:452Issues:6Issues:3

chatgpt-cli

ChatGPT CLI is an advanced command-line interface for ChatGPT models via OpenAI and Azure, offering streaming, query mode, and history tracking for seamless, context-aware conversations. Ideal for both users and developers, it provides advanced configuration and easy setup options to ensure a tailored conversational experience with the GPT model.

Language:GoLicense:MITStargazers:415Issues:10Issues:32

Glyph-ByT5

[ECCV2024] This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual Visual Text Rendering""

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:413Issues:17Issues:15

T-GATE

T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!

Language:PythonLicense:MITStargazers:318Issues:12Issues:14

gazelle

Joint speech-language model - respond directly to audio!

Language:PythonLicense:Apache-2.0Stargazers:293Issues:12Issues:1

radient

Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.

Language:PythonLicense:BSD-2-ClauseStargazers:240Issues:4Issues:1

mmdit

Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch

Language:PythonLicense:MITStargazers:200Issues:3Issues:1

RIVAL

[NeurIPS 2023 Spotlight] Real-World Image Variation by Aligning Diffusion Inversion Chain

Language:PythonLicense:Apache-2.0Stargazers:142Issues:17Issues:8

VoiceLDM

VoiceLDM: Text-to-Speech with Environmental Context

Language:PythonLicense:Apache-2.0Stargazers:136Issues:7Issues:4

gezgin

Modern Pathfinding Using OpenStreetMap Data with Raylib

Language:C++License:WTFPLStargazers:132Issues:1Issues:0

FAcodec

Training code for FAcodec presented in NaturalSpeech3

MuLan

MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (无需额外训练为任意扩散模型支持多语言能力)

stream-vc

An unofficial PyTorch implementation of the StreamVC(Real-Time Low-Latency Voice Conversion)

Language:PythonStargazers:87Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:62Issues:12Issues:5

ClickDiffusion

ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing

Language:PythonLicense:MITStargazers:61Issues:2Issues:2

nvImageCodec

A nvImageCodec library of GPU- and CPU- accelerated codecs featuring a unified interface

Language:C++License:Apache-2.0Stargazers:54Issues:10Issues:7

X-Oscar

About Official repository for "X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation"

Language:PythonStargazers:46Issues:0Issues:0

TurboT5

Truly flash T5 realization!

Language:PythonLicense:MITStargazers:24Issues:0Issues:0

Spatial-AST

🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models)

Language:PythonLicense:NOASSERTIONStargazers:23Issues:0Issues:0

DAC-JAX

A JAX Implementation of the Descript Audio Codec

Language:PythonLicense:MITStargazers:17Issues:2Issues:0

PDM-Pure

PDM-based Purifier

forcealign

ForceAlign is a Python library for forced alignment of English text to English audio. You can use ForceAlign to get word or phoneme level text alignments of audio, with each word or phoneme's start and end time within the audio. ForceAlign was designed to be easy to install and use, without requiring any third-party, non-Python dependencies.

Language:PythonLicense:MITStargazers:8Issues:0Issues:0