Pavel Smirnov (Venopacman)

Venopacman

Geek Repo

Company:EPAM

Github PK Tool:Github PK Tool

Pavel Smirnov's starred repositories

ltrlib

A Learn-to-Rank algorithm library

Language:ScalaLicense:Apache-2.0Stargazers:10Issues:0Issues:0

metarank

A low code Machine Learning personalized ranking service for articles, listings, search results, recommendations that boosts user engagement. A friendly Learn-to-Rank engine

Language:ScalaLicense:Apache-2.0Stargazers:1997Issues:0Issues:0

vosk-tts

Text To Speech Synthesis with Vosk

Language:PythonLicense:Apache-2.0Stargazers:98Issues:0Issues:0

bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Language:PythonLicense:MITStargazers:5600Issues:0Issues:0

lancedb

Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!

Language:PythonLicense:Apache-2.0Stargazers:3280Issues:0Issues:0

PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Language:PythonLicense:Apache-2.0Stargazers:10349Issues:0Issues:0

yamllint

A linter for YAML files.

Language:PythonLicense:GPL-3.0Stargazers:2743Issues:0Issues:0

recurrent-memory-transformer

[NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture.

Language:Jupyter NotebookStargazers:743Issues:0Issues:0

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Language:PythonLicense:MITStargazers:3046Issues:0Issues:0

python-mastery

Advanced Python Mastery (course by @dabeaz)

Language:PythonLicense:CC-BY-SA-4.0Stargazers:10411Issues:0Issues:0

FastSAM

Fast Segment Anything

Language:PythonLicense:AGPL-3.0Stargazers:6992Issues:0Issues:0

LLaMA-Cult-and-More

Large Language Models for All, 🦙 Cult and More, Stay in touch !

Language:HTMLLicense:MITStargazers:421Issues:0Issues:0

scalene

Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals

Language:PythonLicense:Apache-2.0Stargazers:11276Issues:0Issues:0

awesome-mlss

List of summer schools in machine learning + related fields across the globe

License:MITStargazers:2594Issues:0Issues:0

stumpy

STUMPY is a powerful and scalable Python library for modern time series analysis

Language:PythonLicense:NOASSERTIONStargazers:3047Issues:0Issues:0

foundryvtt-docker

An easy-to-deploy Dockerized Foundry Virtual Tabletop server.

Language:ShellLicense:MITStargazers:536Issues:0Issues:0

airbyte

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Language:PythonLicense:NOASSERTIONStargazers:14483Issues:0Issues:0

dbt-clickhouse

The Clickhouse plugin for dbt (data build tool)

Language:PythonLicense:Apache-2.0Stargazers:222Issues:0Issues:0

metabase

The simplest, fastest way to get business intelligence and analytics to everyone in your company :yum:

Language:ClojureLicense:NOASSERTIONStargazers:36983Issues:0Issues:0

ydata-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

Language:PythonLicense:MITStargazers:12147Issues:0Issues:0

doccano

Open source annotation tool for machine learning practitioners.

Language:PythonLicense:MITStargazers:9105Issues:0Issues:0

YouTokenToMe

Unsupervised text tokenizer focused on computational efficiency

Language:C++License:MITStargazers:945Issues:0Issues:0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:62566Issues:0Issues:0

sense2vec

🦆 Contextually-keyed word vectors

Language:PythonLicense:MITStargazers:1604Issues:0Issues:0

sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

Language:C++License:Apache-2.0Stargazers:9684Issues:0Issues:0

pysptk

A python wrapper for Speech Signal Processing Toolkit (SPTK).

Language:PythonLicense:NOASSERTIONStargazers:435Issues:0Issues:0

GifCapture

🏇 Gif capture app for macOS

Language:SwiftLicense:NOASSERTIONStargazers:920Issues:0Issues:0

label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

Language:JavaScriptLicense:Apache-2.0Stargazers:16951Issues:0Issues:0

Chatette

A powerful dataset generator for Rasa NLU, inspired by Chatito

Language:PythonLicense:MITStargazers:313Issues:0Issues:0

Chatito

🎯🗯 Dataset generation for AI chatbots, NLP tasks, named entity recognition or text classification models using a simple DSL!

Language:TypeScriptLicense:MITStargazers:866Issues:0Issues:0