tayueyue

tayueyue

Geek Repo

Github PK Tool:Github PK Tool

tayueyue's starred repositories

diff-match-patch

Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.

Language:PythonLicense:Apache-2.0Stargazers:7321Issues:0Issues:0

text_matching

常用文本匹配模型tf版本,数据集为QA_corpus,持续更新中

Language:PythonLicense:Apache-2.0Stargazers:670Issues:0Issues:0

simple-effective-text-matching-pytorch

A pytorch implementation of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".

Language:PythonLicense:Apache-2.0Stargazers:303Issues:0Issues:0

wiki_zh_word2vec

利用Python构建Wiki中文语料词向量模型试验

Language:PythonStargazers:498Issues:0Issues:0

similarities

Similarities: a toolkit for similarity calculation and semantic search. 相似度计算、匹配搜索工具包,支持亿级数据文搜文、文搜图、图搜图,python3开发,开箱即用。

Language:PythonLicense:Apache-2.0Stargazers:693Issues:0Issues:0

GMS-Feature-Matcher

GMS: Grid-based Motion Statistics for Fast, Ultra-robust Feature Correspondence (CVPR 17 & IJCV 20)

Language:PythonLicense:BSD-3-ClauseStargazers:1070Issues:0Issues:0

DeepMatch

A deep matching model library for recommendations & advertising. It's easy to train models and to export representation vectors which can be used for ANN search.

Language:PythonLicense:Apache-2.0Stargazers:2187Issues:0Issues:0

qdrant

Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

Language:RustLicense:Apache-2.0Stargazers:19153Issues:0Issues:0

tinyid

ID Generator id生成器 分布式id生成系统,简单易用、高性能、高可用的id生成系统

Language:JavaLicense:Apache-2.0Stargazers:2258Issues:0Issues:0

Qualitis

(:star:个人注解版) Qualitis is a one-stop data quality management platform that supports quality verification, notification, and management for various datasource. It is used to solve various data quality problems caused by data processing. https://github.com/WeBankFinTech/Qualitis

Language:JavaLicense:Apache-2.0Stargazers:1Issues:0Issues:0

LarkMidTable

LarkMidTable 是一站式开源的数据中台,实现中台的 基础建设,数据治理,数据开发,监控告警,数据服务,数据的可视化,实现高效赋能数据前台并提供数据服务的产品。

Language:JavaLicense:Apache-2.0Stargazers:1733Issues:0Issues:0

alldata

AllData数据中台开源项目,以数据平台为底座,以数据中台为桥梁,以机器学习平台为中层框架,以大模型应用为上游产品,提供全链路数字化解决方案。加入技术社区:https://docs.qq.com/doc/DVHlkSEtvVXVCdEFo

Language:JavaLicense:GPL-3.0Stargazers:2409Issues:0Issues:0

featureform

The Virtual Feature Store. Turn your existing data infrastructure into a feature store.

Language:Jupyter NotebookLicense:MPL-2.0Stargazers:1756Issues:0Issues:0

soda-core

:zap: Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io

Language:PythonLicense:Apache-2.0Stargazers:1835Issues:0Issues:0

data-diff

Compare tables within or across databases

Language:PythonLicense:MITStargazers:2921Issues:0Issues:0

etcd

Distributed reliable key-value store for the most critical data of a distributed system

Language:GoLicense:Apache-2.0Stargazers:46932Issues:0Issues:0

sled

the champagne of beta embedded databases

Language:RustLicense:Apache-2.0Stargazers:7932Issues:0Issues:0

Chat2DB

🔥🔥🔥AI-driven data management platform Over 1 million developers are using Chat2DB

Language:JavaLicense:Apache-2.0Stargazers:14372Issues:0Issues:0

libevent

Event notification library

Language:CLicense:NOASSERTIONStargazers:10946Issues:0Issues:0

NovaLSM

Nova-LSM is a component-based design of the LSM-tree using fast and high bandwidth networks such as RDMA.

Language:C++License:BSD-3-ClauseStargazers:52Issues:0Issues:0

griddb

GridDB is a next-generation open source database that makes time series IoT and big data fast,and easy.

Language:C++License:AGPL-3.0Stargazers:2350Issues:0Issues:0

canal_mysql_nosql_sync

基于canal 的 mysql 与 redis/memcached/mongodb 的 nosql 数据实时同步方案 案例 demo canal client

Language:JavaStargazers:1427Issues:0Issues:0

SQLAdvisor

输入SQL,输出索引优化建议

Language:CLicense:GPL-2.0Stargazers:5533Issues:0Issues:0

sqlmap

Automatic SQL injection and database takeover tool

Language:PythonLicense:NOASSERTIONStargazers:31431Issues:0Issues:0

sql-parser

SQL Parser for C++. Building C++ object structure from SQL statements.

Language:C++License:MITStargazers:723Issues:0Issues:0

sqlparser

SQL Parser implemented in Go

Language:GoLicense:Apache-2.0Stargazers:1450Issues:0Issues:0

JSqlParser

JSqlParser parses an SQL statement and translate it into a hierarchy of Java classes. The generated hierarchy can be navigated using the Visitor Pattern

Language:JavaLicense:Apache-2.0Stargazers:5223Issues:0Issues:0

wdt

Warp speed Data Transfer (WDT) is an embeddedable library (and command line tool) aiming to transfer data between 2 systems as fast as possible over multiple TCP paths.

Language:C++License:NOASSERTIONStargazers:2853Issues:0Issues:0

squangle

SQuangLe is a C++ API for accessing MySQL servers

Language:C++License:NOASSERTIONStargazers:121Issues:0Issues:0

folly

An open-source C++ library developed and used at Facebook.

Language:C++License:Apache-2.0Stargazers:27616Issues:0Issues:0