Dreamsome's repositories
pandasticsearch
An Elasticsearch client exposing DataFrame API
HuggingFace-Datasets-Text-Quality-Analysis
Retrieves parquet files from Hugging Face, identifies and quantifies junky data, duplication, contamination, and biased content in dataset using pandas
Prompt_Engineering_with_Qwen
Qwen 提示词工程 & 最佳实践
vivid_schemer
REPL for The Little Schemer
awesome-java-cn
Java资源大全中文版,包括开发库、开发工具、网站、博客、微信、微博等,由伯乐在线持续更新。
codelab-mindstorms
CodeLab Mindstorms关注编程教育, 计划翻译和解读编程教育领域优秀的探索者所做的工作。
httpie-hmac-auth
Auth plugin for debugging ODPS API with httpie
awesome-business-intelligence
Actively curated list of awesome BI tools. PRs welcome!
dbt_stripe_source
Fivetran's Stripe source dbt package
docs.getdbt.com
The code behind docs.getdbt.com
jedis-cluster-ext
extend jedis cluster to support pipeline
Orestes-Bloomfilter
Library of different Bloom filters in Java with optional Redis-backing, counting and many hashing options.
spark-janelia
scripts for using spark on janelia's cluster
spark-jobserver
REST job server for Apache Spark
Stream-Framework
Stream Framework is a Python library, which allows you to build newsfeed and notification systems using Cassandra and/or Redis.
xorbits
Scalable Python data science, in an API compatible & lightning fast way.