venkata_d (Venkata09)

Company: Capital One

Location: Virginia

Home Page: https://venkata09.github.io/dataworks/

venkata_d's starred repositories

java-design-patterns

Design patterns implemented in Java

Language: Java | License: NOASSERTION | Stargazers: 88474 | Issues: 3783 | Issues: 937

interviews

Everything you need to know to get the job.

Language: Java | License: MIT | Stargazers: 62874 | Issues: 2611 | Issues: 53

handson-ml2

A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 27376 | Issues: 656 | Issues: 510

incubator-seata

🔥 Seata is an easy-to-use, high-performance, open source distributed transaction solution.

Language: Java | License: Apache-2.0 | Stargazers: 25114 | Issues: 848 | Issues: 3763

Sentinel

A powerful flow control component enabling reliability, resilience and monitoring for microservices. (A high-availability flow control and protection component for cloud-native microservices)

Language: Java | License: Apache-2.0 | Stargazers: 22186 | Issues: 789 | Issues: 2239

Language: TeX | Stargazers: 18334 | Issues: 0 | Issues: 0

TensorFlow-Tutorials

TensorFlow Tutorials with YouTube Videos

Language: Jupyter Notebook | License: MIT | Stargazers: 9266 | Issues: 543 | Issues: 110

delta

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Language: Scala | License: Apache-2.0 | Stargazers: 7300 | Issues: 219 | Issues: 1440

hudi

Upserts, Deletes And Incremental Processing on Big Data.

Language: Java | License: Apache-2.0 | Stargazers: 5235 | Issues: 1171 | Issues: 3108

SynapseML

Simple and Distributed Machine Learning

Language: Scala | License: MIT | Stargazers: 5020 | Issues: 146 | Issues: 716

ftgo-application

Example code for the book "Microservices Patterns"

Language: Java | License: NOASSERTION | Stargazers: 3358 | Issues: 189 | Issues: 117

deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Language: Scala | License: Apache-2.0 | Stargazers: 3190 | Issues: 81 | Issues: 333

BigDL-2.x

BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2652 | Issues: 108 | Issues: 1

spacy-course

👩‍🏫 Advanced NLP with spaCy: A free online course

Language: Python | License: MIT | Stargazers: 2288 | Issues: 62 | Issues: 40

interviewpen

Code samples for Back to Back SWE lessons (archive).

awesome-transformer-nlp

A curated list of NLP resources focused on Transformer networks, attention mechanism, GPT, BERT, ChatGPT, LLMs, and transfer learning.

License: MIT | Stargazers: 1050 | Issues: 41 | Issues: 0

rxjava-jdbc

Efficient execution and functional composition of database calls using JDBC and RxJava Observables

Language: Java | License: Apache-2.0 | Stargazers: 807 | Issues: 54 | Issues: 66

spark-daria

Essential Spark extensions and helper methods ✨😲

Language: Scala | License: MIT | Stargazers: 743 | Issues: 33 | Issues: 73

cassandra-reaper

Automated Repair Awesomeness for Apache Cassandra

Language: Java | License: Apache-2.0 | Stargazers: 483 | Issues: 38 | Issues: 770

spark-sql-internals

The Internals of Spark SQL

rxjava2-jdbc

RxJava2 integration with JDBC including Non-blocking Connection Pools

Language: Java | License: Apache-2.0 | Stargazers: 390 | Issues: 26 | Issues: 57

LeetCode

Analysis of solutions to LeetCode problems (Java and Python)

Text-Summarizer-Pytorch

PyTorch implementation of the paper "A Deep Reinforced Model for Abstractive Summarization" and of a pointer-generator network

kaggle-freesound-audio-tagging

8th place solution (on Kaggle) to the Freesound General-Purpose Audio Tagging Challenge (DCASE 2018 - Task 2)

Language: Python | License: MIT | Stargazers: 113 | Issues: 9 | Issues: 4

waimak

Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.

Language: Scala | License: Apache-2.0 | Stargazers: 75 | Issues: 13 | Issues: 36

dl4s

Source code accompanying the book "Deep Learning for Search"

Spark

Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References

Language: Jupyter Notebook | Stargazers: 70 | Issues: 9 | Issues: 1

hdfs-metadata

Tool for gathering block and replica metadata from HDFS. It also builds a heat map showing how replicas are distributed across disks and nodes.

Language: Java | License: GPL-3.0 | Stargazers: 56 | Issues: 15 | Issues: 3

quasi-rnn

A PyTorch Implementation of "Quasi-Recurrent Neural Networks"

etl-light

A light Kafka to HDFS/S3 ETL library based on Apache Spark

Language: Scala | License: MIT | Stargazers: 42 | Issues: 5 | Issues: 1