cmblir / dinnovation

dinnovation was developed to simplify collecting, processing, and loading the data required for the Big Data Center. The project relies on various libraries and is distributed under the Apache 2.0 license.


Download market data from various information sites

*** Important Legal Disclaimer ***

Please note that dinnovation is not affiliated with, endorsed by, or vetted by any of the source sites. Use it at your own risk and discretion.

For details on your rights to use the data you download, see each site's Terms of Use. dinnovation is for personal use only.




Digital Industry Innovation Data Platform: big data collection and processing, database loading, and distribution


Requirements

Required Python version

Python >= 3.9

To install the required libraries from a clone of the repository, use one of the commands below:

pip install -r requirements.txt
python setup.py install

To install the package itself:

pip install dinnovation

Required libraries

pandas==1.5.3
numpy==1.24.2
tqdm==4.64.1
OpenDartReader==0.2.1
beautifulsoup4==4.11.2
urllib3==1.26.14
selenium==4.8.2
webdriver_manager==3.8.5
chromedriver_autoinstaller==0.4.0
psycopg2==2.9.5
sqlalchemy==2.0.4
cryptography==41.0.3
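
As a quick sanity check after installation, the pins above can be compared against what is actually installed. The following is a minimal sketch using only the standard library; the helper names `parse_pins` and `check_environment` are illustrative and are not part of dinnovation.

```python
import sys
from importlib import metadata

# Pins copied from the list above.
PINS = """
pandas==1.5.3
numpy==1.24.2
tqdm==4.64.1
OpenDartReader==0.2.1
beautifulsoup4==4.11.2
urllib3==1.26.14
selenium==4.8.2
webdriver_manager==3.8.5
chromedriver_autoinstaller==0.4.0
psycopg2==2.9.5
sqlalchemy==2.0.4
cryptography==41.0.3
"""


def parse_pins(text):
    """Turn 'name==version' lines into a {name: version} dict."""
    pins = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        name, _, version = line.partition("==")
        pins[name.strip()] = version.strip()
    return pins


def check_environment(pins, min_python=(3, 9)):
    """Return a list of problems found; an empty list means no mismatch."""
    problems = []
    if sys.version_info < min_python:
        wanted = ".".join(str(part) for part in min_python)
        problems.append(f"Python {wanted} or newer is required")
    for name, wanted in pins.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            problems.append(f"{name} is not installed (expected {wanted})")
            continue
        if installed != wanted:
            problems.append(f"{name} {installed} is installed, expected {wanted}")
    return problems


if __name__ == "__main__":
    for problem in check_environment(parse_pins(PINS)):
        print(problem)
```

The check is deliberately strict (exact version match) because the list uses `==` pins; loosen the comparison if you install newer compatible releases.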

Dinnovation Architecture

[Diagram: dinnovation_module]

  • Dinnovation's architecture is built for efficiency: a single module collects, processes, and loads data from multiple sites.
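
The collect, process, load flow described above can be illustrated with a small sketch. Everything here is an assumption made for illustration: the function names `collect`, `process`, `load`, and `run_pipeline` are hypothetical and do not reflect dinnovation's actual API, the sample rows are made up, and an in-memory SQLite database stands in for the PostgreSQL target implied by the `psycopg2` and `sqlalchemy` requirements.

```python
import sqlite3


def collect():
    """Hypothetical collector: returns raw rows as a site scraper might."""
    return [
        {"company": " Acme ", "revenue": "1,200"},
        {"company": "Globex", "revenue": "950"},
        {"company": "", "revenue": "10"},  # incomplete row, dropped in process()
    ]


def process(rows):
    """Clean raw rows: trim names, drop unnamed rows, parse numbers."""
    cleaned = []
    for row in rows:
        name = row["company"].strip()
        if not name:
            continue
        cleaned.append((name, int(row["revenue"].replace(",", ""))))
    return cleaned


def load(conn, records):
    """Upsert cleaned records and return the resulting row count."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS company (name TEXT PRIMARY KEY, revenue INTEGER)"
    )
    conn.executemany(
        "INSERT OR REPLACE INTO company (name, revenue) VALUES (?, ?)", records
    )
    conn.commit()
    return conn.execute("SELECT COUNT(*) FROM company").fetchone()[0]


def run_pipeline(conn):
    """Run all three stages in order with one call."""
    return load(conn, process(collect()))


if __name__ == "__main__":
    connection = sqlite3.connect(":memory:")  # stand-in for the real database
    print(run_pipeline(connection), "rows loaded")
```

Keeping each stage as a separate function means a new source site only needs its own collector, while processing and loading stay shared.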

Dinnovation Process

[Diagram: auto_process]

  • Dinnovation's Auto Process uses a scheduler to collect, process, and load data automatically, which is more efficient than running the module by hand.
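
To make the idea of a scheduled run concrete, here is a minimal sketch built on Python's standard `sched` module. It is not dinnovation's scheduler, whose implementation is not described here; `run_periodically` and `collect_process_load` are hypothetical names, and the job body is a placeholder for one full pipeline run.

```python
import sched
import time


def run_periodically(job, interval_seconds, repetitions):
    """Call job() `repetitions` times, waiting `interval_seconds` between calls."""
    scheduler = sched.scheduler(time.monotonic, time.sleep)
    results = []

    def tick(remaining):
        results.append(job())
        if remaining > 1:
            # Re-arm the timer for the next run.
            scheduler.enter(interval_seconds, 1, tick, argument=(remaining - 1,))

    if repetitions > 0:
        scheduler.enter(0, 1, tick, argument=(repetitions,))
    scheduler.run()  # blocks until no events remain
    return results


def collect_process_load():
    """Placeholder for one collect, process, load cycle."""
    return time.strftime("%H:%M:%S")


if __name__ == "__main__":
    print(run_periodically(collect_process_load, interval_seconds=0.1, repetitions=3))
```

A production setup would more likely rely on cron or a long-running service; the sketch only shows the repeat-at-interval pattern.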

Dinnovation Workflow

[Diagram: github_workflow]

  • Dinnovation's workflow was created by a single developer. To keep that work efficient, it automatically updates the package and the GitHub release whenever you modify the code and push it to Git. Version management is automatic as well.
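
Automatic version management usually comes down to incrementing a version string before each release. The sketch below shows one common way to do that for `MAJOR.MINOR.PATCH` versions. It is an assumption for illustration only: the versioning scheme and bump rule dinnovation's workflow actually uses are not stated here, and `bump_version` is a hypothetical helper.

```python
import re

# Accepts plain three-part versions such as "1.4.2".
_VERSION_RE = re.compile(r"^(\d+)\.(\d+)\.(\d+)$")


def bump_version(version, part="patch"):
    """Return `version` with the chosen part incremented and lower parts reset."""
    match = _VERSION_RE.match(version)
    if match is None:
        raise ValueError(f"not a MAJOR.MINOR.PATCH version: {version!r}")
    major, minor, patch = (int(group) for group in match.groups())
    if part == "major":
        return f"{major + 1}.0.0"
    if part == "minor":
        return f"{major}.{minor + 1}.0"
    if part == "patch":
        return f"{major}.{minor}.{patch + 1}"
    raise ValueError(f"unknown part: {part!r}")


if __name__ == "__main__":
    print(bump_version("0.1.9"))
```

In a CI workflow, a step like this would typically run on each push, write the new version back to the package metadata, and then build and publish.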

How to use

Guide


License: Apache License 2.0


Languages

Python 89.9%, Jupyter Notebook 10.1%