NarmadhaMM / IntroDataScience

Book Draft: Introduction to Data Science (https://scientistcafe.com/ids/)

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

This is a draft of the book Introduction to Data Science

Please note that this work is being written under a Contributor Code of Conduct and released under a CC-BY-NC-SA license. By participating in this project (for example, by submitting a pull request with suggestions or edits) you agree to abide by its terms.

Goals of the Book

Data Science is a cross-disciplinary subject involving hands-on experience and business problem-solving exposures. The majority of existing introduction books in data science are about the modeling techniques and implementation using R or Python. However, they fail to introduce data science in a context of the industrial environment. Moreover, a crucial part, the art of data science in practice, is often missing. This book focuses on the hands-on and business problem-solving such that readers can quickly and easily transfer knowledge to real-world practice.

Based on industry experience, this book outlines the real world scenario and points out pitfalls data science practitioners should avoid. It also covers big data cloud platform and the art of data science such as soft skills. We use R as the main tool and provide code for both R and Python.

Our Target Audience

This text is intended for a broad audience as both an introduction to data science as well as a guide to applying them. Non-mathematical readers will appreciate the emphasis on problem-solving with real data across a wide variety of applications and the reproducibility of the companion R and python code.

Readers should know basic statistical ideas, such as correlation and linear regression analysis. While the text is biased against complex equations, a mathematical background is needed for advanced topics.

Key Features

  • Comprehensive: covers not only technical skills but also soft skills and big data environment in industry
  • Hands-on using both R and Python
  • Repeatable
  • End-to-end cycle of a typical data science project
  • Pitfalls that everyone should avoid

Others

  • CRC Press
  • Deadline: Dec 1, 2019
  • Approximately 450 ~ 500 pages
  • Color book

Short links:

About

Book Draft: Introduction to Data Science (https://scientistcafe.com/ids/)

License:Creative Commons Zero v1.0 Universal


Languages

Language:HTML 61.2%Language:Jupyter Notebook 30.6%Language:TeX 7.9%Language:CSS 0.3%