This book introduces data science and artificial intelligence using 『eStat』 and R. Project leader: Professor Jung Jin Lee, email: jjlee@ssu.ac.kr
This work is in the public domain. Therefore, it can be copied and reproduced without limitation. However, we would appreciate the citation of 『eStat』, http://www.estat.me. |
『eStat』 is a web-based freeware for statistics education which can be used anytime and anywhere using PC, tablet, or mobile phone. |
R is a free software environment for statistical computing and graphics.
R site and to download; https://www.r-project.org
|
Jung Jin Lee
Emeritus Professor, Soongsil University, Korea Professor, ADA University, Azerbaijan B.S. and M.S., Seoul National University Ph.D. in O.R., Case Western Reserve University President, Korean Statistical Society Vice President, International Association for Statistical Computing Council Member, International Statistical Institute (ISI) |
Tae Rim Lee Emeritus Professor, Korea National Open University B.S. and M.S., Seoul National University Ph.D. in Statistics, Choongang University Vice President, Korean Statistical Society Vice President, International Association for Statistics Education Vice President, International Biometric Society |
Geunseog Kang Emeritus Professor, Soongsil University, Korea B.S. and M.S., Seoul National University Ph.D. in Statistics, University of Wisconsin - Madison |
Sung Soo Kim Professor, Korea National Open University B.S., M.S., Ph.D. in Statistics, Seoul National University |
Heon Jin Park Professor, Inha University B.S. and M.S., Seoul National University Ph.D. in Statistics, Iowa Stat University President, Korean Data Mining Society |
Song Yong Sim Hallym University B.S. and M.S., Seoul National University Ph.D. in Statistics, University of Wisconsin - Madison |
Yoon Dong Lee Professor, Sogang University B.S. and M.S., Seoul National University Ph.D. in Statistics, Iowa State University |
Hyun Jo You Professor, Chungnam National University B.S., M.S., Ph.D in Linguistics, Seoul National University Ph.D in Statistics, Soongsil University |
The 4th Industrial Revolution aims at super-connectedness, super-intelligence and super-forecasting and many new changes will occur in our lives revolutionary. The revolution would help us to solve many problems, but it would also give us new challenges to be solved at the same time. The biggest challenge is analysis and utilization of Big Data.
The analysis of Big Data can be done by multi-disciplinary areas such as Statistics, Mathematics, Computer Science, and other application areas such as Management which is called Data Science. Data Science is primarily based on traditional statistical methods, applied mathematics, and requires lots of data manipulation using computer software such as R, SAS and SPSS which are widely used require some training from professionals. Authors of this book have been developed 『eStat』 for years which can help all level of students to learn Data Science easily.
This book introduces basic visualization data in Chapter 2, data summary and trasformation in Chater 3. Chapter 4 and 5 review basic statistical model for big data analysis. Chapter 6 and 7 discuss models of supervised machine learning, and Chapter 8 discusses models of unsupervised machine learning. Chapter 9 introduces artificial intelligence and other applications of data science.
I appreciate all of you who have developed 『eStat』 together over the past few years. I appreciate also to all internet communities who have helped us during the development of『eStat』.
Spring 2025
Project Leader: Jung Jin Lee