This book introduces data science and artificial intelligence using 『eStat』, R and Python. Project leader: Professor Jung Jin Lee, email: jjlee@ssu.ac.kr
This work is in the public domain. Therefore, it can be copied and reproduced without limitation. However, we would appreciate the citation of 『eStat』, http://www.estat.me. |
『eStat』 is a web-based freeware for statistics education which can be used anytime and anywhere using PC, tablet, or mobile phone. |
R is a free software environment for statistical computing and graphics.
R site and to download; https://www.r-project.org
|
Python is a free software environment for statistical computing and graphics.
Python site and to download; https://www.python.org
|
Jung Jin Lee
Emeritus Professor, Soongsil University, Korea Professor, ADA University, Azerbaijan Ph.D. in Operations Research, Case Western Reserve University M.S. in Statistics, Seoul National University B.S. in Computer Science and Statistics, Seoul National University President, Korean Statistical Society Vice President, International Association for Statistical Computing Council Member, International Statistical Institute (ISI) |
Tae Rim Lee Emeritus Professor, Korea National Open University Ph.D. in Statistics, Choongang University M.S. in Statistics, Seoul National University B.S. in Computer Science and Statistics, Seoul National University Vice President, Korean Statistical Society Vice President, International Association for Statistics Education Vice President, International Biometric Society |
Geunseog Kang Emeritus Professor, Soongsil University, Korea Ph.D. in Statistics, University of Wisconsin - Madison M.S. in Statistics, Seoul National University B.S. in Computer Science and Statistics, Seoul National University |
Sung Soo Kim Emeritus Professor, Korea National Open University Ph.D. in Statistics, Seoul National University M.S. in Statistics, Seoul National University B.S. in Computer Science and Statistics, Seoul National University |
Heon Jin Park Professor, Inha University Ph.D. in Statistics, Iowa Stat University M.S. in Statistics, Seoul National University B.S. in Computer Science and Statistics, Seoul National University President, Korean Data Mining Society Dean, College of Natural Science, Inha University |
Song Yong Sim Hallym University Ph.D. in Statistics, University of Wisconsin - Madison M.S. in Statistics, Seoul National University B.S. in Computer Science and Statistics, Seoul National University |
Yoon Dong Lee Professor, Sogang University Ph.D. in Statistics, Iowa State University M.S. in Statistics, Seoul National University B.S. in Computer Science and Statistics, Seoul National University |
Hyun Jo You Professor, Chungnam National University Ph.D in Statistics, Soongsil University M.S., Ph.D in Linguistics, Seoul National University B.S. in Micro-biology, |
Hulisi Ogut Professor, ADA University, Azerbaijan Ph.D. in Management Science, University of Texas at Dallas M.S., University of Texas at Dallas M.A., Boston University B.A., Bilkent University |
The 4th Industrial Revolution aims at super-connectedness, super-intelligence and super-forecasting and many new changes will occur in our lives revolutionary. The revolution would help us to solve many problems, but it would also give us new challenges to be solved at the same time. The biggest challenge is analysis and utilization of Big Data.
The analysis of Big Data can be done by multi-disciplinary areas such as Statistics, Mathematics, Computer Science, and other application areas such as Management which is called Data Science. Data Science is primarily based on traditional statistical methods, applied mathematics, and requires lots of data manipulation using computer software such as R, SAS and SPSS which are widely used require some training from professionals. Authors of this book have been developed 『eStat』 for years which can help all level of students to learn Data Science easily.
This book introduces basic visualization data in Chapter 2, data summary and trasformation in Chater 3. Chapter 4 and 5 review basic statistical model for big data analysis. Chapter 6 and 7 discuss models of supervised machine learning, and Chapter 8 discusses models of unsupervised machine learning. Chapter 9 introduces artificial intelligence and other applications of data science.
I appreciate all of you who have developed 『eStat』 together over the past few years. I appreciate also to all internet communities who have helped us during the development of『eStat』.
Spring 2025
Project Leader: Jung Jin Lee