Prediction of Stroke possibilities using various Classification Models

Author(s): Kameshwaran Ganesan, Pavithra Mamallan


This problem concerns with predicting whether the patient will get stroke in the future with predictors like age, gender, smoking status, body mass index, whether they had heart disease, whether they had hypertension etc. Since the output variable is categorical in nature, it is a classification problem. Many classification techniques are used with the help of three main Business Analytics tools such as Excel, R and Python. The data is understood through Exploratory Data Analysis, then Data Pre- Processing is done to prepare the data, various models are built on the data, and finally, Error metrics are used to compare the results.