+ View Gallery

Published Date: May 6, 2018

Available In



In the ZIP file, you will get a) the self instructed recipe (code) - R script (DSR-014.r), b) A short tutorial of applied machine learning (DSR-014.pdf) and c) the dataset used in the recipe - IRIS dataset (iris.data.csv).

Sample Codes


Visited 344 times , 1 Visit today

In this Data Science Recipe, you will learn:

  1. Different types of Machine Learning problems.
  2. How to organise a Predictive Modelling Machine Learning project.
  3. Different types of data used for predictive modelling.
  4. Different elements of data used for predictive modelling.
  5. How to install R and MySQL.
  6. How to implement Decision Tree for Multiclass Classification Algorithm in R.
  7. How to tune parameters: manual tuning and automatic tuning in R.
  8. How to compare Algorithms with Accuracy and Kappa using caret package in R.

What is Machine Learning?

Machine learning is the science of getting computers to act without being explicitly program. It is a subset of AI: Artificial Intelligence. Predictive modelling is a branch of Machine Learning that particularly deals with tabular data to explicitly find patterns and/or insights from the data available.

Types of Machine Learning Problems

There are common classes of problems in Machine Learning. The problems discussed below are standards for most of the ML based predictive modelling problems.

  • Classification (or Supervised Learning): Data are labelled meaning that they are assigned to classes, for example spam/non-spam or fraud/non-fraud. The decision being modelled is to assign labels to new unlabelled pieces of data. Classification should be Binary classification and Multi-class classification.
  • Regression (or Supervised Learning): Data are labelled with a real value (think of a real number) rather than a label/class. Examples that are easy to understand are time series data like the price of a stock over time, monthly sales volume of a store etc. The decision being modelled is what value to predict for new unpredicted data.
  • Clustering (or Unsupervised Learning): Data are not labelled, but can be divided into groups based on similarity and other measures of natural structure in the data.





The information and recipe presented within this eArticle/code is only for educational and coaching purposes for beginners and learners. Anyone can practice and apply the recipe presented here, but the reader is taking full responsibility for his/her actions. 
The author of this recipe (code / program) has made every effort to ensure the accuracy of the information was correct at time of publication. The author does not assume and hereby disclaims any liability to any party for any loss, damage, or disruption caused by errors or omissions, whether such errors or omissions result from accident, negligence, or any other cause. Some of the information presented here could also be found in public knowledge domains.


Leave a Reply

Your Rating for this listing: