answer-
1.Take the whole data set as the input.
2.calculate entropy of the target variable as well as teh predictor attributes.
3.calculate your information gain of all attributes.
4.choose the attribute with the highest information gain as the root node.
5.repeat the same procedure on every branch until decision node of each branch is finalised.