As shown by the figures above, the response variable, rent prices, is observed to be slightly normally distributed. The QQ-plot suggest that there are some deviations from the line, especially at the beginning, however the data later begins to settle more closely to the line. The histogram suggest that there is a bell curve shape, as seen in normal distribution. Although, the graph does seem to be more right skewed.
The univariate analysis illustrates that, Size, BHK and Bathroom are right skewed. City is well distributed amongst the data set. Furnishing status has the least frequent category as furnished rental places. The majority of properties opt for Bachelors/Family, and most properties are directly contacted through their owner. Finally, most properties are posted between May and June. With this in mind we can begin to fit the model and investigate which of these variables will have an influence in the renting prices observed.
Initial Model
Final model
Final Correlation Matrix
The first diagram, which depicts the Absolute vs. Predicted Residuals, shows how the variability is somewhat constant as most of the data is found within the same general area in both axes. The QQ-plot and the histogram for the residuals shows that the data is normally distributed. As the points in the QQ-Plot follow the line and the Histogram has a bell-shaped curve distribution. The residuals have also been visualized to be independent as the Order of Collection vs. Residuals diagram shows no trend in the data. The rest of the diagrams demonstrate their linearity related to the outcome and the explanatory variable themselves. Since all the assumptions of the model are not infringed this means, the model has been fit adequately, and no further transformations are needed.