Archive for the ‘Research’ Category
Compare two Lists
Sometimes, it may be necessary to compare data between two columns in an Excel spreadsheet. It is particularly important, when analysing backlinks for example, to de-duplicate lists of data into some kind of useful list of domains that is easier… Continue reading
In this article, we will show how data transformations can be an important tool for the proper statistical analysis of data. The association, or correlation, between two variables can be visualised by creating a scatterplot of the data. In certain instances, it may appear that the relationship between the two variables is not linear; in such a case, a linear correlation analysis may still… Continue reading
Review of Previous Parts
In Part 1, we described how two datasets can have strong, weak or no correlation depending on how the points in the resultant scatter plot are located relative to one another along a straight line. In keeping with conventional practice, the two variables are often referred to separately as X, with sample values x1, x2, … , xn, and Y… Continue reading
In Parts 1, 2 and 3, we looked at statistical measures of datasets that were purely one-dimensional in nature, i.e., all the observations constituting the dataset were of the same type. Examples of such datasets are the time elapsed between eruptions of the Old Faithful Geyser
This short blog is in response to a recent support ticket querying whether particular domains display significant differences in terms of the ratio of referring subnets to referring IP addresses. We were asked what a “normal” value of this ratio could be.