Archive for the ‘Research’ Category

Some Useful Excel Templates

By Neep Hazarika October 9, 2013

Compare two Lists

Compare two lists.

Compare two lists.

Sometimes, it may be necessary to compare data between two columns  in an Excel spreadsheet. It is particularly important, when analysing backlinks for example, to de-duplicate lists of data into some kind of useful list of domains that is easier… Continue reading

Posted In: Research

Correlation and Data Transformations

By Neep Hazarika September 26, 2013

In this article, we will show how data transformations can be an important tool for the proper statistical analysis of data. The association, or correlation, between two variables can be visualised by creating a scatterplot of the data. In certain instances, it may appear that the relationship between the two variables is not linear; in such a case, a linear correlation analysis may still… Continue reading

Posted In: Research

Majestic SEO Beginners Guide to Correlation: Part 5

By Neep Hazarika August 16, 2013

Review of Previous Parts

In Part 1, we described how two datasets can have strong, weak or no correlation depending on how the points in the resultant scatter plot are located relative to one another along a straight line. In keeping with conventional practice, the two variables are often referred to separately as X, with sample values x1, x2, … , xn, and Y… Continue reading

Posted In: Research

Majestic SEO Beginners Guide to Correlation: Part 4

By Neep Hazarika July 15, 2013

In Parts 1, 2 and 3, we looked at statistical measures of datasets that were purely one-dimensional in nature, i.e., all the observations constituting the dataset were of the same type.  Examples of such datasets are the time elapsed between eruptions of the Old Faithful Geyser

Posted In: Research

A Study of Subnet/ IP Link Ratios

By Neep Hazarika June 25, 2013

This short blog is in response to a recent support ticket querying whether particular domains display significant differences in terms of the ratio of referring subnets to referring IP addresses. We were asked what a “normal” value of this ratio could be.

A statistical study was performed on the Majestic Million data. We found that typical values lay within the range 1.2 to 1.8… Continue reading

Posted In: Research