Requirements
First, you will need a CSV containing all of your internal links.
At the very least, your file should have a column for the source (the origin of the internal link) and a column for the destination (where the link leads). However, if possible, include additional columns for the anchor, status code, and type of link (such as image, text, and hreflang) to enrich your analysis.
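Before uploading anything, it can help to confirm that the export really contains the two required columns. The following is a minimal sketch, assuming the header labels are exactly "Source" and "Destination"; your crawler may name them differently, and the function name `missing_columns` is only illustrative.

```python
import csv
import io

# Assumed column names; adjust them to match your crawler's export.
REQUIRED_COLUMNS = {"Source", "Destination"}


def missing_columns(csv_file, required=REQUIRED_COLUMNS):
    """Return the required column names absent from the CSV header, sorted."""
    reader = csv.reader(csv_file)
    header = next(reader, [])  # an empty file yields an empty header
    present = {name.strip() for name in header}
    return sorted(set(required) - present)


# Small in-memory sample standing in for the real export.
sample = io.StringIO("Type,Source,Destination,Anchor\nHyperlink,/a,/b,Home\n")
print(missing_columns(sample))  # prints []
```

For a real file, pass an open handle instead, for example `open("links.csv", newline="", encoding="utf-8-sig")`, where the file name is a placeholder and `utf-8-sig` guards against a byte-order mark at the start of the header.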
For example, I used data from my company’s website. While it’s a small site with 1,678 pages (including redirects and error pages), it contains 338,656 links when accounting for CSS, JavaScript, sitemaps, and more. Although that is manageable in a raw Excel sheet, applying custom formulas and filters could become challenging.
Below are my CSV columns:
- Type: Identifies whether the link comes from a sitemap, hreflang, canonical, simple hyperlink, image, CSS, etc.
- Source: The page where the link is located.
- Destination: The target page the link points to.
- Alt Text: If the link is an image, this column contains its alt attribute text.
- Anchor: The anchor text of the link.
- Status Code: The HTTP status code of the destination.
- Status: The status of the destination (e.g., canonicalized, non-indexable).
- Follow: Useful for determining whether the link affects SEO.
- Link position: Indicates whether the link is in the navigation, head, content, or elsewhere. Make sure the crawling tool's settings are accurate.
- Link origin: Specifies whether the link is present only in the HTML or in the rendered HTML after JavaScript execution. This is helpful when troubleshooting JavaScript-heavy websites.
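Some of these headers contain spaces, and column names made of only letters, digits, and underscores are generally easier to query once the data is in a table. The sketch below shows one way to normalize header labels before loading; the helper `to_column_name` and the snake_case convention are assumptions for illustration, not something the export or BigQuery requires.

```python
import re


def to_column_name(label):
    """Convert a header label such as 'Alt Text' to 'alt_text'."""
    # Collapse every run of non-alphanumeric characters into one underscore.
    name = re.sub(r"[^0-9a-zA-Z]+", "_", label.strip()).strip("_").lower()
    # Prefix names that would otherwise be empty or start with a digit.
    if not name or name[0].isdigit():
        name = "_" + name
    return name


# Illustrative subset of headers from a link export.
headers = ["Type", "Source", "Destination", "Alt Text", "Link position"]
print([to_column_name(h) for h in headers])
# prints ['type', 'source', 'destination', 'alt_text', 'link_position']
```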
With the file ready and a Google Cloud account set up, what’s next?
There are two options:
1. If your file is under 100 MB, upload it directly via the BigQuery interface.
2. For larger files, use Cloud Storage.
Although the process is similar to the first option, I will walk through the second option (as my file exceeds 100 MB).
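The choice between the two options comes down to a file-size check, which can be sketched as follows. This assumes the 100 MB limit means 100 * 1024 * 1024 bytes; the exact byte threshold is an assumption, and the function names and return labels are hypothetical.

```python
import os

# Assumption: "100 MB" is interpreted as 100 * 1024 * 1024 bytes.
DIRECT_UPLOAD_LIMIT_BYTES = 100 * 1024 * 1024


def upload_route(size_bytes, limit=DIRECT_UPLOAD_LIMIT_BYTES):
    """Pick an upload route from the file size in bytes."""
    if size_bytes < limit:
        return "bigquery_interface"  # option 1: direct upload
    return "cloud_storage"  # option 2: stage the file in a bucket first


def upload_route_for_file(path):
    """Apply the same rule to a file on disk."""
    return upload_route(os.path.getsize(path))


print(upload_route(5 * 1024 * 1024))    # prints bigquery_interface
print(upload_route(250 * 1024 * 1024))  # prints cloud_storage
```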
Create a bucket and load the CSV
1. Return to the Cloud Hub and click “Cloud Storage” at the bottom left.