Blog Post

SEO

Nadine

Wolff

published on:

31.05.2018

Visualizing Internal Linking with Gephi – Part 2: Your Website in Color

Table of Contents

No table of contents available
No table of contents available
No table of contents available

Visualizing Internal Linking - with Gephi, it's possible. In the first article on Gephi, the procedure was explained to properly prepare the necessary data. This article focuses on Gephi itself and how the internal linking of the website can be correctly visualized.

gephi interne Verlinkung visualisieren

Visualizing Internal Linking

With the prepared data table, we can now start Gephi. After launching, we can open a new project directly via the pop-up. In the top menu, the table must now be imported under Data Laboratory.

Datenlabor im Menü

Fig. 2 Data Laboratory in the menu

There, under "Import tables," you can select the table. To proceed, the following settings should be selected. This should be the default setting.

Einstellung Importieren der Tabelle

Fig. 3 Import settings for the table

Then click "Next" and "Finish." The import report usually shows errors, which can be ignored. With the settings "Graph type: Directed" and choosing "New workspace," the process can be completed.

Displaying Internal Linking

In Overview, the internal linking display can now be done. It still looks very confusing. Each of these points represents a specific page, and each line to another point stands for an internal link. Now, at the bottom left under Layout, you can redesign the orientation. There are different models to choose from. For a larger domain representation, we found that the Yifan Hu Proportional model worked best. Depending on the domain, other settings might work better; here is a suggestion from us:

Einstellung Yifan Proportional

Fig. 4 Settings in Yifan Hu Proportional

These modified metrics stand for the following:

  • Optimal distance: Size of the representation

  • Relative strength: Distance between points

  • Initial step size: Calculation speed (Gephi recommends 10% of the optimal distance)

With the start button, the layout is calculated, and an initial structure can already be seen. This process should be done 3 to 4 times; then the calculation should be mostly finished and the image should be final.

Fig. 5 Example of a website in Gephi

However, the graphic is still not really workable, let alone allows recognizing measures. This is where modularity and PageRank come into play.

Internal PageRank

For some SEOs, the mention of PageRank might raise eyebrows, recalling 10 years ago. But unlike Google's PageRank, which indicates how important a domain is within the internet (simplified), the internal PageRank reflects the most significant pages within the domain. Pages with a high PageRank are frequently linked internally and are also linked by pages with a high PageRank themselves.

On the right side, under Statistics, the internal PageRank can be calculated. If the window is not displayed, it must first be activated. Under the main menu Windows, it can be set again. This applies to all views in the dashboard.

pagerank berechnen

Fig. 6 Calculate internal PageRank

For the best results in PageRank calculation, use the following settings:

interner PageRank berechnen

Fig. 7 Internal PageRank settings

This allows for a better simulation of website dynamics. The subsequent window can simply be closed. The data has been calculated, but nothing has changed (yet). With the calculated data, the node size can be displayed anew. For this, the following settings need to be made at the top right under Customize:

Knotengröße in Gephi anpassen

Fig. 8 Adjust node size in Gephi

With Apply, the settings are applied and the image changes. The larger the node is now displayed, the higher the internal PageRank. But which page is behind each point? To have them displayed, apply the following settings:

Seiten in Gephi anzeigen

Fig. 9 Display pages in Gephi

If many nodes are bunched together and you want to take a closer look at one, you can click on it with the left mouse button and drag it away. This allows the image to be changed individually for better clarity. The original state can always be restored via layout calculation.

Visualizing URL Clusters

The individual nodes are arranged, and the size of the nodes indicates the domain's focal points. But before analyzing the existing data and determining measures, we need to recognize topic clusters. In this case, content is secondary because good topic clusters are also reflected in the internal linking. For this, we use modularity, located directly above PageRank on the bar.

Gephi calculates, through internal linking, which pages belong together and thus form a cluster. The modularity settings can be used in the default version. Afterwards, the node colors can be displayed. This should be done as follows:

Cluster darstellen in Gephi

Fig. 10: Display clusters in Gephi

With modularity, you can now clearly see the clusters on the website, represented by the internal linking. The more nodes with the same color are interconnected and the more separate the clusters are, the better linked they are and perceived as a cluster. Since the tool only calculates this, there is no subjectivity involved.

Here is an example with a brief interpretation:

In the example image, you can already see very good internal linking. Several clusters are clearly displayed, and there are not many small clusters. However, the purple cluster is not consistently strongly interlinked, which is why they do not lie together. Furthermore, the blue cluster and the purple cluster mix with each other, suggesting a lack of clear cluster definition. Here, the concept of areas needs to be reconsidered.

Fig. 11: Colorful clusters in Gephi

Additional Examples from the Domain

The given example is already good for cluster representation, but to give a little insight into Gephi, here are 2 more examples.

Positives Beispiel für die interne Verlinkung

Fig. 12: Positive example for internal linking

Here, domain structures can be clearly recognized. The areas are distinctly separated and show only a few interfaces. The domain shows a logical and very well-structured internal linking, which can only be improved marginally.

negatives Beispiel interne Verlinkung visualisieren

Fig. 13: Negative example for internal linking

Here, no clusters can be recognized. The tool seems to see hardly any connections. Almost all pages are equally relevant, and the pages in the middle of the graphic are strongly linked through the main menu. The entire internal linking concept may need to be revamped.

Measures for Internal Linking

We are now at a point where the website leaves the abstract level and is visualized. The concept for internal linking is either sensible or just there to ensure all pages are somehow reachable. This is necessary, of course, but internal linking must offer more to effectively support ranking.

Let's take it a step further. With an existing concept, which already shows a cluster structure in Gephi, how does Gephi assist further? The data calculated so far looks nice, but it still needs to be analyzed and measures derived. For this, data can be exported from the Data Laboratory.

Improving Internal PageRank

With the export, you now have a list that reflects the internal PageRank for each URL. This raises the following questions:

  • Which page is the most important according to this calculation?

  • Which pages should have a high PageRank?

  • How can I best influence the PageRank?

An internal prioritization should already be in the sitemap.xml. This can also be added for reference. Otherwise, individual prioritization certainly doesn’t hurt (shouldn't pages that generate the most revenue also have a good internal PageRank?). The following measures are also conceivable:

  • Pages with a high internal PageRank, but which are not actually relevant, should be linked or used less often to pass PageRank to other pages. Few internal links can help here.

  • Pages with a high PageRank should also not link to too many external pages, as this wastes internal potential.

  • Relevant pages with a low PageRank should be linked internally more frequently.

It is crucial to use internal linking to present relevant pages to Google.

Optimizing Topic Clusters

Like PageRank, modularity can also be exported. Important questions arise here:

  • How many clusters are there?

  • Which pages should be in a cluster?

  • How can I better influence cluster formation?

During a comprehensive keyword research, clustering should already have been done, which can now be checked. Otherwise, a topic clustering (which URL should be in which cluster) should be created. The following measures are also conceivable:

  • Pages that are thematically close, but depicted in different clusters. These should increasingly receive links from a cluster.

  • Small clusters that are depicted but should actually appear in a larger cluster. These should also receive more links from the large clusters.

It’s important to have as few clusters as necessary while displaying all thematically related pages in one cluster.

Visualization of Internal Linking – Conclusion

With Gephi, something abstract like visualizing internal linking becomes tangible. The tool enables a new form of analysis but must be appropriately prepared and structured. Only then can action derivations truly be recognized and internal linking be strengthened. A comprehensive analysis is worthwhile only when a concept exists and has been implemented. Otherwise, the representations do not yield significant insights.

What can we do for you?

Are you now looking for an online marketing agency to help you visualize your internal linking? We would be happy to advise you, from strategic creative plans to operational implementation. We look forward to your inquiry.

Nadine

Wolff

As a long-time expert in SEO (and web analytics), Nadine Wolff has been working with internetwarriors since 2015. She leads the SEO & Web Analytics team and is passionate about all the (sometimes quirky) innovations from Google and the other major search engines. In the SEO field, Nadine has published articles in Website Boosting and looks forward to professional workshops and sustainable organic exchanges.

no comments yet

Address

Bülowstraße 66

Aufgang D3

10783 Berlin

Newsletter

Address

Bülowstraße 66

Aufgang D3

10783 Berlin

Newsletter

Address

Bülowstraße 66

Aufgang D3

10783 Berlin

Newsletter