Beginners Guide to Log File Analysis (2022)

Log file analysis was one of the primary SEO techniques during the early years of search and digital marketing, but the rise of various SaaS products has led to it becoming a dying art. However, there are still insights that can be gained from analysis of your raw access logs and we hope to provide you the information you need to decide whether it’s a worthwhile investment of your time and how to conduct it if it may be.

What is log file analysis?

Log file analysis is the process of either manually, or using a tool or platform, reviewing the data that is stored by your site’s servers whenever a request for a resource (web page, CSS/JS file, image etc.) is registered. In doing so, the analyser can reveal issues with various parts of the site, possible SEO opportunities and the general behaviour of various search engine crawlers that roam the web.

What do log files include?

While log files can be extremely useful for SEO, the information stored is pretty basic – this includes:

The HTTP or ‘status code’ of your website’s server response (2XX, 3XX, 4XX etc.)
The IP address of the user agent (the software that retrieves, renders and allows for easy use of the web)
The type of request – either GET or POST depending on whether it’s a request to receive or provide data
A time stamp which states the date and time the request was received by the server
The URL path of the resource requested (the image, web page or file URL)
The user agent requesting the resource – generally a web browser such as Chrome, Mozilla etc.

These files will vary in size depending on how large the site is, how much traffic the site gets and how regularly the logs are archived.

Where can you find your site’s log files?

The only servers I have access to at the point of writing use the cPanel GUI, but most are fairly similar, so while they may not be in the same place, they’ll generally have the same descriptions. So, to find your log files, you’ll need to access your server management platform or, if you use a CDN, you’ll find your logs there as the server won’t receive most of the data.

Once you’re in, you’ll have two potential options, you can select ‘File Manager’ and scroll down to the ‘Logs’ folder, or you can select the Raw Access option which will allow you to download the most recent files.

You’ll receive a zip file (gz or similar) which you can unzip with Winrar to allow you to open it in your preferred spreadsheet program or SaaS log file analyser. If you’re opening in a spreadsheet, however, you’ll generally need to separate the text into columns as it will generally paste into a single cell per hit.

If you’re in Excel, you can do this with the ‘Data’ tab, and the ‘Text to Columns’ option.

The result should be columns which will fit into the following structure (sometimes there are a couple more, sometimes a couple less):

IP Address
D/M/Y/HH:MM:SS
Method/Query
HTTP Status Code
File Size/Bytes Downloaded
User Agent

FAQ

What log files include

+

While log files can be extremely useful for SEO, the information stored is pretty basic – this includes:

The HTTP or ‘status code’ of your website’s server response (2XX, 3XX, 4XX etc.)
The IP address of the user agent (the software that retrieves, renders and allows for easy use of the web)
The type of request – either GET or POST depending on whether it’s a request to receive or provide data
A time stamp which states the date and time the request was received by the server
The URL path of the resource requested (the image, web page or file URL)
The user agent requesting the resource – generally a web browser such as Chrome, Mozilla etc.

These files will vary in size depending on how large the site is, how much traffic the site gets and how regularly the logs are archived.

Where to find your site’s log files

+

The only servers I have access to at the point of writing use the cPanel GUI, but most are fairly similar, so while they may not be in the same place, they’ll generally have the same descriptions. So, to find your log files, you’ll need to access your server management platform or, if you use a CDN, you’ll find your logs there as the server won’t receive most of the data.

Once you’re in, you’ll have two potential options, you can select ‘File Manager’ and scroll down to the ‘Logs’ folder, or you can select the Raw Access option which will allow you to download the most recent files.

You’ll receive a zip file (gz or similar) which you can unzip with Winrar to allow you to open it in your preferred spreadsheet program or SaaS log file analyser. If you’re opening in a spreadsheet, however, you’ll generally need to separate the text into columns as it will generally paste into a single cell per hit.

Contact us today

to see what our award winning teams can do for your brand

let's chat

News & Views

Digital Marketing

6 essential tips for choosing your perfect digital marketing partner

By Becky Gardiner - March 5th 2024

Content Marketing & Digital PR Health & Wellness

Our favourite 3 campaigns that inspire healthy living

By Sian Badich - April 22nd 2024

Digital Marketing

The most common challenges digital marketers face

By Isabella Meerdink - April 16th 2024

Our latest videos

Benchmark Conferences Click News Organic Search (SEO) Video

Benchmark Search & Digital Conference 2022

By Immy Williamson - November 4th 2022

Wednesday 2nd November saw Benchmark Search & Digital Conference return for the 6th time. Held in the Science & Industry...

Content Marketing & Digital PR Video

In Focus: Building and Maintaining Relationships for Digital PR

By Jorden Williams - October 12th 2022

Public Relations (PR) at its core is focused on the creation, building and maintenance of relationships, but when it comes...

Organic Search (SEO) Video

Viva la SEO

By Immy Williamson - August 31st 2022

Every year or so, maybe even twice a year I’m confronted with the statement ‘SEO is DEAD’, it’s a bit...

Chums has a clear vision of the performance we want from our PPC and SEO campaigns and Click Consult have consistently delivered this over the years.

I see Will Dixon and Charlotte Chapman, together with Peter Smith who manages our account, as part of our marketing team. They have each spent the time to understand my team personally, the wider business at Chums and our customers, all of which has been crucial to achieve the targets and growth we’ve set over the years.

I’m more than happy to recommend the team at Click

Paul Gray Marketing Director

Great agency at the forefront of search marketing. Fantastic account management coupled with real experts working on your campaigns = a winning combination.

Click Consult has helped us to develop an online marketing presence that continues to inch upwards. The team manages our account exceptionally, communicating clearly and frequently about the progress.

They are responsive and proactive in their approach and are considered an important component within our digital marketing activities.

Peter Lingley Chief Operating Officer