This is common even for relatively simple sites that consistently produce blog content over time. Now you have a big list of URLs with the following combinations: duplicate content Pages that are indexed but not generating impressions Pages with very low engagement indicators (high bounce rate, short site stay, etc.) Now comes the hardest part. It actually removes these pages from the index to fix the "index bloat" issue. Related Content: How to Set Goals and Goal Achievement Processes in Google Analytics Step 3: Remove Low Quality Pages to Fix Indexing Issues
The first step to really whatsapp database sorting out your indexing problem is to identify the types of duplicate, low-quality content that exist in your index. As a result of the first two steps, we need a large list of URLs with possible duplicates or low values, but how do we actually deal with these issues? I like to start by categorizing the different types of problems that the site is suffering from. This can be very time consuming as you need to see each URL displayed in these reports. If you've reviewed many different sites, this is pretty proficient and often very quick to identify, but if you're new to indexing issues, you can dig into it in the following ways:
First, you need to make sure that the page in question is actually a duplicate or thin page that you are considering deleting. Then you need to look for the cause of the problem you are experiencing. Why is this page thin or duplicated? Most of the time, the reason URLs are duplicated is because they are repeated on various different pages. To see an example, let's dig into some Pottermore reports. Looking at Screaming Frog's duplicate title tag report, there are several pages with the same title tag. image25 This is effectively the same content at different URLs and subdomains. However, in this case, the site is configured to serve different shopping pages based on geography.