Talk given at SEOntheBeach 2022, discussing what we did at Envato Elements and what worked for us in terms of crawl capacity management. Basically we wanted to show how you can signal Google into crawling and ingesting more of the site when the crawl capacity doesnt change.
2. Heads up..
際際滷s in English 鶲
Speako en Espa単ol 鶲
Gast坦n Riera - @gastonriera
3. At the end I'll share a 90%
discount for Elements!
Gast坦n Riera - @gastonriera
4. That's how I used to look,
all well dressed and all.
Gast坦n Riera - @gastonriera
Gast坦n Riera
5. Everything you need to get your
creative projects done.
Gast坦n Riera - @gastonriera
The big names:
Other very cool products:
6. Gast坦n Riera - @gastonriera
The two things I like the most about working at envato:
- Being sustainable and caring about the community
- Fully remote (ANZ/MX) and working from abroad
31. Content is not just text on the
page, but everything on it.
Every page is content.
Gast坦n Riera - @gastonriera
32. Battle_1: Content quality
Gast坦n Riera - @gastonriera
Two options:
1. Add content focussing on quality over
quantity.
2. Remove content from Google's index.
We already had +9M items!
33. Battle_1: Content quality
Gast坦n Riera - @gastonriera
Two options:
1. Add content focussing on quality over
quantity.
2. Remove content from Google's index.
We already had +9M items!
34. Battle_1: Content quality
Gast坦n Riera - @gastonriera
Two options:
1. Add content focussing on quality over
quantity.
2. Remove content from Google's index.
We already had +9M items!
35. Do you know what reduces the
content quality of any site?
Gast坦n Riera - @gastonriera
36. Do you know what reduces the
content quality of any site?
DUPLICATE CONTENT!
Gast坦n Riera - @gastonriera
37. Noindex and remove duplicates,
RUTHLESSLY
Gast坦n Riera - @gastonriera
Noindex a good part of
the items library.
-> Several million less
discoverable pages!
Why we decided to
noindex instead of
a fancier solution?
Ask me later
38. A few tips on how to get what to noindex?
- Use google's crawled not indexed as a proxy
- Check duplicate titles/urls/content description
- Just a different image doesn't make it a different page to
the eyes of Google!
Gast坦n Riera - @gastonriera
Battle_1: Content quality
39. Noindex and remove duplicates,
RUTHLESSLY
Gast坦n Riera - @gastonriera
Why the redirected path
had 15% of site's traf鍖c
and 20x the destination.
Ask me later
Merged two translations that ended up being way
more similar that intended
-> A few millions pages removed from Google.
40. Other big things we did
Turned Tag pages into Search pages
Search pages are noindex by default
The overall result? Decreased the index size to a half
without impacting organic traf鍖c.
Gast坦n Riera - @gastonriera
50%
Battle_1: Content quality
42. Battle_2: Internal linking
Gast坦n Riera - @gastonriera
Out of many tactics:
1. Reduce the number of crawl paths
2. Nofollow on links to low-value pages
44. Gast坦n Riera - @gastonriera
Link to only valuable pages
Added links between related
search pages
10% Organic traf鍖c!
If it's a useful search page,
it will not have a noindex.
Note that
45. Gast坦n Riera - @gastonriera
Link to only valuable pages
Remove hre鍖ang when you're uncertain
of the quality on other languages
15% size of index!
hre鍖ang are bidirectional,
remove them on every
language.
Remember
46. What are valuable pages?
Gast坦n Riera - @gastonriera
In short, pages we want Google to index.
47. Gast坦n Riera - @gastonriera
Link to only valuable pages
As per nofollow:
Nofollow on links to noindex pages
Filters and facets, all nofollow
The overall result? Google re-crawled more pages.
60%
48. So, why crawl capacity
management?
Gast坦n Riera - @gastonriera
49. Crawl budget stayed the
same.
Gast坦n Riera - @gastonriera
*On average, over the last 2yrs.