Back to Guides
    Technical SEO Crawling 10 min read

    XML Sitemap Best Practices: Help Google Index Your Content

    Target keyword: XML sitemap SEO · Tool: Site Audit (3 credits)
    Share

    An XML sitemap is your direct line to search-engine crawlers — a machine-readable list of the URLs you want discovered and indexed, with hints about when each last changed. It won't force Google to index a page or improve a weak page's rankings, but it dramatically speeds discovery on large sites, new sites, and sites with deep or poorly-linked content. Configured well, it gets your best content found fast; configured carelessly, it sends Google contradictory signals. This guide covers the best practices, validated with Vincony's Site Audit.

    Step 1: Generate Your Sitemap

    • Most CMS platforms generate sitemaps automatically. Ensure yours includes:
    • All indexable pages (not noindexed or redirected URLs)
    • Accurate lastmod dates (update when content actually changes)
    • Proper URL formatting (canonical versions only)
    • Sitemap index file if you have multiple sitemaps

    Step 2: Audit for Common Issues

    • Run Vincony's Site Audit to check for sitemap problems:
    • URLs returning 404 or redirect status codes
    • Noindexed pages listed in the sitemap (contradictory signals)
    • Missing important pages that should be included
    • Outdated lastmod dates that haven't been refreshed
    • Sitemaps exceeding 50,000 URL or 50MB limits

    Step 3: Segment by Content Type

    • For sites with diverse content, create separate sitemaps:
    • `/sitemap-pages.xml` — core site pages
    • `/sitemap-blog.xml` — blog posts and articles
    • `/sitemap-products.xml` — product pages
    • `/sitemap-images.xml` — image sitemap
    • `/sitemap-videos.xml` — video sitemap
    • A sitemap index file (`/sitemap.xml`) references all individual sitemaps.

    Step 4: Set Priority and Change Frequency

    • While Google says it largely ignores priority and changefreq values, setting them thoughtfully still helps communicate your content hierarchy:
    • Homepage and key landing pages: priority 1.0
    • Category pages: priority 0.8
    • Blog posts: priority 0.6
    • Archive/tag pages: priority 0.3

    Step 5: Submit and Monitor

    • Submit your sitemap through Google Search Console. Monitor the coverage report for indexing issues:
    • 'Submitted and indexed' — working correctly
    • 'Submitted but not indexed' — quality or technical issues
    • 'Crawled but not indexed' — Google found but chose not to index

    Key Takeaways

    • Only include indexable, canonical URLs — no 404s, redirects, or noindexed pages
    • Keep lastmod accurate — update it when content genuinely changes, not on every build
    • Segment by content type for large sites, referenced from a sitemap index
    • Submit via Search Console and watch the coverage report for 'crawled/discovered but not indexed'
    • Re-audit regularly — and remember a sitemap aids discovery, it doesn't guarantee indexing

    Frequently Asked Questions

    What is an XML sitemap?

    A machine-readable file listing the URLs you want search engines to discover and index, optionally with last-modified dates. It helps crawlers find your content efficiently, especially on large, new, or deeply structured sites.

    Does an XML sitemap improve rankings?

    Not directly. A sitemap aids discovery and indexing but doesn't boost a page's ranking. Its value is making sure your important content gets found and crawled quickly — a prerequisite for ranking, not a ranking factor itself.

    What should I include in my sitemap?

    Only indexable, canonical URLs that return 200 status. Exclude 404s, redirected URLs, noindexed pages, and non-canonical duplicates — including them sends contradictory signals and wastes crawl budget.

    How big can an XML sitemap be?

    A single sitemap can hold up to 50,000 URLs and 50MB uncompressed. Larger sites should split into multiple sitemaps (often segmented by content type) referenced from a sitemap index file.

    How do I submit my sitemap to Google?

    Submit it in Google Search Console under the Sitemaps report (e.g. yoursite.com/sitemap.xml) and reference it in robots.txt. Then monitor the coverage report for indexing issues like 'crawled but not indexed'.

    🛠️ Try it on Vincony

    Site Audit

    3 credits per use • Free credits on signup

    Ready to apply this guide?

    Sign up free at Vincony and get credits to start using Site Audit immediately.