Technical SEO Audit Checklist for E-Commerce Sites: 47 Points That Actually Matter | AuditMySite

· 5 min read

Why E-Commerce Sites Need a Different SEO Audit Approach

E-commerce sites aren't blogs with a shopping cart. They have unique technical challenges — faceted navigation generating millions of URL combinations, product pages that appear and disappear with inventory, cannibalization between category and product pages, and crawl budgets stretched thin across thousands of near-identical URLs.

A generic SEO audit misses the issues that actually cost e-commerce sites traffic and revenue. This checklist doesn't. We built it from auditing 400+ e-commerce sites ranging from 500 to 5 million pages. Every point is here because it moved the needle.

Section 1: Crawlability and Indexing (15 Points)

Robots.txt and Crawl Directives

  1. Robots.txt exists and is accessible. Verify at yourdomain.com/robots.txt. A missing or 500-error robots.txt causes Google to temporarily stop crawling.
  2. Faceted navigation URLs are blocked. Color, size, price range, and sort order filters create combinatorial explosion. Block these with robots.txt or use canonical tags. A site with 5,000 products and 10 filter dimensions can generate 50 billion URL combinations.
  3. Internal search result pages are noindexed. /search?q=blue+shoes creates infinite thin content pages. Add <meta name="robots" content="noindex">.
  4. Paginated pages use proper handling. Google no longer supports rel="prev/next" as an indexing signal. Instead, ensure paginated pages have unique, descriptive content and self-referencing canonicals. Consider infinite scroll with proper URL updates.
  5. Staging/dev sites are blocked. Check that staging.yourdomain.com has either robots.txt blocking all crawlers or password protection.

XML Sitemaps

  1. Sitemap exists and is submitted to Google Search Console. Check Coverage → Sitemaps for status.
  2. Sitemap only includes indexable, canonical URLs. Every URL in your sitemap should return 200, have a self-referencing canonical, and not be noindexed. Sitemaps with > 10% non-indexable URLs reduce Google's trust in the sitemap.
  3. Sitemap is segmented. Use separate sitemaps for products, categories, blog posts, and static pages. This helps you identify which section has indexing issues in GSC.
  4. Out-of-stock products are handled correctly. Don't 404 them if they'll return. Use availability: OutOfStock in schema and keep the page live. If permanently discontinued, 301 to the parent category or closest alternative.
  5. Sitemap dates are accurate. Google uses lastmod as a signal for recrawl priority. If every page shows today's date, Google ignores the signal entirely.

Crawl Budget Optimization

  1. Crawl stats in GSC show healthy patterns. Check Settings → Crawl Stats. Look for crawl rate drops, response time spikes, or high percentages of non-200 responses.
  2. Redirect chains are eliminated. No URL should redirect more than once. Chains of 3+ redirects waste crawl budget and leak PageRank.
  3. Orphan pages are identified and linked. Product pages with no internal links won't be crawled or ranked. Cross-reference your sitemap with a crawl (Screaming Frog, Sitebulb) to find pages that exist in the sitemap but have zero internal links.
  4. Parameter handling is configured in GSC. Tell Google which URL parameters change content (category, page) vs. which don't (session ID, tracking codes).
  5. Log file analysis shows efficient crawling. Parse your server logs to see what Googlebot actually crawls. If it's spending 60% of its time on filtered URLs, you have a problem.

Section 2: Site Architecture (10 Points)

  1. Click depth from homepage to any product is ≤ 4. Homepage → Category → Subcategory → Product. Any deeper and crawl priority drops dramatically.
  2. Breadcrumb navigation is implemented with schema. BreadcrumbList schema helps Google understand your hierarchy and displays as rich results in SERPs.
  3. Category pages have unique, substantial content. Not just a grid of products. Include 200-500 words of relevant copy, buying guides, or comparison content. Category pages should be your primary organic landing pages.
  4. Internal linking follows a hub-and-spoke model. Category pages link to all child products. Products link back to their category and to related products. Blog posts link to relevant categories and products.
  5. URL structure is logical and flat. Good: /category/product-name. Bad: /shop/all-products/category/subcategory/product/variant/color. Keep URLs under 100 characters.
  6. Canonical tags are correct on every page. Self-referencing canonicals on unique pages. Cross-domain canonicals if content is syndicated. Variant pages (color/size) canonical to the main product.
  7. Hreflang is implemented for international stores. If you sell in multiple countries/languages, hreflang tags prevent duplicate content issues and ensure the right version appears in each market.
  8. HTTPS is enforced everywhere. No mixed content warnings. All HTTP URLs 301 to HTTPS. HSTS header is set with at least a 6-month max-age.
  9. Mobile and desktop serve identical content. With mobile-first indexing, any content hidden on mobile is effectively hidden from Google. Check for desktop-only tabs, accordions, or sections.
  10. 404 pages return proper status codes. Soft 404s (pages that show "not found" content but return a 200 status) waste crawl budget. Verify with curl -I or Screaming Frog.

Section 3: On-Page SEO for Product Pages (12 Points)

  1. Title tags are unique per product with format: Product Name - Category | Brand. Under 60 characters.
  2. Meta descriptions are unique and include a CTA. "Shop [Product] at [Brand]. Free shipping on orders over $50." Under 155 characters.
  3. H1 tags match the product name and are unique per page.
  4. Product descriptions are unique. Don't use manufacturer descriptions — hundreds of other retailers have the same text. Write unique descriptions or at minimum, add unique content (reviews, comparison, use cases) around the manufacturer text.
  5. Image alt text is descriptive. "Blue Nike Air Max 270 running shoe side view" > "product-img-001.jpg"
  6. Image files use next-gen formats. AVIF or WebP with JPEG fallback. Properly sized (don't serve 2000px images in 400px containers). Use srcset for responsive images.
  7. Product reviews are on-page (not loaded via iframe or third-party widget that Google can't see).
  8. Related products section exists with proper internal links. This isn't just UX — it's internal link equity distribution.
  9. Out-of-stock products suggest alternatives. Don't dead-end the user or Google. Show "Customers also viewed" or "Similar products in stock."
  10. Price and availability are visible without JavaScript. Server-render critical product information so Google can index it without executing JS.
  11. Product variants don't create duplicate pages. If the blue and red versions of a shoe have separate URLs, they need distinct content or canonical tags pointing to the main product.
  12. User-generated content (reviews, Q&A) is indexable. This is free, unique content that targets long-tail queries. Make sure it's in the HTML, not loaded via API after page render.

Section 4: Structured Data (5 Points)

  1. Product schema on every product page — name, description, image, price, availability, brand, SKU, review/rating aggregate. Validate with Google's Rich Results Test.
  2. BreadcrumbList schema on all pages.
  3. Organization schema on the homepage — name, logo, social profiles, contact info.
  4. FAQ schema on category and guide pages where applicable. FAQ rich results drive significant CTR improvements.
  5. LocalBusiness schema if you have physical locations. Include address, hours, phone, and geo coordinates. This is especially important for businesses that combine online and physical presence. Local businesses in competitive markets see measurable ranking improvements from proper local schema implementation.

Section 5: Performance and Infrastructure (5 Points)

  1. Core Web Vitals pass on mobile for at least 75% of page loads (the "good" threshold in CrUX).
  2. CDN is configured correctly. Static assets (images, CSS, JS) should be served from edge locations. Check cache hit ratios — anything below 90% means your CDN isn't configured optimally.
  3. Server response time (TTFB) is under 500ms. For dynamic e-commerce pages, this often requires database query optimization, page caching, or edge computing.
  4. JavaScript rendering doesn't block critical content. Use server-side rendering (SSR) or static generation for product and category pages. Client-side rendered e-commerce sites consistently underperform in organic search.
  5. Monitoring and alerting is in place. Set up alerts for 5xx error spikes, crawl rate drops, and Core Web Vitals regressions. Tools like Sentry, Datadog, or even free UptimeRobot catch issues before they impact rankings.

Running Your Audit: Tools and Process

For a comprehensive e-commerce audit, you'll need:

  • Crawling: Screaming Frog ($259/year) or Sitebulb ($144/year) — crawl your entire site to find technical issues at scale
  • Search Console: Free. Your primary source of truth for how Google sees your site.
  • PageSpeed Insights / Lighthouse: Free. For Core Web Vitals assessment.
  • Ahrefs or Semrush: ($99-$449/month) For backlink analysis, keyword tracking, and competitive intelligence.
  • Log file analyzer: Screaming Frog Log Analyzer or custom ELK stack for understanding Googlebot behavior.

Proper brand positioning combined with technical SEO excellence creates a compounding advantage that competitors can't easily replicate.

Prioritization: What to Fix First

Not all issues are equal. Prioritize based on:

  1. Indexing blockers — anything preventing pages from being indexed (noindex errors, canonical issues, robots.txt blocks)
  2. Crawl budget waste — redirect chains, parameter URLs, duplicate content
  3. Performance issues — Core Web Vitals failures, especially on top-traffic pages
  4. Schema implementation — structured data gaps that cost you rich results
  5. Content optimization — thin product descriptions, missing meta data

An audit is only as good as the action plan it produces. Prioritize ruthlessly, fix the highest-impact issues first, and re-audit quarterly.

Ready to audit your site?

Run a free SEO scan and get actionable recommendations in seconds.

Start Free Scan →