Can I ignore the sitemap when crawling the website? What if my website has more pages than the sitemap includes?