weird sitemap crawl results after urlscanner update
OK, after looking at the generated JR sitemap I see a few issues (I believe we discussed some of these while developing the module).
During development we discussed having it add pages from the DB URLs of each module, and add new pages "on the fly" using the queue whenever pages were added too fast to write to the XML.
I see 2 files: one contains profiles, one contains modules.
One module listed in the sitemap is seamless, with this URL:
http://www.freedomswings.org/seamless
and page content that obviously doesn't need to be crawled.
Instead, the sitemap module should harvest URLs that use the seamless lists, like
http://www.freedomswings.org/soaring-videos-by-category
Additionally, every single page has priority 1.
Now, does the sitemap module know that if you get over 50,000 profiles it needs to start a new file? Sitemaps have both a file size limit and a URL limit of 50k URLs per file, but sites with long URLs might need to limit it to 40k so the long URLs don't push the file past the size limit.
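To show what I mean by starting a new file, here is a rough Python sketch. It is not the module's actual code, just an illustration: the function name `split_sitemap` and its parameters are made up, and the 50 MB figure is the uncompressed size limit as I understand it from the sitemaps.org protocol. Checking the byte size as well as the URL count means you don't have to guess at a lower cap like 40k for sites with long URLs.

```python
from xml.sax.saxutils import escape

# Limits per sitemap file, as I understand the sitemaps.org protocol.
MAX_URLS = 50_000
MAX_BYTES = 50 * 1024 * 1024  # uncompressed size

HEADER = (
    '<?xml version="1.0" encoding="UTF-8"?>\n'
    '<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">\n'
)
FOOTER = "</urlset>\n"


def split_sitemap(urls, max_urls=MAX_URLS, max_bytes=MAX_BYTES):
    """Return a list of sitemap XML strings, each within both limits.

    Hypothetical helper for illustration only.
    """
    overhead = len(HEADER.encode("utf-8")) + len(FOOTER.encode("utf-8"))
    chunks, current, size = [], [], overhead
    for url in urls:
        entry = f"  <url><loc>{escape(url)}</loc></url>\n"
        entry_size = len(entry.encode("utf-8"))
        # Start a new file when either limit would be exceeded.
        if current and (len(current) >= max_urls or size + entry_size > max_bytes):
            chunks.append(current)
            current, size = [], overhead
        current.append(entry)
        size += entry_size
    if current:
        chunks.append(current)
    return [HEADER + "".join(chunk) + FOOTER for chunk in chunks]
```

Each returned string would be written to its own file, and a sitemap index file would then list them all.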
A sitemap that is updated on the fly, completely server side, is a dream come true, especially if it allows management of priorities and change frequencies at least on a module level, and independently on SB created pages.
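Something like this is what I have in mind for the priority and change frequency settings. Again, this is only a sketch in Python with made-up names (`MODULE_SETTINGS`, `url_entry`, the module keys and the values are all hypothetical), not how the module works today: defaults per module, plus optional overrides for individual pages.

```python
from xml.sax.saxutils import escape

# Hypothetical per-module defaults; module names and values are examples only.
MODULE_SETTINGS = {
    "profile": {"priority": 0.8, "changefreq": "weekly"},
    "page": {"priority": 0.5, "changefreq": "monthly"},
}
# Fallback for modules with no settings of their own.
DEFAULT_SETTINGS = {"priority": 0.5, "changefreq": "monthly"}


def url_entry(loc, module, overrides=None):
    """Build one <url> element using module defaults and per-page overrides."""
    settings = dict(MODULE_SETTINGS.get(module, DEFAULT_SETTINGS))
    # A per-page override (keyed by URL) wins over the module default.
    if overrides and loc in overrides:
        settings.update(overrides[loc])
    return (
        f"<url><loc>{escape(loc)}</loc>"
        f"<changefreq>{settings['changefreq']}</changefreq>"
        f"<priority>{settings['priority']:.1f}</priority></url>"
    )
```

With this, a profile URL would come out at priority 0.8 instead of everything being 1, and a single important page could still be bumped up through `overrides`.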
This is a good effort, but I could suggest some great improvements that would make it worth paying for.