Forum Activity for @soaringeagle

soaringeagle
@soaringeagle
01/22/17 10:11:59PM
3,304 posts

exclude photos from askimet spam detect


Suggestions

yea 1 of my 1st posts when joining jr was a module that did that..i tried to learn module development for that but was eynd my abilities
soaringeagle
@soaringeagle
01/22/17 10:07:55PM
3,304 posts

weird sitemap crawl results after urlscanner update


Installation and Configuration

got ya well the 1st 1 i found was after the human readable part
i think
it was after a username so might have been on her profile itself but thought it was a blog post
no actually i remember it was in an activity feed or timeline
i did contact inspyder but might take a couple days to get a reply
then they will likely need to run a crawl and see the results i see
it did seem to start though with 1 of the recent jr updates so you could probly try the trial version on a smaller site and see
soaringeagle
@soaringeagle
01/22/17 06:05:05PM
3,304 posts

weird sitemap crawl results after urlscanner update


Installation and Configuration

yea its odd behavior and just started with 1 of the recent updates
it seems like it just adds any external links to the end of the internal link crawls it it comes up as the same page itself and it adds the external links on again, thereby getting stuck in a loop crawling the same page over and over just adding to the url (if you click that link even though it has many urls added on it still gies to the original page)
soaringeagle
@soaringeagle
01/22/17 04:52:36PM
3,304 posts

weird sitemap crawl results after urlscanner update


Installation and Configuration

ugh just found this too
1/20/2017 8:06:12 PM - Warning: Request Timeout (try reducing "Number of Crawlers" in Advanced Settings): http://www.freedomswings.org/soaringeagle/youtube/83/!http:/www.youtube.com/!http:/www.youtube.com/!http:/www.youtube.com/!http:/www.youtube.com/!http:/www.youtube.com/!http:/www.youtube.com/!http:/www.youtube.com/!http:/www.youtube.com/!http:/www.youtube.com/!http:/www.youtube.com/!http:/www.youtube.com/!http:/www.facebook.com/!http:/www.facebook.com/!http:/www.facebook.com/!http:/www.facebook.com/!http:/www.facebook.com/!http:/www.facebook.com/!http:/www.facebook.com/Majsternmajster

now last week freedomswings had 23,000 urls in the sitemap
now its at 805,000 and still crawling
it gets stuck in these loops i think adding more and more urls to the intended url

verified
i checked there was an issue in the yt description that included ! before http
so i excluded /http already now had to exclude /!http too
but it definately loops adding the url on over and over and over
i will file a report with inspider as well
updated by @soaringeagle: 01/22/17 04:59:45PM
soaringeagle
@soaringeagle
01/22/17 04:45:54PM
3,304 posts

weird sitemap crawl results after urlscanner update


Installation and Configuration

yes cause jr sitemap creator i don't believe will list all pages correctly
plus doesnt alow you to set priorities change frequencies or custom exclusions
i use inspyder sitemap creator
but there was no inspyder update to explain the change in behavior only a jr update
inspyders been amazing (like you guys) at fixing bugs and adding custom features i sually get a beta version within hours when i make a suggestion or report a bug

theres been no upgrades in a fairly long time so the jr update is certainly the cause
the latest url scaner id say something in the change log about urls being handled oddly in some cases..so maybe its fixed
soaringeagle
@soaringeagle
01/22/17 04:40:23PM
3,304 posts

exclude photos from askimet spam detect


Suggestions

i originally had hoped to find a way that they would join into a new members quota and after x number of days be moved to full members
but never figured out how to do that
soaringeagle
@soaringeagle
01/22/17 03:58:31PM
3,304 posts

exclude photos from askimet spam detect


Suggestions

i am constantly dealling with legit photo galleries marked as spam and needing approval
i have never once ha any issues with image spam, and am active enough on my site to spot it and delete it if there ever was any
can you add the ability to exclude modules from the askimet scans so hundreds of photos dont get flagged and need to be approved all the time (without ever once catching legit spam)
updated by @soaringeagle: 04/24/17 04:31:29AM
soaringeagle
@soaringeagle
01/22/17 03:54:53PM
3,304 posts

weird sitemap crawl results after urlscanner update


Installation and Configuration

for the last week (may have been fixed in latest update sitemap crawl takes 4-5 days before i'll know) i was geting alot orunaway scans and weird urls like www.mysit.com/someforumpost> http://www.somesitelinkedinpost.com> http://www.anotherlink.com> http://www.3rdlink.com/ etc

i had to add *> http* to my exclusion list but am afraid now it will also block the post these scanned urls appear on
was this fixed? (i will not be able to tellnow that i excluded them)
however on both sites i had seen visitors on these weird urls and bots crawling them

updated by @soaringeagle: 04/28/17 10:39:08AM
soaringeagle
@soaringeagle
01/21/17 10:00:28AM
3,304 posts

queue depth just keeps increasing


Installation and Configuration

even when i upload it?
i am getting real poor db performance in performance test (60.something)
so maybe that's why
i am getting a free superpowered server in a few days (16 cores 48-64 gigs 15 tb raided sas 15000 rpm drives (he says its twice as fast as ssd)
and setting up a db server separately (i think on a vm)
hope that fixes it
soaringeagle
@soaringeagle
01/20/17 06:07:08PM
3,304 posts

queue depth just keeps increasing


Installation and Configuration

just when i marked it solved, the upload and integrity check worked fine on dreadlockssite but not freedomswings
cannot figure out why
only thing i can guess at is cause fwi is originally on fwi.dreadlockssite.com then i just pointed the vhosts to the subdomain folder within the dreadlockssite account space
i should be getting a free server upgrade soon (huge upgrade)
so if nothing else hopefully that will fix it
  74