User talk:Rlink2/Archive 7

Archiving sites that use Microsoft Sway
Just in case it piques your interest, I thought you might like to know about an awkward case.

I am interested in a project called East West Rail, a new railway line being built or rebuilt in England between Oxford and Cambridge. The consortium doing the work publishes a regular newsletter detailing the works, but they use Microsoft Sway. For example: Archive.org and Ghostarchive make a good attempt and get most of the report, but archive.ph fails miserably. The struggle for Archive.org and Ghostarchive is with the sections where more detail is available for expansion on request. See for example the Ghostarchive capture at https://ghostarchive.org/archive/eSdzm at "Bicester & Launton".

I don't expect you to solve this of course; it is just interesting if it indicates a way for sites to obstruct the archivers. 𝕁𝕄𝔽 (talk) 17:25, 9 October 2022 (UTC)


 * @John Maynard Friedman
 * I will look into this in further detail, but can't you just archive those specific sections? Example:
 * https://ghostarchive.org/archive/61V97?kreymer=true
 * I'm sure the specific missing images can be fixed with a simple feedback request. Rlink2 (talk) 18:02, 9 October 2022 (UTC)


 * Yes, I could, though I'm not clear how the result would be integrated. (A citation of the original Sway "document" would be complete and valid, but it does not seem possible to give an archive link for it that is complete without also giving the codicil, and I don't know if any citation template supports that.)
 * My point was more that it seems that their "link discovery" system has been foiled by Sway's technology. --𝕁𝕄𝔽 (talk) 18:51, 9 October 2022 (UTC)

Many of your changes today
Hi. I've noticed quite a few of the articles you've worked on today are showing up in Category:Pages using duplicate arguments in template calls. It's because there is a duplicate "url-status=live" in there. Cheers, Dawnseeker2000  23:22, 9 October 2022 (UTC)


 * Thank you for letting me know. I will fix things so it won't happen again. @Dawnseeker2000 Rlink2 (talk) 23:55, 9 October 2022 (UTC)
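
For anyone wanting to check for this before saving, here is a minimal Python sketch of how a duplicated named parameter such as "url-status=live" could be spotted in a single template call. It is only an illustration, not how AWB or MediaWiki does it: the function name `duplicate_params` and the sample citation are made up, and the parsing assumes a flat template with no nested templates, wikilinks, or pipes inside parameter values.

```python
from collections import Counter


def duplicate_params(template_text: str) -> list[str]:
    """Return the names of named parameters that occur more than once.

    Simplifying assumption: one flat template call, with no nested
    templates, wikilinks, or "|" characters inside parameter values.
    """
    inner = template_text.strip()
    if inner.startswith("{{") and inner.endswith("}}"):
        inner = inner[2:-2]

    names = []
    # The first piece is the template name itself, so skip it.
    for part in inner.split("|")[1:]:
        if "=" in part:
            # Split on the first "=" only; values may contain "=".
            names.append(part.split("=", 1)[0].strip())

    counts = Counter(names)
    return [name for name, n in counts.items() if n > 1]


# Hypothetical citation with the parameter repeated.
cite = (
    "{{cite web |url=https://example.org |title=Example "
    "|url-status=live |archive-url=https://example.org/archive "
    "|url-status=live}}"
)
print(duplicate_params(cite))
```

Run on the sample citation, this reports `url-status` as the duplicated name; a citation with each parameter given once yields an empty list.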

Archived Census 2000 URLs
Just FYI, the links aren't working. I suggest using a more mainstream archiving website like Wayback Machine or Archive.today, since GhostArchive seems to be buggy.  Sounder Bruce  23:25, 9 October 2022 (UTC)


 * @SounderBruce Example?
 * https://ghostarchive.org/archive/fHsv9 works for me.
 * Or it could be that there is a problem with the original link. My 404 checker may not be functioning properly, so if that is the case let me know so I can fix it.
 * Also note that archive.today will only archive the first page of a PDF. Rlink2 (talk) 00:00, 10 October 2022 (UTC)
 * This link was added to several Washington county articles and does not work.  Sounder Bruce  00:05, 10 October 2022 (UTC)
 * @SounderBruce And neither does the original link. Sometimes it's harder to detect if it's a soft 404.
 * I will try to update my thing; thanks for the heads up though. Rlink2 (talk) 00:20, 10 October 2022 (UTC)
 * @SounderBruce
 * Also, my reason for using ghostarchive instead of archive.org, at least for this particular job, is that it's way faster at handling new webpages (maybe because there are fewer people using it?).
 * When I need to go back to old archived webpages I usually use the Wayback Machine (in fact that is what should have happened here when the links were dead), but alas the 404 detector was not working, so it didn't put it in. Rlink2 (talk) 12:32, 10 October 2022 (UTC)
 * We lost so many archive links never to be recovered when WebCite disappeared. The 1-man providers are prone to disappearance. They cost money to operate, generate little or no revenue, at the whim of a single benefactor. --  Green  C  13:33, 10 October 2022 (UTC)
 * @GreenC @SounderBruce Good point. I haven't been editing for a while so I kinda forgot about a lot of things around here. Like I said before, I try to diversify my use of sites and will continue to do so.
 * The times when I use archive.today or ghostarchive, it is because of:
 * paywalls
 * Instagram/Facebook
 * YouTube
 * which archive.org cannot do.
 * I have always left the "general" archiving to archive.org/IABot, and use my AWB to get the sites that don't work with it.
 * Regarding this specific case, archiving PDFs happens to be faster on ghostarchive.org than on archive.org. My run was using both websites; there just happened to be a bug, so it didn't work the way it should. Also note that many of the PDFs are already archived on archive.org and put in the citation, so I am just doing the ones that haven't been done yet.
 * There are about 60,000 articles with ghostarchive on Wikipedia. Not all were from me (many people like ghostarchive on here), but even if you want to assume that is the case, that is only 20% of my total edits (again, assuming that I am responsible for 100% of the links, which I am not). That means at least 80% either have nothing to do with archiving or were archiving with archive.org.
 * Maybe a wise thing is to always submit the URL to archive.org as well. It may be slower, but if Ghost goes down, we have a backup at archive.org for all of those URLs. I can try that. Rlink2 (talk) 14:34, 10 October 2022 (UTC)
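
To make the approach discussed in this thread concrete, here is a minimal Python sketch of a soft-404 check combined with the "always keep a backup at archive.org" idea. It is not the actual checker used for these edits: the phrase list, the function names `looks_dead` and `plan_archiving`, and the action labels are all assumptions chosen for illustration, and real soft-404 detection usually needs more than phrase matching.

```python
# Hypothetical phrases that often appear on "soft 404" pages, i.e. pages
# that return HTTP 200 but actually say the content is gone.
SOFT_404_PHRASES = (
    "page not found",
    "404 not found",
    "no longer available",
    "does not exist",
)


def looks_dead(status_code: int, body: str) -> bool:
    """Heuristic: treat hard errors and obvious soft 404s as dead links."""
    if status_code >= 400:
        return True
    text = body.lower()
    return any(phrase in text for phrase in SOFT_404_PHRASES)


def plan_archiving(status_code: int, body: str) -> list[str]:
    """Decide what to do with a cited URL (action labels are illustrative).

    Dead link: look for an older snapshot rather than capturing the
    error page. Live link: capture it, and also submit it to a second
    archive so a backup exists if one service disappears.
    """
    if looks_dead(status_code, body):
        return ["find_existing_wayback_snapshot"]
    return ["capture_ghostarchive", "capture_wayback_backup"]


print(plan_archiving(200, "<h1>Page Not Found</h1>"))
print(plan_archiving(200, "<p>Census 2000 population tables</p>"))
```

The point of the design is that a new capture is only made when the original still looks alive, so a dead or soft-404 page is never saved as if it were the source.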

Ghostarchive.org links are not working for certain domains.
Hey, your recent edit introduced archive links from Ghostarchive.org, but the links for citations from ISRO.gov.in, DOS.gov.in and www.mcgill.ca were not working, so I removed them and replaced them with Archive.org links wherever possible. Ohsin 09:23, 10 October 2022 (UTC)


 * @Ohsin thank you for the heads up. If you look at the links you'll see most of them are dead, which is the same problem the person above you also encountered. I am working on fixing this. Rlink2 (talk) 11:58, 10 October 2022 (UTC)

Moved page in /Music 2
I was unable to load /Music 2 in AWB to effect the page name change "Down Home (song)" → "Down Home (Alabama song)" when I was updating all of the What Links Here instances. VanIsaac, GHTVcont WpWS 23:59, 13 October 2022 (UTC)

RS challenge to archive.is
You (and your page watchers) may wish to be aware of this question at Reliable sources/Noticeboard. 𝕁𝕄𝔽 (talk) 18:32, 18 October 2022 (UTC)