User talk:JL-Bot/Archive 8

DOI search box
See. This could be templatified or hardcoded in the bot, up to you. &#32; Headbomb {t · c · p · b} 22:37, 9 August 2023 (UTC)
 * Done. I updated the bot to add it. -- JLaTondre (talk) 23:27, 10 August 2023 (UTC)
 * Not quite right though... &#32; Headbomb {t · c · p · b} 00:00, 11 August 2023 (UTC)
 * There is no difference in what is displayed? vs  -- JLaTondre (talk) 21:01, 11 August 2023 (UTC)
 * Well at the very least, two open inputbox tags is bad html/code. &#32; Headbomb {t · c · p · b} 21:05, 11 August 2023 (UTC)
 * I missed that. I focused on the extra line break that was added. I will fix. -- JLaTondre (talk) 22:14, 11 August 2023 (UTC)
 * Fixed. -- JLaTondre (talk) 13:08, 12 August 2023 (UTC)

WP:JCW/BADDOI
Legit DOIs go up to the 60000s now. So the limit should likely be bumped to 70000. &#32; Headbomb {t · c · p · b} 22:59, 9 August 2023 (UTC)
 * The maximum DOI is automatically calculated from the last Crossref pull. Currently, the Crossref pull is done after the dump file is processed. Unfortunately, that means if a new reference in the latest dump includes a DOI above the last limit, it will get flagged as bad. Instead of basing the Crossref pull on the existance of a new dump, I probably should just have it always execute on the 1st and 20th so it will be done before the dump is available. -- JLaTondre (talk) 23:40, 10 August 2023 (UTC)
 * Yeah probably. Could update all the JL-Bot/DOI stuff on those dates too, which would let me create the newest DOI redirects ahead of the full dump processing. &#32; Headbomb {t · c · p · b} 00:02, 11 August 2023 (UTC)
 * The DOI registrant pull will now run on the 1st and 20th of the month. -- JLaTondre (talk) 13:09, 12 August 2023 (UTC)

Doi prefix
The bot didn't do it's DOI prefix run on the 1st btw. &#32; Headbomb {t · c · p · b} 10:37, 4 September 2023 (UTC)
 * crossref.org errored out. I re-ran the job & crossref.org is responding this time. Results should be up in awhile. -- JLaTondre (talk) 14:21, 4 September 2023 (UTC)

How to remove orphan tag
How to remove Orphan tag from Blouse (Short Film) ? Rajmama (talk) 13:12, 21 September 2023 (UTC)
 * Looks like you already figured it out. But, yes, you edit the article and remove the orphan template. -- JLaTondre (talk) 23:54, 21 September 2023 (UTC)

Dec 20 bot run?
I think the bot choked or something? Normally bot runs for WP:JCW have occurred by now... &#32; Headbomb {t · c · p · b} 12:39, 24 December 2023 (UTC)
 * Yes, there was a hiccup on Friday. I fixed it yesterday and it has been processing since. Results are loading now. -- JLaTondre (talk) 13:59, 24 December 2023 (UTC)
 * Awesome. Hope you had a great festivus! And other upcoming soltice-adjacent holidays. &#32; Headbomb {t · c · p · b} 14:33, 24 December 2023 (UTC)
 * You too. -- JLaTondre (talk) 17:24, 24 December 2023 (UTC)

Possible new task
Could this bot be used to remove old transclusions? I found some from 2022 and one from 2021. Schierbecker (talk) 21:17, 18 January 2024 (UTC)
 * Probably not. The under construction removal task is based on number of days alone. For removing ITN note, it should probably be based on whether there is an open nomination vs. a strict number of days. You would be better off asking at Bot requests. -- JLaTondre (talk) 22:18, 19 January 2024 (UTC)

JCW dump?
I notice the bot hasn't processed the dump yet? Normally it's done within the first 3-4 days of the month, but we're on day 6 now... 142.169.80.39 (talk) 17:53, 6 May 2024 (UTC)
 * There was an error while running. I have kicked off processing again, but it will take awhile to complete. -- JLaTondre (talk) 22:17, 6 May 2024 (UTC)

Featured sounds no longer active
Hi, I think that "featured sounds" went the way of "featured videos", and is no longer being tracked. See: Featured sounds and Category:Historical featured content. Maybe it should be removed from the code and the documentation, next time that you're updating it? Thanks in advance! --Funandtrvl (talk) 00:37, 30 April 2024 (UTC)
 * Okay, thanks for the notice. I will update it. -- JLaTondre (talk) 22:48, 2 May 2024 (UTC)
 * Type content-featured-sounds has been removed. -- JLaTondre (talk) 21:09, 18 May 2024 (UTC)

ITN
Shouldn’t JL-bot, when updating ITN, put the ITN icon next to the article? 48JCL ( talk  •  contribs ) 21:58, 20 May 2024 (UTC)
 * I have added that one. It can be seen at WikiProject Women in Red/Recognized content. It will show up on other relevant pages with the next run this weekend. -- JLaTondre (talk) 20:16, 22 May 2024 (UTC)

Citations May 20 Output
@Headbomb : The output for the May 20 dump is producing significant less citations than normal. For example, the A's end on page 100 this time when they typically go to 111. I am investigating to see what is going on. -- JLaTondre (talk) 19:52, 22 May 2024 (UTC)

It looks like the enwiki-20240520-pages-articles.xml.bz2 dump file is missing content. It is only 18G where the last one (20240501) was 20G. It usually increases each month so that is an unexpected (and pretty big) decrease. There are no processing errors and no changes in the expected citation templates. -- JLaTondre (talk) 20:13, 22 May 2024 (UTC)


 * Would 'enwiki-20240520-pages-meta-current.xml.bz2' in https://dumps.wikimedia.org/enwiki/20240520/ be of use? Or would it be similarly crippled? &#32; Headbomb {t · c · p · b} 20:18, 22 May 2024 (UTC)
 * It looks smaller than last month's so probably crippled too. &#32; Headbomb {t · c · p · b} 20:19, 22 May 2024 (UTC)


 * The 20240601 dump is still not complete. It is typically done by this time of the month so seems like there are issues. -- JLaTondre (talk) 14:11, 8 June 2024 (UTC)
 * Hmm, I did find this announcement. It doesn't explain why the dumps have not been completed, but sounds like there might be a format change which could impact parsing once it arrives. I use a library for the parsing so not sure if it will impacted or not. -- JLaTondre (talk) 14:16, 8 June 2024 (UTC)
 * Well at least it's in progress. I checked earlier this month around the 3rd and it hadn't started.
 * See also https://www.mediawiki.org/xml/export-0.10.xsd vs https://www.mediawiki.org/xml/export-0.11.xsd &#32; Headbomb {t · c · p · b} 18:48, 8 June 2024 (UTC)
 * The 20240601 dump is now available. The bot is processing it & we will see how it goes... -- JLaTondre (talk) 19:52, 9 June 2024 (UTC)
 * Processing done. Looks good so far. Let me know if you see anything odd. -- JLaTondre (talk) 20:45, 10 June 2024 (UTC)
 * So far so good. &#32; Headbomb {t · c · p · b} 21:05, 10 June 2024 (UTC)

Cite tech report gone from Statistics?
This seems weird. &#32; Headbomb {t · c · p · b} 23:15, 2 August 2023 (UTC)
 * Short answer: The template was remamed.
 * Long answer: Cite techreport was moved to Cite tech report back in June. Everything managed to still work okay until updated the template usage in the articles. The parsing is based on the "real" template name. It is smart enough to also look for redirects to the template name, but it expects the primary name to be a non-redirect.
 * For a short-term fix, I can update the template name being checked. For a longer-term fix, I can update it to check that a template has not been renamed before parsing. However, instead of a hard-coded list (the current ones can be found highlighted in yellow here), can it be based on catagories? Maybe use Category:Citation Style 1 templates, Category:Citation Style 2 templates, and Category:Citation Style Vancouver templates? -- JLaTondre (talk) 00:25, 6 August 2023 (UTC)
 * A dynamic list could work. In Category:Citation Style 1 templates, there's a sandbox and Template:Cs1 function which aren't really templates. Maybe a membership in that category + name starts with Template:Cite_...? Same for Category:Citation Style Vancouver templates. CS2 is just Citation.
 * &#32; Headbomb {t · c · p · b} 01:08, 6 August 2023 (UTC)
 * For Bluebook style, there would be Bluebook journal, Bluebook website, and Cite court. reporter in Cite court is equivalent to journal in Bluebook journal. Bluebook website is likely useless since it doesn't seem to support journal. &#32; Headbomb {t · c · p · b} 01:12, 6 August 2023 (UTC)