User talk:Butlerblog

Page contents not supported in other languages.
From Wikipedia, the free encyclopedia
Commodore-64-Computer-FL

Category:Pages using infobox film with nonstandard dates[edit]

If your bot needs a new challenge, Category:Pages using infobox film with nonstandard dates. Gonnym (talk) 15:28, 25 April 2024 (UTC)[reply]

@Gonnym: Certainly! I think I can put something together that would take a bite out of that. ButlerBlog (talk) 16:17, 25 April 2024 (UTC)[reply]
It didn't take much to put together an initial bot for this. It can format single dates and should be able to clean up around half of the existing maintenance category. My expectation is that while running, I'll monitor for entries that the regex doesn't apply and see if we can address any additional entries.
ButlerBlog (talk) 17:31, 25 April 2024 (UTC)[reply]
Nice! I'll also keep watch and see if I find any strange edits. Gonnym (talk) 19:09, 25 April 2024 (UTC)[reply]
@Gonnym: While we're waiting on bot task approval, I'm wondering if you might refine the maintenance category a little bit. While I was doing a test run on the bot, I notice the category includes some of the following:
  • Empty |released=
  • |released= containing only a comment (i.e. <!-- {{Film date|Year|Month|Day|Location}} -->)
  • at least a couple entries had no |released= parameter included in the infobox
ButlerBlog (talk) 16:20, 26 April 2024 (UTC)[reply]
Yes, currently the category is also capturing instances where the date is also missing as that should be added. I might move that to a different category if you think that's better. Gonnym (talk) 20:01, 26 April 2024 (UTC)[reply]
It might be good to have those excluded just to have a more accurate number. But that's just a personal preference. It won't make a difference for the bot task either way. (In other words, only do it if you think there's a benefit and I'll go along with that.) ButlerBlog (talk) 20:23, 26 April 2024 (UTC)[reply]
Category split. Gonnym (talk) 18:04, 27 April 2024 (UTC)[reply]
@Gonnym: the bot task is approved. I'll start running it today. While I was waiting for the approval, I noticed the edit you corrected from the test run. It did have an exception for {{Infobox album}} but that same regex did not skip {{Infobox video game}}. I have since added an exception for that. In my testing, I did not notice any other issues, but if you notice anything wrong during the run, let me know as soon as you can - or trigger the shut off, and I'll address it right away. Thanks! ButlerBlog (talk) 12:35, 3 May 2024 (UTC)[reply]
addendum to the above - I'm pausing for now. I ran through around 700 articles on AWB manual just to be confident. Of those, I manually skipped 14 because it made a change (did not auto-skip as no change) and the change wasn't what it should be. I was able to refine my regexes to eliminate some of those situations, but not all of them. It's still at about a 1% error rate.
Errors that break things I believe I have eliminated - the regex has been refined to only explicitly hit {{Infobox film}} (avoiding any issues as noted previously). What's left are changes that do not completely change the contents of the parameter. In a few instances where there are 3 or more dates, the third and onward are not formatted. There are a couple of odd formatting that I have caught as well where the regex was not able to discern the type of format to apply. In these situations (which is all of what is left in the "errors" category), at least one date is formatted with {{Film date}}, but not remaining dates. My concern on those is that they would likely result in the article being cleared from the maintenance category, but still not fully formatted.
I am going to work with the 9 remaining from my test run log that were manually skipped and see if I can eliminate those from either being changed or to make sure they are fully changed. Most of them are fairly similar. Once I have that refined, I'll do another manual run for review and if the error rate is acceptable at that point, I'll run it completely. (So at least another update to come on this - thanks for your patience) ButlerBlog (talk) 15:17, 3 May 2024 (UTC)[reply]
Wow, good job on catching all those! I'll keep on eye on the changes and if I catch something I'll let you know. Gonnym (talk) 16:53, 3 May 2024 (UTC)[reply]
I've worked through those manual skips and I could solve all but 2. But that would be a 0.2% error rate,. I'd like it to be zero, but I also think the two that were not picked up are likely "one-offs" where there are not others like them. I'll do another test run manually and see if we're closer to zero and if so, I'll start it automatically. ButlerBlog (talk) 20:37, 3 May 2024 (UTC)[reply]
Another thing to watch for and maybe skip is pages with "franchise" or "film series" disambiguation. These are technically not using the correct infobox so I'm pretty sure that the data itself is also not in a style that can work with the film date template. Gonnym (talk) 05:27, 4 May 2024 (UTC)[reply]
Thanks! That may help weed out some of the questionable items. ButlerBlog (talk) 12:51, 4 May 2024 (UTC)[reply]
The fix at Asylum (2005 film) wasn't complete. I removed the second date but the festival should have been added to the template. Not a big issue though. Gonnym (talk) 14:03, 4 May 2024 (UTC)[reply]
Atlas Shrugged (film series) also didn't work. Gonnym (talk) 15:10, 4 May 2024 (UTC)[reply]
Thanks! I'll look those over before the next run. ButlerBlog (talk) 18:41, 4 May 2024 (UTC)[reply]
I'm still getting around 1% (or less) that match an existing regex to trigger a change, but are not formatted in the way the regex is expecting, so they get changed but not correctly (and because they match a regex, even imperfectly, they are not skipped). Each time, I add an adjustment to account for these, which of course adds to the level of complexity. So... in "going back to basics" what I'm going to do is take out very exact parts of the existing approved bot and run just for those to get all the "low hanging fruit" as it were. I'm going to start with the "year only" entries, then I'll add back the regex for single dates with no location or reference. That should clear a significant chunk. Then I'll return to the more complex regexes. ButlerBlog (talk) 16:27, 6 May 2024 (UTC)[reply]
Yeah, that sounds like a good way to handle this. Each time takes us one step closer. Gonnym (talk) 16:30, 6 May 2024 (UTC)[reply]
Just finishing up the simplified run through. It just did single dates where they were MDY, month year, or year only. That seems to have cleared almost half of the maintenance cat. Keeping with the concept of keeping it simplified, I'll add back one or two steps today and do another run through and see what that gets us. ButlerBlog (talk) 11:25, 7 May 2024 (UTC)[reply]
Another easy fix is converting start date like in Ballistic Kiss. And maybe also one date with country like in Ballet Mécanique. Gonnym (talk) 11:31, 7 May 2024 (UTC)[reply]
That was originally on my list. However... That's one that's a little tricky. There are some instances where there is another infobox that has |released= and depending on the formatting of the film date's released param, I've had instances where the first occurrence is skipped and it tries to change the second instance (which is the wrong infobox). But that will get worked back in at some point (when I can work out my regex to ignore the occurrences that I know it should be ignoring). ButlerBlog (talk) 12:22, 7 May 2024 (UTC)[reply]
Doh! I just discovered the "in template call" rule. That must be new (or "newer"). I know they've been working to update the program and I have installed a couple of updates the last few months after being behind for awhile, so I think that's a new option. Anyway, knowing that's there now is a major shift in what I can do (starting wtih this start date change). ButlerBlog (talk) 14:40, 7 May 2024 (UTC)[reply]

I'm doing another complete run-through now with some additional (from the original) regex patterns. Cleaning up {{start date}}, DMY, MDY, and year only. The previous run was looking for some DMY and MDY that were similar, but this picks up additional patterns. It also hits some year only entries that the previous run excluded (the ones that had HTML comments, which I'm actually removing because they are not necessarily consistent). ButlerBlog (talk) 15:14, 7 May 2024 (UTC)[reply]

So far, this is editing about half of the remaining articles. If that holds true through the remainder, we should get this down to less than 9k remaining. ButlerBlog (talk) 15:20, 7 May 2024 (UTC)[reply]
It ended up more like 2:1 edits:skips, so we got it down to around 6k to go. ButlerBlog (talk) 02:11, 8 May 2024 (UTC)[reply]
Wow, great job! I tried finding an easy pattern left in those 6k but so far didn't find any. Gonnym (talk) 07:15, 8 May 2024 (UTC)[reply]
Also, if you have a chance, can you run the bot on the TV category (notice that it's logic has expanded to the additional date parameters) so I can see what pages remain that need manual fixing? Gonnym (talk) 10:38, 8 May 2024 (UTC)[reply]
For sure! I've had it on hold for the time being because I need to make adjustments based on what's in there currently. ButlerBlog (talk) 19:14, 8 May 2024 (UTC)[reply]

Can I have your expertise?[edit]

Hi, could you tell me what caused you to revert these edits?
Someone asked a question about one edit at ANI (which is the wrong place to ask questions but, eh) and that edit was made by a similar IP who even edited one of the articles you reverted (the first one they edited): Special:Contributions/50.49.98.45.
There was also a previous IP that was blocked for disruptive editing (for 31 hours), who edited one of the other articles they edited, but I don't really understand, what is it that I should be watching out for in their edits? – 2804:F14:80C8:4701:3C07:6F36:D6F6:97FF (talk) 22:44, 28 April 2024 (UTC)[reply]

It is part of a range of IPs that has a pattern of problematic and disruptive editing. The issues for these IPs generally are excessive "see also" links (too much overlap or too broad), oversimplification of language, unnecessary links in article text, never uses edit summaries, and doesn't communicate via talk pages. Those things by themselves don't necessarily warrant broad-based reversion, and ordinarily, reverting those types of edits would involve some type of communication with the editor involved. However, in this particular case, the editor already knows this, doesn't communicate with other editors, and seems to just not care. ButlerBlog (talk) 11:43, 29 April 2024 (UTC)[reply]

The Rifleman two sources[edit]

I added two sources to replace the citation needed, I believe having both is sufficient to remove citation needed. Please check! :) Vajzë Blu (talk) 05:47, 6 May 2024 (UTC)[reply]

@Vajzë Blu: Actually, no, that's not sufficient. Before continuing to work on sourcing/citing, I would recommend a solid review of what makes a reliable (citable) source: WP:SOURCES. In looking at the list of qualifications there, ask yourself, "How does the source I am using qualify as a reliable source per Wikipedia's guidelines?"
You have to look at what your source is and where it comes from in order to answer those questions. What you've used in this instance is published via a "feed" from trend-chaser.com. That would not meet any of the criteria necessary to be considered a reliable source per our guidelines. It is a source, but is it a "reliable source"? ("No" is the correct answer.) If it's worth doing, it's worth doing right - right?
To support the content of the paragraph in question, I have found three sources that would qualify as sufficiently reliable sources. Take a look at them in light of what is required by WP:SOURCES and see if this starts to make more sense to you. If you truly can't see what was wrong with the citations you provided, please ask for more clarification and I will try to explain it better. ButlerBlog (talk) 13:50, 6 May 2024 (UTC)[reply]
Ok thank you! :) Vajzë Blu (talk) 04:11, 9 May 2024 (UTC)[reply]