Wikipedia:Bots/Requests for approval/CopyPatrolBot


 * The following discussion is an archived debate. Please do not modify it. To request review of this BRFA, please start a new section at Bots/Noticeboard. The result of the discussion was

CopyPatrolBot
Operator:

Time filed: 02:39, Monday, February 19, 2024 (UTC)

Function overview: Report potential copyright issues from CopyPatrol to PageTriage

Automatic, Supervised, or Manual: Automatic

Programming language(s): Python

Source code available:

Links to relevant discussions (where appropriate): Feedback about changes to CopyPatrol was solicited at m:Special:PermaLink/25725587.

Active period(s): Continuous

Estimated number of pages affected: n/a

Namespace(s): Main/Article, Draft

Exclusion compliant: n/a

Copyright violation bot: Yes (required for trial)

Function details: This is a replacement for EranBot (BRFA 3) to report potential copyright issues found in articles and drafts from CopyPatrol to PageTriage (shown in Special:NewPagesFeed). When a report is added to CopyPatrol, check if the page is in PageTriage. If so, report the revision. The bot does not edit.

Notes:
 * The code has been completely rewritten, migrating from Python 2 to Python 3.11 and from the XML-RPC-based API that EranBot uses to the new Turnitin Core API (TCA). CopyPatrol is also being migrated from Toolforge tools (EranBot is running on the deprecated Grid Engine.) and database to WMCS Cloud VPS instances.
 * Once this bot starts running, even for a trial, EranBot will be disabled.
 * We're currently waiting for the WMF to sign an agreement with Turnitin before we receive production TCA access. (required before trial)

Discussion

 * pending the TCA access. Otherwise looks fine to me, replacing functionality of existing bot. Can you drop a note on WT:NPP/R and WT:AFC please, alerting them of this BRFA. Also, is this handover being coordinated with the operator of EranBot? ProcrastinatingReader (talk) 11:30, 20 February 2024 (UTC)
 * The handover was authorized by User:ערן at T334265. JJMC89 and myself already have access to the  account on Toolforge and will be able to stop/restart it as needed. &mdash;  MusikAnimal  talk  15:59, 20 February 2024 (UTC)
 * Yes the handover is co-ordindated. Note that EranBot bot is used for multiple tasks beside copyright violations (outside enwiki), and the handover is specifically for copyvio aspect. Having a dedicated user for copyright violation bot seems to be more correct than a general bot, and thanks for JJMC89 and MusikAnimal drive the migration and the rewrite effort. Eran (talk) 16:31, 20 February 2024 (UTC)
 * I dropped pointers to this BRFA on those two pages. It would be helpful to have trial approval and  in advance of getting TCA access to cut down on any wait or downtime. (We're coming up on the Grid Engine shutdown soon.) —&thinsp;JJMC89&thinsp; (T·C) 23:33, 5 March 2024 (UTC)
 * , I think PR wants you to have access before granting a trial. Primefac (talk) 20:58, 10 March 2024 (UTC)
 * , please comment. —&thinsp;JJMC89&thinsp; (T·C) 21:13, 11 March 2024 (UTC)
 * Yeah, as Primefac said, I was waiting for this to be ready to begin before approving and requesting a crat give the bot the appropriate perms.
 * Not that it matters too much either way, but it seems like the delay time on TCA access is much greater than BAG delay time, and it's better practice not to assign userrights to bots-in-trial until they're ready to commence trial imminently. In this case, it seems like you're blocked on other things for an indeterminate period of time, before you're able to begin trial.
 * If you really want it sooner and Primefac is happy with that, then sure. Otherwise yeah, since there's no objections from users working in this area and there is the consent of the existing bot operator, I'm happy to tick this when you're ready to begin. ProcrastinatingReader (talk) 15:26, 12 March 2024 (UTC)
 * Also a poke to as some people involved in copyright issues, in case you have any comments, or know of anyone who might. ProcrastinatingReader (talk) 11:34, 20 February 2024 (UTC)
 * Please report back after this period on the bot's progress. Primefac (talk) 12:42, 13 March 2024 (UTC)
 * Primefac (talk) 12:00, 30 April 2024 (UTC)
 * Not the Bot Operator but this link is what I think you mean. Nobody  ( talk ) 12:43, 30 April 2024 (UTC)
 * The trial has been running for three weeks so far. I haven't encountered any issues. Feel free to review Special:Log/CopyPatrolBot at your convenience. —&thinsp;JJMC89&thinsp; (T·C) 00:48, 1 May 2024 (UTC)
 * [ logs] I haven't encountered any issues. —&thinsp;JJMC89&thinsp; (T·C) 00:11, 9 May 2024 (UTC)
 * I don't think that ?offset= link is working. Here's one without the offset: Log. – Novem Linguae (talk) 01:43, 9 May 2024 (UTC)
 * Fixed —&thinsp;JJMC89&thinsp; (T·C) 01:55, 9 May 2024 (UTC)
 * Primefac (talk) 12:58, 18 May 2024 (UTC)
 * The above discussion is preserved as an archive of the debate. Please do not modify it. To request review of this BRFA, please start a new section at Bots/Noticeboard.