Wikipedia:Bots/Requests for approval/Reports bot 2


 * The following discussion is an archived debate. Please do not modify it. To request review of this BRFA, please start a new section at WT:BRFA. The result of the discussion was Symbol keep vote.svg Approved

Reports bot 2
Operator:

Time filed: 23:07, Wednesday, May 27, 2015 (UTC)

Automatic, Supervised, or Manual: Automatic

Programming language(s): Python 3

Source code available:
 * Dependencies:
 * Wikipedia-facing scripts:
 * I invite pull requests to my scripts.

Function overview: Creates a fully automated WikiProject directory; generates lists of "related WikiProjects" based on set intersections; surfaces WikiProject-related activity metrics

Links to relevant discussions (where appropriate): Some preliminary discussion at Wikipedia talk:WikiProject Council

Edit period(s): Weekly

Estimated number of pages affected: There are over 2,600 WikiProjects and task forces. Each project would have a description page and a "related WikiProjects" page, plus there would be a full directory and sub-directory pages for each immediate subcategory of Category:WikiProjects by area. So it would affect thousands of pages, but as part of a contained set of pages; the bot would not edit any articles or even any WikiProjects.

Exclusion compliant (Yes/No): Not applicable; the bot only edits a specific set of pages it is responsible for maintaining

Already has a bot flag (Yes/No): Yes

Function details: This task involves two scripts responsible for making edits to Wikipedia: directory.py and related_projects.py.

The bot generates a list of WikiProjects based on the formal definition of WikiProjects which I have developed. A page is considered to be a WikiProject if it meets one or both criteria: (1) It is associated with a system of categories used for article evaluation, namely in the form of X articles by quality and its subcategories; (2) a page in the project namespace that is neither a redirect nor a subpage that (a) has a name beginning with "WikiProject " or (b) belongs to a category of the form X WikiProjects. The actual implementation of this definition is more complex than this, as there are a large number of exceptions, but this definition produces a very thorough list that includes WikiProjects and task forces with assessment systems, WikiProjects that never established an assessment system (or don't need one), and pages that are labeled as WikiProjects but have names other than "WikiProject Foobar".

For each WikiProject, metrics are calculated based on metadata stored on the database replicas on Tool Labs. These metrics are: (1) articles in project scope, for those that participate in assessment; (2) active WikiProject participants, defined as the number making 2+ edits in a rolling 90 day period; (3) active editors in the subject area, defined as the number making 5+ edits in an article or talk page in the project's scope in a rolling 30 day period (for those projects participating in assessment). Associated with the active WikiProject participant and active subject-area editor counts are lists of editors that meet that criteria, stored as a subpage of the directory. Each description page has buttons for opting out; users that opt out do not appear on any directory pages, nor do they count in editor counts. The opt-out page is currently located at User:Reports bot/Opt-out but I hope to eventually move it off-wiki. Each directory entry is posted to a list of all WikiProjects, plus lists for each category/subcategory the project belongs in (based on subcategories of Category:WikiProjects by area).

The directory script has been tested extensively in the bot's user space. To browse the test version, see User:Reports bot/Directory. Note that the directory in its test phase is limited to 150 WikiProjects, so many of the subdirectory pages are skeletal in nature. Upon approval I would like to move the directory to WikiProject Directory.

The lists of related WikiProjects will reside as subpages of Related WikiProjects; I haven't decided yet if that page will be an independent portal or a redirect to the directory. Each WikiProject and task force participating in article assessment is compared against the all the other ones, with the exception that projects are not compared to themselves and comparisons involving projects and task forces within those projects are not made. The ten projects that have the most articles in common with the project being compared are listed. The script has not been tested on-wiki but I have used the script to generate a 1.4 MB text file of what the output should be like.

Discussion
Please perform extend WikiProjects to 500. Then we can discuss moving to Wikipedia namespace. -- Magioladitis (talk) 23:31, 27 May 2015 (UTC)
 * Magioladitis: I extended the directory test to 500; you can see the updated directory here. For the related WikiProjects function, I ran it for all the WikiProjects I could do comparisons for; there was no real feasible way to reduce it to just 500 tests. All the entries are listed here, or you can see in-line links in the directory. Harej (talk) 01:58, 29 May 2015 (UTC)

Harej are the WikiProjects aware of this bot request? -- Magioladitis (talk) 11:57, 29 May 2015 (UTC)
 * I sent a message to Wikipedia talk:WikiProject Council and Wikipedia talk:WikiProject X, who are most likely to be interested in this bot request. Now that I can generate a list of every WikiProject, I could probably send a mass message to them, though I don't know that it would be worth it. Harej (talk) 14:29, 29 May 2015 (UTC)

Harej "a page in the project namespace that is neither a redirect nor a subpage that (a) has a name beginning with "WikiProject " or (b) belongs to a category of the form X WikiProjects." Are you aware of any project that does not meet (a)? -- Magioladitis (talk) 20:30, 29 May 2015 (UTC)
 * Yes, several. Basically anything at the beginning of the complete directory (anything that comes before "WikiProject" in the alphabet) plus Wikipedia for World Heritage and WikiTV which come later in the alphabet. Though not titled "WikiProject_" they are still sorted as WikiProjects. One of the values of the directory is surfacing WikiProjects you did not even know existed. Harej (talk) 23:01, 29 May 2015 (UTC)

Harej the output is very good. Let's wait 2-3 days so that we get feedback if anyone would like to. Ping me if I forget to revisit the page after 2 days. -- Magioladitis (talk) 23:18, 29 May 2015 (UTC)

Harej OK. Since it's been 24 hours and I expect no complains about this, I am approving it. -- Magioladitis (talk) 23:16, 30 May 2015 (UTC)

-- Magioladitis (talk) 23:17, 30 May 2015 (UTC)
 * The above discussion is preserved as an archive of the debate. Please do not modify it. To request review of this BRFA, please start a new section at WT:BRFA.