Wikipedia:Bots/Requests for approval/Chartbot 6


 * The following discussion is an archived debate. Please do not modify it. To request review of this BRFA, please start a new section at WT:BRFA. The result of the discussion was Approved.

Chartbot 6
Operator:

Time filed: 21:41, Monday April 1, 2013 (UTC)

Automatic, Supervised, or Manual: Automatic

Programming language(s): PHP

Source code available:

Function overview: Correction and updating of obsolete Billboard URLs

Links to relevant discussions (where appropriate):

Edit period(s): one time run with clean-up runs

Estimated number of pages affected: 6500

Exclusion compliant (Yes/No): Yes

Already has a bot flag (Yes/No): Yes

Function details: This is the largest chunk of remaining obsolete Billboard URLs. They fall into four classes:

1) http://www.billboard.com/bbcom/retrieve_chart_history.do?model.vnuArtistId= &model.vnuAlbumId=


 * The artist_id presented is identical to the pid used by Chartbot 4. For these URLs, I can build http://www.billboard.com/artist/dummy/ . Billboard will redirect the URL and 404 the result, because "dummy" isn't a valid artist name. Note, though, that the failing redirect is to http://www.billboard.com/artist/ /dummy. From that URL, I can extract the numeric ID and build http://www.billboard.com/node/ . That page redirects to the current artist page, which gives me enough information to build a URL to the chart listing. An example of the progression: http://www.billboard.com/artist/dummy/147604 => http://www.billboard.com/artist/278867/dummy => http://www.billboard.com/node/278867 => http://www.billboard.com/artist/278867/shakira.


 * The model.vnuAlbumId field has to be ignored. Billboard used to have a page per album that presented only the charting information for that particular album. That structure has been dropped, and the charting information is only filtered by performing artist at this time. In essence, all of these URLs have to get pointed one step up in the information hierarchy.
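
The class 1 progression can be sketched roughly as follows. This is a minimal Python illustration, not Chartbot's PHP source: every function name and the injected `follow_redirect` callable are assumptions made for the example, and the album ID in the sample URL is made up. The stubbed redirect targets are the Shakira example given above, so the sketch runs without contacting Billboard.

```python
import re
from urllib.parse import urlparse, parse_qs

BASE = "http://www.billboard.com"


def legacy_artist_id(url):
    """Return model.vnuArtistId from an old chart-history URL.

    model.vnuAlbumId is deliberately ignored, since per-album pages are gone.
    """
    values = parse_qs(urlparse(url).query).get("model.vnuArtistId")
    return values[0] if values else None


def probe_url(pid):
    """Build the deliberately wrong artist URL that triggers the redirect."""
    return f"{BASE}/artist/dummy/{pid}"


def node_id_from_redirect(url):
    """Extract the numeric ID from a failing .../artist/<id>/dummy redirect."""
    match = re.search(r"^/artist/(\d+)/dummy/?$", urlparse(url).path)
    return match.group(1) if match else None


def node_url(node_id):
    return f"{BASE}/node/{node_id}"


def resolve_artist_page(legacy_url, follow_redirect):
    """Walk the chain: legacy URL -> probe -> node -> current artist page.

    follow_redirect(url) is a hypothetical callable that returns the URL a
    request ends up at, or None on failure.
    """
    pid = legacy_artist_id(legacy_url)
    if pid is None:
        return None
    node_id = node_id_from_redirect(follow_redirect(probe_url(pid)) or "")
    if node_id is None:
        return None
    return follow_redirect(node_url(node_id))


# Stubbed redirects taken from the example progression in the request.
STUB = {
    f"{BASE}/artist/dummy/147604": f"{BASE}/artist/278867/dummy",
    f"{BASE}/node/278867": f"{BASE}/artist/278867/shakira",
}

legacy = (f"{BASE}/bbcom/retrieve_chart_history.do"
          "?model.vnuArtistId=147604&model.vnuAlbumId=999")  # album ID is made up
print(resolve_artist_page(legacy, STUB.get))
# prints http://www.billboard.com/artist/278867/shakira
```

Passing the redirect lookup in as a parameter keeps the URL handling separate from the HTTP client, so a real run would only need to swap the stub for a function that performs the request.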

2) http://www.billboard.com/charts/
 * In these, the editor placed a link to the current edition of a chart: data that lasted precisely one week before being overwritten, generally several years ago. These will be processed using Chartbot 5's infobox logic. If the article has at least one infobox of the types listed below, Chartbot will validate that all the infoboxes in the article refer to the same artist. If so, it constructs a URL that points to the artist's entry for this chart type. The infobox types are:
   * person
   * musical artist
   * musician awards
   * album
   * single
   * song
   * artist discography
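
The infobox check for class 2 might look something like the sketch below. It is written in Python for illustration and is not Chartbot 5's actual logic. It assumes the infoboxes have already been parsed into (type, artist) pairs, that infoboxes of other types are ignored, and that artist names are compared case-insensitively; none of these details are stated in the request, and the function name is hypothetical.

```python
# Infobox types named in the request.
RECOGNIZED_TYPES = {
    "person",
    "musical artist",
    "musician awards",
    "album",
    "single",
    "song",
    "artist discography",
}


def common_artist(infoboxes):
    """Return the artist shared by all recognized infoboxes, or None.

    infoboxes is a list of (infobox_type, artist_name) pairs (assumed input
    shape). None is returned when no recognized infobox is present, when an
    artist name is missing, or when the infoboxes disagree.
    """
    relevant = [
        (kind, artist)
        for kind, artist in infoboxes
        if kind.strip().lower() in RECOGNIZED_TYPES
    ]
    if not relevant:
        return None
    names = {(artist or "").strip().casefold() for _, artist in relevant}
    if len(names) != 1 or "" in names:
        return None
    return relevant[0][1].strip()


print(common_artist([("album", "Shakira"), ("single", " shakira")]))
# prints Shakira
```

Returning None in every doubtful case means the link is left alone rather than repointed at the wrong artist.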

3) http://www.billboard.com/song/ / / and http://www.billboard.com/album/ / /
 * These links face the same difficulty as class 1: even in the rare cases where I can map one to a recording page at all, the recording pages no longer carry any information. All they have is a discussion forum link. These links will be retargeted at the chart history of the artist rather than the song.

4) http://www.billboard.com/artist/ /discography/songs/ and http://www.billboard.com/artist/ /discography/albums/
 * Similar issues as with 1 and 3 occur: Billboard used to split the chart data into an "albums" section and a "songs" section. It no longer provides this separation, and the links need to be repointed at the artist chart history.
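
As a rough illustration of how the four classes could be told apart, here is a Python sketch that sorts a link by its path. The patterns are inferred only from the URL shapes listed above, and the function name, the accepted host names, and the sample URLs in the usage lines are assumptions rather than anything taken from the bot.

```python
import re
from urllib.parse import urlparse

# (class number, path pattern) pairs, in the order the classes are listed.
PATTERNS = [
    (1, re.compile(r"^/bbcom/retrieve_chart_history\.do$")),
    (2, re.compile(r"^/charts/")),
    (3, re.compile(r"^/(song|album)/")),
    (4, re.compile(r"^/artist/.+/discography/(songs|albums)(/|$)")),
]


def classify(url):
    """Return the class number (1-4) of an obsolete Billboard URL, else None."""
    parsed = urlparse(url)
    if parsed.netloc.lower() not in ("www.billboard.com", "billboard.com"):
        return None
    for number, pattern in PATTERNS:
        if pattern.search(parsed.path):
            return number
    return None


# Sample URLs below are hypothetical apart from the host and path prefixes.
print(classify("http://www.billboard.com/charts/some-chart"))
# prints 2
print(classify("http://www.billboard.com/artist/278867/shakira"))
# prints None
```

A current artist page such as the Shakira URL falls through to None, so links that are already in the new form would not be touched.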

-- Kww (talk) 21:41, 1 April 2013 (UTC)

Discussion
 MBisanz  talk 01:45, 2 April 2013 (UTC)
 * Uneventful trial run from 2 Apr 2013 05:24 to 2 Apr 2013 06:02. No problems found. As expected, it sometimes provides a slightly broader URL than the context of the article would require, but the URLs it provides appear to fill the need. -- Kww (talk) 06:20, 2 April 2013 (UTC)
 *  MBisanz  talk 21:54, 2 April 2013 (UTC)
 * The above discussion is preserved as an archive of the debate. Please do not modify it. To request review of this BRFA, please start a new section at WT:BRFA.