User talk:Citation bot/Archive 9

journal = www.foobar.com → journal = Www.foobar.com
The bot is not a human being, it cannot always deal with rubbish. Technically, DNS is not case sensitive, so it is not a bug. I guess we can add a check for "starts with www". AManWithNoPlan (talk) 14:08, 9 August 2018 (UTC)


 * Well, you can add an exception for www\.. Headbomb {t · c · p · b} 14:16, 9 August 2018 (UTC)


 * https://github.com/ms609/citation-bot/pull/524.  AManWithNoPlan (talk) 21:10, 9 August 2018 (UTC)

convert work to journal in cite journal, convert work to magazine in cite magazine
I think the first one is already done (work &rarr; journal) in, but something similar should also be done in cite magazine (work &rarr; magazine). Headbomb {t · c · p · b} 16:21, 12 August 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/550 AManWithNoPlan (talk) 01:27, 13 August 2018 (UTC)

fixed

Adding citeseerx url where citeseerx parameter exists
In edit one the bot added citeseerx. In edit two it added the url to the citeseerx paper. This seems like unnecissary, and it seems weird that a second run diectly after the previous run results in "more", and not everything is added during the first run.

https://en.wikipedia.org/w/index.php?title=Stephen_Hawking&diff=prev&oldid=854866425

--(t) Josve05a  (c) 09:44, 14 August 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/543 AManWithNoPlan (talk) 13:38, 14 August 2018 (UTC)


 * Still does it, and also adds handle urls when hdl is already set. Headbomb {t · c · p · b} 05:05, 15 August 2018 (UTC)
 * Yup. Until that pulls is pulled, it will still do that.  :-(  AManWithNoPlan (talk) 14:17, 15 August 2018 (UTC)
 * New pull to actually fix it. https://github.com/ms609/citation-bot/pull/572 AManWithNoPlan (talk) 03:37, 16 August 2018 (UTC)

fixed

GIGO? PMID=0
The database does return zero for the PMID. https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esummary.fcgi?tool=DOIbot&email=martins@gmail.com&db=pmc&id=2676591 AManWithNoPlan (talk) 19:22, 14 August 2018 (UTC)
 * Pull submitted. https://github.com/ms609/citation-bot/pull/559  AManWithNoPlan (talk) 19:28, 14 August 2018 (UTC)
 * Which did not work. New one. https://github.com/ms609/citation-bot/pull/566 AManWithNoPlan (talk) 14:16, 15 August 2018 (UTC)

of --> Of
Possibly caused by the dot before 'Of'. It might be easier to add an exception for which I believe are the only ones affected by this. Headbomb {t · c · p · b} 05:13, 15 August 2018 (UTC)
 * Ann of Math
 * Ann of Math (2)
 * Ann of Sci
 * Ann. of Math.
 * Ann. of Math. (2)
 * Ann. of Sci.
 * Special case for "Ann. of " https://github.com/ms609/citation-bot/pull/565 AManWithNoPlan (talk) 14:30, 15 August 2018 (UTC)

More caps

 * För/för
 * Für/für [guessing this might be problematic too]

Headbomb {t · c · p · b} 06:06, 15 August 2018 (UTC)
 * Actually we already had the fur version. https://github.com/ms609/citation-bot/pull/565 AManWithNoPlan (talk) 14:30, 15 August 2018 (UTC)

fixed

Don't remove wikilinks from title, use title-link instead
Instead of removing wikilinks form title, the bot should add the link as title-link. https://en.wikipedia.org/w/index.php?title=Ignore_all_rules&diff=prev&oldid=855068220 was a bad edit. User:Josve05a 18:28, 15 August 2018 (UTC)
 * I have added some code that adds support for title-link should be deployed at some point. https://github.com/ms609/citation-bot/pull/569  AManWithNoPlan (talk) 22:31, 15 August 2018 (UTC)

fixed

Invalid ISBN
The bot added an invalid ISBN in [//en.wikipedia.org/w/index.php?title=Wikipedia_administrators&diff=prev&oldid=855141849 this edit]. 978160558012X is not valid. Might be GiGo. (t) Josve05a  (c) 06:42, 16 August 2018 (UTC)
 * GIGO it is. That’s the crossref ISBN.  https://api.crossref.org/v1/works/http://dx.doi.org/10.1145/1358628.1358871  Someone changed the ISBN 10 to a 13 and forgot that the check digit often changes.   It certainly is never X!   Since ISBN is a parity check and not an ECC type check we have no way of knowing what the error is.  Also a few books have been assigned invalid ISBN by publishers over the years.  That’s minor compared to the using the same ISBN for multiple books—which is one reason there is no cite ISBN.  AManWithNoPlan (talk) 13:41, 16 August 2018 (UTC)

notabug

Invalid DOI
The bot added  in [//en.wikipedia.org/w/index.php?title=Myocardial_infarction&diff=prev&oldid=855146551 this edit]. Seem wrong. Shouldn't we validate doi's before adding to avoid garbage? (t) Josve05a  (c) 07:41, 16 August 2018 (UTC)


 * Personally, I prefer the garbage addition in cases like this. It make humans look for the real, non-garbage doi. Headbomb {t · c · p · b} 12:11, 16 August 2018 (UTC)
 * I should note several things.  That is a standards compliant DOI-a suffix of zero length is allowed.  That is the DOI according to pubmed.  It is clearly rubbish though.  There has been a long discussion on this in the past and generally people seem to like dead DOIs since often you can google the string and find them.  Although perhaps an empty suffix is pretty useless. AManWithNoPlan (talk) 13:16, 16 August 2018 (UTC)
 * Once this is committed, the Bot will require that DOIs have at least one character in the suffix and one character in the prefix. https://github.com/ms609/citation-bot/pull/580 AManWithNoPlan (talk) 13:55, 16 August 2018 (UTC)

fixed

Bibcodes are not expanded
for those not wanting to look. AManWithNoPlan (talk) 00:13, 6 March 2018 (UTC)
 * My data handling improvements have already fixed this bug on GitHub development branch AManWithNoPlan (talk) 03:52, 6 March 2018 (UTC)
 * Is not filling in the bibcode when, for example, expanding from doi also fixed by this?
 * Tested with 2018/07/23 release, appears not fixed. Lithopsian (talk) 14:31, 24 July 2018 (UTC)
 * This is not fixed on Wikipedia servers, but it seems to work on the Travis servers and at home. Strange. AManWithNoPlan (talk) 02:39, 29 July 2018 (UTC)
 * Looks like the problem is that we've exhausted our 5000 queries for the day. I'll request a second key for testing, which may go some way to help. Martin  (Smith609 – Talk)  05:57, 30 July 2018 (UTC)
 * you can mention to the NASA folk that User:Bibcode Bot has been whitelisted for infinite (and I believe unthrottled) queries. Headbomb {t · c · p · b} 13:12, 30 July 2018 (UTC)


 * We've been granted extra requests, and I plan to refactor the code in the future so it makes better use of them. Marking as fixed. Martin  (Smith609 – Talk)  09:42, 3 August 2018 (UTC)
 * I just tried it on User:Headbomb/sandbox3... doesn't seem to work. Headbomb {t · c · p · b} 12:03, 3 August 2018 (UTC)
 * Does not work for me either. I wonder if he forgot key.  AManWithNoPlan (talk) 13:05, 3 August 2018 (UTC)

Expand citation: - Checking AdsAbs database x PHP_ADSABSAPIKEY environment variable not set. Cannot query AdsAbs. [..> yadsabs]: no record retrieved. [..> rossref] - Checking CrossRef database for doi. [..> indpmid] - Searching PubMed... nothing found. Possibly related, there seems to be something weird with that log beyond the key not being set. (stray y? Missing C?). Headbomb {t · c · p · b} 19:49, 14 August 2018 (UTC)
 * that’s debug information compressed to seven letters or less. AManWithNoPlan (talk) 02:32, 17 August 2018 (UTC)


 * It's working! Thank you very much to whoever made this happen! Headbomb {t · c · p · b} 14:40, 17 August 2018 (UTC)

It seems to be down right now
500 Internal Server Error AManWithNoPlan (talk) 04:43, 17 August 2018 (UTC)
 * fixed AManWithNoPlan (talk) 13:14, 17 August 2018 (UTC)

Whitelist all whitelisted parameters
See Module:Citation/CS1/Whitelist for a list. Can't we invoke this list somehow instead of a manually maintaining one on GitHub? (t) Josve05a  (c) 08:11, 27 July 2018 (UTC)


 * we are very aware of that page.  We could https a copy and then use http://php.net/manual/en/lua.call.php to run it or we could parse it with php if we are gutsy.   another idea is to have a process update the list automatically and make a git pull.  AManWithNoPlan (talk) 13:21, 27 July 2018 (UTC)
 * is it possible to directly run wiki lua files on the tool server without https getting them? AManWithNoPlan (talk) 13:25, 27 July 2018 (UTC)
 * Not sure if the question was adressed to me, because I have no idea. (t) Josve05a  (c) 06:26, 16 August 2018 (UTC)

https://github.com/ms609/citation-bot/pull/599 AManWithNoPlan (talk) 17:55, 19 August 2018 (UTC)
 * AManWithNoPlan (talk) 00:02, 21 August 2018 (UTC)

Bot does not always do what it says it is going to do
https://github.com/ms609/citation-bot/pull/589 AManWithNoPlan (talk) 15:41, 17 August 2018 (UTC)

Category API is crap
It always has been, the weight of tradition is against you. 😬😄😂😆😜😏 AManWithNoPlan (talk) 20:52, 17 August 2018 (UTC)
 * The problem is that the tool outputs text, not HTML. HTML ignores linefeeds.  It really is designed to be run on the command line.  AManWithNoPlan (talk) 20:55, 17 August 2018 (UTC)


 * It also seems to not edit the majority of articles it could edit. I'll investigate further, but it seems running the bot on individual articles in a category yield more edits. BTW, the new API for the single page run is beautiful. Headbomb {t · c · p · b} 20:56, 17 August 2018 (UTC)
 * The category code runs in FAST mode, that nice little check box that defaults to SLOW is not an option. AManWithNoPlan (talk) 21:06, 17 August 2018 (UTC)
 * That's certainly a bummer. Maybe it should be. Headbomb {t · c · p · b} 21:08, 17 August 2018 (UTC)
 * I had them change the "Expand citations" link off to the left size to that mode a while back. At least a "&slow=1" option should probably be added.  AManWithNoPlan (talk) 21:09, 17 August 2018 (UTC)
 * I shall take the fact that your complaint is "The formatting of the logfile offends my sensibilities" as a compliment to the current state of the bot. AManWithNoPlan (talk) 21:11, 17 August 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/596 AManWithNoPlan (talk) 22:06, 17 August 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/598 AManWithNoPlan (talk) 22:12, 17 August 2018 (UTC)


 * Turning on slow mode did more https://en.wikipedia.org/w/index.php?title=Draft%3ADr._Erick_Jones&type=revision&diff=855458969&oldid=855458884   AManWithNoPlan (talk) 13:03, 18 August 2018 (UTC)


 * What's the syntax? https://tools.wmflabs.org/citations/category.php?cat=GATEGORY&slow=1 ? Headbomb {t · c · p · b} 13:35, 18 August 2018 (UTC)
 * yes AManWithNoPlan (talk) 19:39, 18 August 2018 (UTC)
 * Just following this discussion, when I run it like this it only analyzes 1 page in the category. Retrying it makes it run on one more article etc etc. Redalert2fan (talk) 19:47, 18 August 2018 (UTC)
 * What full URL are you using? AManWithNoPlan (talk) 20:13, 18 August 2018 (UTC)
 * For example: https://tools.wmflabs.org/citations/category.php?cat=Electric_multiple_units_of_Japan&slow=1 Redalert2fan (talk) 20:15, 18 August 2018 (UTC)
 * interesting.  It only works on one page and stops.  AManWithNoPlan (talk) 21:33, 18 August 2018 (UTC)
 * The same happens without &slow=1, it seems categories are broken... Redalert2fan (talk) 08:53, 19 August 2018 (UTC)
 * Broken for a long time actually. AManWithNoPlan (talk) 18:08, 19 August 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/612 AManWithNoPlan (talk) 18:59, 19 August 2018 (UTC)

Multiple pages now fixed. AManWithNoPlan (talk) 02:04, 20 August 2018 (UTC)
 * I'm not sure that I ever envisaged this page being used by actual humans! Glad that it's coming in useful. As a treat, it is now in glorious technicolour (-: Martin  (Smith609 – Talk)  14:21, 21 August 2018 (UTC)

category output is ugly and generally useless

 * Category/Slow modes now fixed and working like a charm, but API is still outputting pretty unreadable crap. Headbomb {t · c · p · b} 15:57, 20 August 2018 (UTC)


 * AManWithNoPlan (talk) 00:07, 21 August 2018 (UTC)

Failed to capitalize when there's an & ?
This is possibly because there's nothing else but capitalization going on. Headbomb {t · c · p · b} 00:56, 18 August 2018 (UTC)
 * Are you sure it does not. It does for me. AManWithNoPlan (talk) 00:57, 18 August 2018 (UTC)

Do not add / remove class
https://github.com/ms609/citation-bot/pull/601 AManWithNoPlan (talk) 17:58, 19 August 2018 (UTC)
 * For User:Smith609 asking for consensus, see class= from non-cite arxiv templates|this thread. Headbomb {t · c · p · b} 00:20, 20 August 2018 (UTC)

Remove location in cite journal
https://github.com/ms609/citation-bot/pull/600 AManWithNoPlan (talk) 18:34, 18 August 2018 (UTC)

Remove stray commas at the end of title
https://github.com/ms609/citation-bot/pull/611 AManWithNoPlan (talk) 18:20, 19 August 2018 (UTC)

Capitalization is hard
should not be changed to  in journal as in https://en.wikipedia.org/w/index.php?title=Smilodon&diff=prev&oldid=855615980 (t)  Josve05a  (c) 16:26, 19 August 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/610 AManWithNoPlan (talk) 18:06, 19 August 2018 (UTC)

AManWithNoPlan (talk) 23:38, 20 August 2018 (UTC)

Use title-link for linked italicized titles as well
https://en.wikipedia.org/w/index.php?title=Tarrasch_Defense&diff=855512383&oldid=836004652 here the wikilink was just removed. (t) Josve05a  (c) 21:16, 18 August 2018 (UTC)
 * should remove italics too.  AManWithNoPlan (talk) 21:22, 18 August 2018 (UTC)
 * Italic markup should be removed from 'book' citations (work or aliases not set) but a cs1|2 template with a wikilinked title and without url is perfectly legitimate; there is an article Modern Chess Openings so:
 * is a correctly formed citation. This form is supported by the cs1|2 documentation at Template:Citation which read.
 * Do not misunderstand my point here as a contradiction of what I wrote elsewhere. When the whole title is wikilinked, that is acceptable.  But, when the title looks like this one from the other discussion, wikilinking is inappropriate:
 * A definitive abelisaurid theropod dinosaur from the early Late Cretaceous of Patagonia
 * —Trappist the monk (talk) 16:49, 19 August 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/604 AManWithNoPlan (talk) 18:03, 19 August 2018 (UTC)
 * A definitive abelisaurid theropod dinosaur from the early Late Cretaceous of Patagonia
 * —Trappist the monk (talk) 16:49, 19 August 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/604 AManWithNoPlan (talk) 18:03, 19 August 2018 (UTC)

Do not remove partial wikilinks in titles
In https://en.wikipedia.org/w/index.php?title=Donald_Trump&diff=prev&oldid=855512893 the bot changed |title=Trump Revealed: The Definitive Biography of the 45th President to |title=Trump Revealed: The Definitive Biography of the 45th President I don't belive this should be done. (t) Josve05a  (c) 21:22, 18 August 2018 (UTC)
 * These links are almost always wrong. They are things like title=Trump Revealed: The Definitive Biography of the 45th President .  AManWithNoPlan (talk) 18:01, 19 August 2018 (UTC)
 * Yes, but in cases of human error/garbage input, a bot should not be the arbitor of what is and what isn't correct. In this case it is correct, yet the bot will continue to bot war unless specifically told not to edit the page. (t) Josve05a  (c) 19:15, 19 August 2018 (UTC)


 * Or you can use Trump Revealed. Headbomb {t · c · p · b} 00:25, 20 August 2018 (UTC)
 * That would link the entire content of the title. In this case that is perhaps an acceptable solution, but not in all. (t) Josve05a  (c) 01:06, 20 August 2018 (UTC)
 * good idea. If more than one link remove.  Otherwise expand to whole text.  That will encourage people to fix bad ones AManWithNoPlan (talk) 02:06, 20 August 2018 (UTC)

https://github.com/ms609/citation-bot/pull/620 AManWithNoPlan (talk) 17:31, 20 August 2018 (UTC) AManWithNoPlan (talk) 23:38, 20 August 2018 (UTC)

Bot added invalid ISBN
❌ the ISBN in the metadata is 978160558012X sadly. https://api.crossref.org/v1/works/http://dx.doi.org/10.1145/1358628.1358871  AManWithNoPlan (talk) 19:19, 19 August 2018 (UTC)
 * Just because it is invalid does not mean that it is wrong, such as the evil which are tracked in https://en.wikipedia.org/wiki/Category:Pages_with_listed_invalid_ISBNs  AManWithNoPlan (talk) 20:13, 19 August 2018 (UTC)

Clean up PMC
https://github.com/ms609/citation-bot/pull/616 AManWithNoPlan (talk) 14:26, 20 August 2018 (UTC)
 * AManWithNoPlan (talk) 00:01, 21 August 2018 (UTC)

editorn-first
editor-lastn and editor-firstn is the recognized paramenter, while editorn-first (and last) is an alias - not the other way around. The bot should not change editorn-first to editor-firstn as in https://en.wikipedia.org/w/index.php?title=Rhagonycha_greeni&diff=prev&oldid=855515605 (t) Josve05a  (c) 21:49, 18 August 2018 (UTC)
 * There is no reason to change these parameters from one legitimate form to another legitimate for except to unify the form within the template. This applies to all multi word enumerated parameters: author-mask6, interviewer5-link, etc
 * —Trappist the monk (talk) 16:58, 19 August 2018 (UTC)


 * this is because the citation templates have sooo many parameter choices. This pull now will add all of them and will also add a check to make sure that we notice any new ones.  AManWithNoPlan (talk) 17:56, 19 August 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/599 AManWithNoPlan (talk) 18:07, 19 August 2018 (UTC)
 * AManWithNoPlan (talk) 00:02, 21 August 2018 (UTC)

More caps
https://github.com/ms609/citation-bot/pull/619/files AManWithNoPlan (talk) 16:26, 20 August 2018 (UTC)
 * AManWithNoPlan (talk) 23:39, 20 August 2018 (UTC)

ISBN numbers
Are referring to the ISBN 10 to 13 fixing? AManWithNoPlan (talk) 01:59, 20 August 2018 (UTC)
 * also white space should not be used in isbn, so we remove it AManWithNoPlan (talk) 19:03, 20 August 2018 (UTC)

Google Books in Cite Web are not expanded
Yup. Note that for google books URLs, cite web won't expand, but cite book will expand. Headbomb {t · c · p · b} 15:00, 20 August 2018 (UTC)
 * I absolutely hate having to manually change those to cite book..... Not sure why I did not do this sooner: https://github.com/ms609/citation-bot/pull/617 AManWithNoPlan (talk) 15:22, 20 August 2018 (UTC)

Bot adds doi url
https://github.com/ms609/citation-bot/pull/618 AManWithNoPlan (talk) 15:37, 20 August 2018 (UTC)
 * AManWithNoPlan (talk) 00:01, 21 August 2018 (UTC)

Translator link -> last##

 * The bot ran during a git pull and so the files were in an inconsistent state. AManWithNoPlan (talk) 15:16, 21 August 2018 (UTC)
 * Not sure how you figured that out, but glad to hear there's no underlying problem! (-: Martin  (Smith609 – Talk)  15:21, 21 August 2018 (UTC)

work parameter
Making edits such as https://en.wikipedia.org/w/index.php?title=Katarzyna_Bachleda-Curuś&diff=prev&oldid=855644962 may be a violation of COSMETICBOT. (t) Josve05a  (c) 20:49, 19 August 2018 (UTC)
 * I don't really understand why the bot would remove parameters like that there. Not only is it cosmetic, the edit doesn't make sense. --Izno (talk) 20:56, 19 August 2018 (UTC)
 * It is done to discourage the use of the generic and often misused work. In almost all cases, the journal, website, etc. are better choices.  Also, in this case publisher is already set to the wrong thing, it should use website instead, which is an alias for work.     AManWithNoPlan (talk) 21:17, 19 August 2018 (UTC)
 * I will work on code to do a better job. AManWithNoPlan (talk) 21:20, 19 August 2018 (UTC)
 * Here is the improvement. https://github.com/ms609/citation-bot/pull/614   If an alias of work is filled in, the the empty work will be removed to discourage future adding of it which would be an error.  If an alias of work is not set, then it will be changed to a template specific parameter if relevant: for example inf cite journal the empty work will be changed to an empty journal.  AManWithNoPlan (talk) 21:37, 19 August 2018 (UTC)
 * Just ensure it is compliant with COSMETICBOT or have gotten BAG approval to be making edits which will make no appearance difference. (t) Josve05a  (c) 21:42, 19 August 2018 (UTC)
 * WP:COSMETICBOT compliance is good for large-scale runs, but since the bot is user activated, it's not that big a deal if the bot does minor cleanup like that on select, user-requested pages. There's tons of cosmetic things (e.g. page→pages), which would in theory be nice to categorize as cosmetic, but this realistically would only be an issue if you run the bot on categories. So maybe in 'category mode', it should skip the cosmetic stuff. Seems a like a very high time cost for little payoff though at this time, but would matter if the bot started to edit on its own. Headbomb {t · c · p · b} 00:34, 20 August 2018 (UTC)
 * AManWithNoPlan (talk) 00:02, 21 August 2018 (UTC)

Put who activated the bot in the edit summary
I'm pretty sure the bot used to mention who activated it at some point. This will be particularly important when the Category api will be invoked. Headbomb {t · c · p · b} 12:49, 21 August 2018 (UTC)

Raw url refs
Not gonna happen. We work on templates only. That is basically a different bot. AManWithNoPlan (talk) 04:30, 20 August 2018 (UTC)


 * "Citation bot" has a scope of citations. There's no reason not to deal with raw urls. Headbomb {t · c · p · b} 04:32, 20 August 2018 (UTC)


 * we do nothing with references.  We only do templates. It would require a completely different code base. AManWithNoPlan (talk) 04:34, 20 August 2018 (UTC)


 * It's already approved for this. See Bots/Requests_for_approval/Citation_bot_8. Headbomb {t · c · p · b} 04:35, 20 August 2018 (UTC)


 * approved yes. Code no. AManWithNoPlan (talk) 04:38, 20 August 2018 (UTC)

The 'entirely new' codebase is a minute amount of pre-processing. ]*)>http(s)?:\/\/([^ \>]+)<\s*\/\s*ref> →  If the bot gets a hit with that, expand, if not, keep raw url. Headbomb {t · c · p · b} 04:41, 20 August 2018 (UTC)
 * Actually not that simple. Search needs to be non greedy.  Would need to have some type of bogus parameter set to either completely revert at end or just remove the specific flag.  Lots of testing, and that’s the pain. AManWithNoPlan (talk) 04:54, 20 August 2018 (UTC)


 * Point is, this wouldn't be an epic 3-months long development process. Yes some thought needs to be put into it, but the codebase for recognizing stuff from URLs is relatively mature. If that gets a hit, whatever you're going to have will beat the raw url. And it'll save doing before running the bot to get . Headbomb {t · c · p · b} 04:58, 20 August 2018 (UTC)

Laying in bed it came to me: We have do that since url might get upgraded but find no title, such as citeseerer or bad doi. AManWithNoPlan (talk) 12:59, 20 August 2018 (UTC)
 * 1) search for and change to cite web, but with extra CITATION_BOT parameter that is encrypted url
 * 2) do normal bot stuff
 * 3) when writing out look for CITATION_BOT flag and If no title is set then just decrypt url and echo that. If title is set remove special flag.

More information; AManWithNoPlan (talk) 13:10, 20 August 2018 (UTC)
 * 1) the backup url is encrypted to hide it from bot
 * 2) CITATION_BOT is a magic phrase so if we leave one in the Bot will notice
 * 3) we have unused code to call Citoid but they throttle us too much
 * 4) we need to reject any urls with spaces or set braces or square braces just to be safe
 * 5) we would want to try this on the dev bot first

Work in progress https://github.com/ms609/citation-bot/pull/621 AManWithNoPlan (talk) 20:28, 20 August 2018 (UTC)


 * Note,  works better. Headbomb {t · c · p · b} 20:41, 20 August 2018 (UTC)


 * Opinions on this https://github.com/ms609/citation-bot/pull/628  AManWithNoPlan (talk) 19:34, 21 August 2018 (UTC)
 * No opinion on the technical side of things, but that's pretty much what I suggested above, so I'm all for it. Headbomb {t · c · p · b} 19:35, 21 August 2018 (UTC)


 * Seems to be missing \[? and \]? around the URL to catch  Headbomb {t · c · p · b} 19:39, 21 August 2018 (UTC)
 * Character escaping seems off, my understand is that '\>' could be '>'. That or all the other '<' or '>' need to be escaped. Headbomb {t · c · p · b} 19:43, 21 August 2018 (UTC)
 * No longer escape things that do not need escapes in RegEx. Added support for [http.....] AManWithNoPlan (talk) 20:03, 21 August 2018 (UTC)
 * Very much looking forward to test this. I've got plenty of pages in the pipeline. Headbomb {t · c · p · b} 20:08, 21 August 2018 (UTC)

This works, so, sooooooooooo well it's insane. Whenever it fails, it's because there's an actually problem with the url/identifiers. This is amazing. Headbomb {t · c · p · b} 14:18, 22 August 2018 (UTC)

Deleted google books URL listed even though nothing is deleted
I've noticed this a few times now; there's an example visible at

https://tools.wmflabs.org/citations/doibot.php?page=Talk:Galileo%20Galilei

See that line - https://books.google.com/books ? We must be adding and removing this URL somewhere along the line, because we're not editing the citation...

Martin  (Smith609 – Talk)  14:44, 21 August 2018 (UTC)


 * Possibly related to User talk:Citation bot or User talk:Citation bot/Archive 29? Headbomb {t · c · p · b} 14:59, 21 August 2018 (UTC)
 * It is not doing anything other than echoing that url. Strange.  AManWithNoPlan (talk) 15:11, 21 August 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/627 AManWithNoPlan (talk) 15:50, 21 August 2018 (UTC)

New feature request, run from talk category
Using the bot on Category:Draft-Class Astronomy articles will, obviously, run the bot on that category.
 * https://tools.wmflabs.org/citations/category.php?cat=Draft-Class_Astronomy_articles

However, it would be very useful to have the bot run on the main pages associated with those talk pages in the category. I.e. take all pages in Category:Draft-Class Astronomy articles, convert to main pages, and run the bot on those. Perhaps via something like or
 * https://tools.wmflabs.org/citations/ talk category.php?cat=Draft-Class_Astronomy_articles
 * https://tools.wmflabs.org/citations/category.php? talk cat=Draft-Class_Astronomy_articles

Headbomb {t · c · p · b} 14:39, 17 August 2018 (UTC)


 * https://tools.wmflabs.org/citations/category.php?cat=Draft-Class_Astronomy_articles&talk=1 would be the way to go. The Bot would get a list of the category and then add "Talk:" to each page title (or change "User:" to "User_Talk:" in the case of user pages or just ignore user talk pages).  AManWithNoPlan (talk) 18:28, 19 August 2018 (UTC)

Martin  (Smith609 – Talk)  14:19, 21 August 2018 (UTC)
 * Possible solution [Suggestion 1]: modify function  in WikiFunctions.php so that it removes the namespace from all pages that it queues up to visit.  I can't think of many situations when one would want to run the bot on pages outside the main namespace, and it could be disruptive if someones deliberately included an incorrectly formatted citation in a discussion.
 * Alternative solution [Suggestion 2]: Add the main namespace equivalent of each Talk page to the array of pages to be visited.
 * I've gone ahead and implemented suggestion 1. Martin  (Smith609 – Talk)  14:54, 21 August 2018 (UTC)


 * Which is suggestion 1? Or more clearly, what's the syntax for this? Headbomb {t · c · p · b} 19:44, 21 August 2018 (UTC)
 * Whatever he did, he did not commit it to github AManWithNoPlan (talk) 19:47, 21 August 2018 (UTC)
 * https://tools.wmflabs.org/citations/category.php?cat=Draft-Class_Astronomy_articles now visits all mainspace articles whose talk pages are in Category:Draft-Class_Astronomy_articles. Martin  (Smith609 – Talk)  05:08, 22 August 2018 (UTC)


 * that's not very useful though (Mainspace articles associated with draft talk? What'd be the use of that?). What would be useful is if it visited the Foobar:<...> pages associated with Foobar talk:<...>. Headbomb {t · c · p · b} 11:17, 22 August 2018 (UTC)


 * Ah, I'd missed the introduction of the "Draft" namespace. I see what you are trying to accomplish now.  .  Martin  (Smith609 – Talk)  09:18, 24 August 2018 (UTC)
 * so now it works as {Suggestion 2}? Headbomb {t · c · p · b} 11:09, 24 August 2018 (UTC)
 * Yep, should do. Let me know how you get on. Martin  (Smith609 – Talk)  12:46, 24 August 2018 (UTC)
 * works flawlessly. Save for all the other bugs, which are now much more efficiently triggered! Headbomb {t · c · p · b} 16:32, 24 August 2018 (UTC)

Better pubmed url recognition
https://en.wikipedia.org/w/index.php?search=in-source%3A%22https%3A%2F%2Fwww.ncbi.nlm.nih.gov%2Fsites%2Fentrez%2F%22&title=Special:Search&profile=default&fulltext=1&searchToken=k3ig3e0tgj7hn3p0ig9gmvkk Does not look worth it to me AManWithNoPlan (talk) 21:52, 21 August 2018 (UTC)


 * Better search: but you might be right here. Headbomb {t · c · p · b} 22:50, 21 August 2018 (UTC)
 * That link gives no hits. AManWithNoPlan (talk) 23:46, 21 August 2018 (UTC)


 * Better . Headbomb {t · c · p · b} 00:04, 22 August 2018 (UTC)


 * most of those do not link to and article AManWithNoPlan (talk) 03:01, 22 August 2018 (UTC)
 * All of those do?  = . &uid= is the pmid, see . Headbomb {t · c · p · b} 04:17, 22 August 2018 (UTC)
 * interesting.  The links are dead, but the pmid is in the link AManWithNoPlan (talk) 04:34, 22 August 2018 (UTC)


 * Well they're not dead for me. Headbomb {t · c · p · b} 15:10, 22 August 2018 (UTC)
 * Dead on the mobile site. Interesting.  That’s a good reason to recognize and fix.  AManWithNoPlan (talk) 15:18, 22 August 2018 (UTC)
 * ❌ Fixed them all by hand since only a couple were in citation templates. AManWithNoPlan (talk) 22:14, 26 August 2018 (UTC)

Caps: der der
See. Headbomb {t · c · p · b} 21:55, 26 August 2018 (UTC)
 * back to back der seems to not work right. AManWithNoPlan (talk) 22:13, 26 August 2018 (UTC)


 * Then don't bother fixing. der der is obviously a typo. Headbomb {t · c · p · b} 22:16, 26 August 2018 (UTC)

Category edit summary should mention who activated the bot when possible
E.g.

This was activated via https:// tools.wmflabs.org/citations/doibot.php?edit=toolbar&slow=1&user=Headbomb&cat=Particle_physics%20stubs

And the edit summary should reflect this. Possibly even deny category runs without a  value specified. Headbomb {t · c · p · b} 01:22, 23 August 2018 (UTC)


 * Working for me:

https://tools.wmflabs.org/citations/category.php?cat=X1&user=Smith609 generated https://en.wikipedia.org/w/index.php?title=User%3ADOI+bot%2FZandbox&diff=prev&oldid=856337766


 * Category.php will now show a note where the username is invalid or not specified.
 * As you're making more use of it I'll add a user-friendly interface to doibot.html in the future, to save manually editing URLs... Martin  (Smith609 – Talk)  14:27, 24 August 2018 (UTC)


 * I'm triggering it via User:Headbomb/citations.js, but that would be a great features for the average Wikipedian. Might want to implement this in Citation expander too, but I'd rather wait till we have a "run on list" (would be very useful in the Book: namespace, e.g. Book:Canada) and throttling before making that request. Headbomb {t · c · p · b} 14:34, 24 August 2018 (UTC)
 * Now you can prompt the bot to edit multiple pages by separating their names with a pipe, e.g. "page1|page2". Martin  (Smith609 – Talk)  19:59, 24 August 2018 (UTC)


 * Interesting, although I don't know that it's very user-friendly to do that. Couldn't easily run it on say, Book:Canada or User:Headbomb/Sandbox4. And I'm not sure it could handle having 2000 articles shoved in as . Headbomb {t · c · p · b} 20:58, 24 August 2018 (UTC)

Caps: della
. Headbomb {t · c · p · b} 00:27, 23 August 2018 (UTC)
 * AManWithNoPlan (talk) 23:46, 23 August 2018 (UTC)

More caps: um, von
https://github.com/ms609/citation-bot/pull/638 AManWithNoPlan (talk) 23:45, 23 August 2018 (UTC) https://github.com/ms609/citation-bot/pull/650 AManWithNoPlan (talk) 15:05, 24 August 2018 (UTC)
 * I have a similar one, the Dutch "en", which means "and", should not be capitalised to "En" either, as done here at the bottom: FunkMonk (talk) 04:13, 24 August 2018 (UTC)

Capitalising every word in journal names
This is more a question than a bug report, is it intentional that all words that are normally not capitalised, such as "the, as, in, an, of" etc., in journal names are now capitalised? And now it seems another bot is changing some of this back? FunkMonk (talk) 16:14, 23 August 2018 (UTC)


 * Journals should use title case, so if something doesn't use title case, that would be an issue. Converting to title case is fine and encouraged. Note that there was a bug in Citation bot that capitalized journals by mistake for a little while, that's fixed now. Headbomb {t · c · p · b} 16:18, 23 August 2018 (UTC)
 * Ok, that was what I was referring to, seems I had not completed the sentence above. FunkMonk (talk) 16:47, 23 August 2018 (UTC)

❌. Fixed already AManWithNoPlan (talk) 23:07, 23 August 2018 (UTC)

doibot.php incorrectly thinks nothing changed and avoids null edit
To be clear, I tried running Citation bot on this today, and it failed to update the caps. So I did it via JCW-CleanerBot instead. It just so happened that Citation bot made the last edit before JCW-CleanerBot. Headbomb {t · c · p · b} 23:57, 23 August 2018 (UTC)
 * how are launching the bot? It worked for me both master and dev version  AManWithNoPlan (talk) 00:17, 24 August 2018 (UTC)


 * Via . Test page is setup at User:Headbomb/sandbox4. Headbomb {t · c · p · b} 00:31, 24 August 2018 (UTC)


 * doibot.php incorrectly does not notice some changes.  Because nothing was reported as changed the bot avoided doing a null edit AManWithNoPlan (talk) 01:47, 24 August 2018 (UTC)
 * so what's the most bleeding edge / most advanced code / unstable code available to mortals? Headbomb {t · c · p · b} 01:50, 24 August 2018 (UTC)


 * the gadget api where you are editing the file works right.  Also if there had been more to do on the page then it would have done the modifications-AManWithNoPlan (talk) 01:56, 24 August 2018 (UTC)


 * Another one. It's really annoying to have to do them by hand when the bot could be fixed them. Headbomb {t · c · p · b} 00:05, 25 August 2018 (UTC)

https://github.com/ms609/citation-bot/pull/666 The pull of the beast AManWithNoPlan (talk) 01:21, 25 August 2018 (UTC)
 * \m/(-&#95;-)\m/ Headbomb {t · c · p · b} 01:21, 25 August 2018 (UTC)

Erroneously upgrades cite arxiv to cite journal
bibcodes typically denote journals. The bot now (PR 647) won't assume that a bibcode denotes a journal, if the bibcode contains the string "arxiv". Are there any other cases that we should watch out for? Martin  (Smith609 – Talk)  12:00, 24 August 2018 (UTC)


 * There are. I'll get a list. Headbomb {t · c · p · b} 12:04, 24 August 2018 (UTC)

being years. That should cover it. You could simplify it to,   and   for those with sub-arxivs. Actually no, that might lead to some collisions. Headbomb {t · c · p · b} 12:09, 24 August 2018 (UTC)
 * 1) arXiv
 * 2) gr.qc
 * 3) hep.ex
 * 4) hep.lat
 * 5) hep.ph
 * 6) hep.th
 * 7) math.ph
 * 8) math
 * 9) nucl.ex
 * 10) nucl.th
 * 11) physics
 * Thanks for the list. These additions . Martin  (Smith609 – Talk)  18:00, 24 August 2018 (UTC)

Feature request: Throttle rate on categories to comply with BOTPOL
WP:BOTPOL suggests that rate for non-critical tasks. While I'm not a stickler for rules, the potential for damage is relatively high, especially in several WP:BEANS scenario (it just edited at a rate of 25 EPM on a recent run!). So implementing a per-activation throttle would be best. Headbomb {t · c · p · b} 12:34, 24 August 2018 (UTC)


 * I've coded a throttle, but not tested it; I'd be grateful if you could keep an eye out and see whether you notice throttling in action! Martin  (Smith609 – Talk)  17:44, 24 August 2018 (UTC)


 * seems to work. I unleashed it on Category: CS1 maint: PMC format which had 15 very easy edits to make, and it cleared it at 6 EPM. The category API doesn't update during the run, but you do get the results after the run. Not sure if the throttle is 'smart' (edits at 6/min) or 'dumb' (processes at 6/min), but it's working. Headbomb {t · c · p · b} 18:13, 24 August 2018 (UTC)
 * Great. The script now keeps track of when it last edited, and makes sure that this was at least 10 seconds ago, which probably makes it 'semi-smart' (as if it spent the first 50 seconds of a minute without making an edit, it could squeeze five into the last ten seconds!) Martin  (Smith609 – Talk)  06:38, 25 August 2018 (UTC)

More caps: al, delle
See. Headbomb {t · c · p · b} 10:10, 26 August 2018 (UTC)

Remove archive when removing url
https://github.com/ms609/citation-bot/pull/632 AManWithNoPlan (talk) 23:59, 21 August 2018 (UTC)

No more capitalization
should not be capitalized (at least not in ), see https://en.wikipedia.org/w/index.php?title=Leptoconops&diff=prev&oldid=855945017 (t)  Josve05a  (c) 21:49, 21 August 2018 (UTC)
 * The Tower of Babel strikes again. AManWithNoPlan (talk) 22:01, 21 August 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/631 AManWithNoPlan (talk) 22:02, 21 August 2018 (UTC)

Forget more things
Forget JSTOR (or JSTOR) and registration if the bot forgets https://www.jstor.org/stable/29780032 in favor of jstor; see https://en.wikipedia.org/w/index.php?title=Sir_John_Ellerman,_1st_Baronet&diff=prev&oldid=855994139 <span style="background: turquoise;font-family: 'Segoe Script', 'Comic Sans MS';">(t) Josve05a  (c) 05:58, 22 August 2018 (UTC)


 * In general access-date / archive-date / archive-url / dead-url / format / registration / subscription / url-access / via can be all be omitted if there are no url. format is tricky though, since it's abused for a lot of things that should be in type instead. Headbomb {t · c · p · b} 13:13, 22 August 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/637  AManWithNoPlan (talk) 23:38, 23 August 2018 (UTC)

access icon parameter left behind after url removed
From this:

the bot produced this:

—Trappist the monk (talk) 13:39, 22 August 2018 (UTC)


 * See also User talk:Citation bot. Headbomb {t · c · p · b} 14:00, 22 August 2018 (UTC)

https://github.com/ms609/citation-bot/pull/637 AManWithNoPlan (talk) 23:38, 23 August 2018 (UTC)

Strip dot at end of volume/issue/page/pages
Could be generalized to any punctuation. Headbomb {t · c · p · b} 01:58, 24 August 2018 (UTC)

Drop more parameters when dropping url
https://github.com/ms609/citation-bot/pull/637 AManWithNoPlan (talk) 23:37, 23 August 2018 (UTC)

More raw url fixings (JSTOR)
The bot should be able to detect that https://www.jstor.org/stable/pdf/10.1525/j.ctt1ffjn5h.6.pdf is the same as https://www.jstor.org/stable/10.1525/j.ctt1ffjn5h.6 and be able to expand that. It failed to do so here https://en.wikipedia.org/w/index.php?title=Mughal%E2%80%93Safavid_War_(1649%E2%80%9353)&diff=prev&oldid=856169186 <span style="background: turquoise;font-family: 'Segoe Script', 'Comic Sans MS';">(t) Josve05a  (c) 11:24, 23 August 2018 (UTC)

See also. Headbomb {t · c · p · b} 11:41, 23 August 2018 (UTC)

Martin  (Smith609 – Talk)  19:03, 24 August 2018 (UTC)

Broken dois and removal of URLs
Maybe the bot shouldn't remove URLs with doi's in them, if the doi is broken (with a doi-broken-date). https://en.wikipedia.org/w/index.php?title=Israel&diff=prev&oldid=856185012

From to <span style="background: turquoise;font-family: 'Segoe Script', 'Comic Sans MS';">(t) Josve05a  (c) 13:03, 23 August 2018 (UTC)

Great report, thanks. Martin  (Smith609 – Talk)  18:38, 24 August 2018 (UTC)

UTF-8 encoding in JSTOR data not taken into account
The bot added this from JSTOR. It both looks weird, and the first1 doens't have a  to display the &#xd3; character (which should be used instead directly instead of using HTML code. |last1 = m&#38;#Xf3;rdha|first1 = Pilip &#xd3 https://en.wikipedia.org/w/index.php?title=Achaidh_Leithdeircc&diff=prev&oldid=856303097 <span style="background: turquoise;font-family: 'Segoe Script', 'Comic Sans MS';">(t)  Josve05a  (c) 08:32, 24 August 2018 (UTC)
 * See correct edit: https://en.wikipedia.org/w/index.php?title=Achaidh_Leithdeircc&diff=856304296&oldid=856303097 <span style="background: turquoise;font-family: 'Segoe Script', 'Comic Sans MS';">(t) Josve05a  (c) 08:34, 24 August 2018 (UTC)

Martin  (Smith609 – Talk)  18:22, 24 August 2018 (UTC)

Further simplify research gate URLs
The bot should simplify

to See, right before.
 * https://www.researchgate.net/profile/Qi-fu_Chen/publication/252679525_The_2008_Wenchuan_Earthquake_and_Earthquake_Prediction_in_China/
 * https://www.researchgate.net/publication/252679525

In general, the structure will be

Headbomb {t · c · p · b} 11:24, 26 August 2018 (UTC)

404 on doibot.php
The correct url ends in process_page.php AManWithNoPlan (talk) 17:27, 26 August 2018 (UTC)


 * There should perhaps be an 'internal redirect', since several scripts make use of it. Headbomb {t · c · p · b} 17:49, 26 August 2018 (UTC)
 * the official method has been fixed. Almost all of those have been broken for many years. AManWithNoPlan (talk) 17:53, 26 August 2018 (UTC)
 * the correct search is https://en.wikipedia.org/w/index.php?search=insource%3A%2Fdoibot%5C.php%2F+-insource%3ADOI_bot&title=Special:Search&profile=advanced&fulltext=1&ns2=1&searchToken=4x836wn6m7sw0hmzfe6bs818k  not the one that you posted.  I fixed both publicly editable pages, leaving a few peoples common.js files. AManWithNoPlan (talk) 18:02, 26 August 2018 (UTC)
 * When you put stuff in your our files such as User:Josve05a/citations.js, instead of just turning on the the citation bot option, you get what you get and you don't throw a fit. AManWithNoPlan (talk) 18:03, 26 August 2018 (UTC)
 * I'm not using that. I'm using User:Smith609/citations.js. <span style="background: turquoise;font-family: 'Segoe Script', 'Comic Sans MS';">(t) Josve05a  (c) 18:10, 26 August 2018 (UTC)
 * I've even forgotten I did even did that. It was due to the "official script" was broken for a few weeks/months back in 2014 if I can remember. It had been working ever since, so somehting recently changed. I didn't (mean to) throw a fit, I just wanted to draw attention to that the "old way" the script worked, just broke. <span style="background: turquoise;font-family: 'Segoe Script', 'Comic Sans MS';">(t) Josve05a  (c) 18:06, 26 August 2018 (UTC)


 * I have a copy of it for the dev version so, I had to fix that myself too. The "don't throw fit" is an american phrase used to warn kids in advance that complaints will not be listen too (my using is was mostly in jest). Here are some links to the official version  AManWithNoPlan (talk) 18:11, 26 August 2018 (UTC)

https://en.wikipedia.org/wiki/MediaWiki:Gadget-citations.js

https://en.wikipedia.org/wiki/MediaWiki_talk:Gadget-citations.js

https://en.wikipedia.org/wiki/Wikipedia:Citation_expander
 * I hope you understand I only want to help out (with my limited knowledge) by reporting issues which are affecting me or which I'm noticing, and not trying to complain. <span style="background: turquoise;font-family: 'Segoe Script', 'Comic Sans MS';">(t)  Josve05a  (c) 18:22, 26 August 2018 (UTC)
 * The Smith scripts are not really updated anymore since we got official support from media wiki. He should probably remove them actually.  AManWithNoPlan (talk) 18:12, 26 August 2018 (UTC)
 * Yeah, some are still using it. <span style="background: turquoise;font-family: 'Segoe Script', 'Comic Sans MS';">(t)  Josve05a  (c) 18:13, 26 August 2018 (UTC)


 * Thanks for the report. I've updated the Smith script, and will keep an eye out for other outdated links (I couldn't turn any others up by Google).  A redirect is probably a good idea too. Martin  (Smith609 – Talk)  08:43, 27 August 2018 (UTC)

No longer expands/looks for bibcodes?

 * And it even says "No need to repeat AdsAbs search for 1994ASAJ...96.1218T" 😏🤔🙄😂  AManWithNoPlan (talk) 22:24, 26 August 2018 (UTC)


 * Obviously! The bot knows it would just fail again! Headbomb {t · c · p · b} 22:45, 26 August 2018 (UTC)
 * This is partially fixed: Headbomb {t · c · p · b} 11:58, 27 August 2018 (UTC)


 * Now fully fixed. . Headbomb {t · c · p · b} 16:31, 27 August 2018 (UTC)

Google Books publisher
Please forget all variants of Books.google.com from (such as publisher, work, website, but also Google Books, Google, google.com etc. <span style="background: turquoise;font-family: 'Segoe Script', 'Comic Sans MS';">(t)  Josve05a  (c) 19:28, 22 August 2018 (UTC)
 * See https://en.wikipedia.org/w/index.php?title=Japanese_hip_hop&diff=prev&oldid=856085189 <span style="background: turquoise;font-family: 'Segoe Script', 'Comic Sans MS';">(t) Josve05a  (c) 19:28, 22 August 2018 (UTC)
 * Agreed! Headbomb {t · c · p · b} 19:48, 22 August 2018 (UTC)
 * I hate seeing publisher=Google. Copyright-Thief=Books.Google.Com is cool though. AManWithNoPlan (talk) 01:00, 23 August 2018 (UTC)
 * Is  permissible? Martin  (Smith609 – Talk)  07:45, 25 August 2018 (UTC)
 * It is, but it's ... a relatively spammy practice. I remove it, but I don't think it would be appropriate for the bot to remove/add it. Headbomb {t · c · p · b} 20:08, 26 August 2018 (UTC)


 * Conservatively addressed in https://github.com/ms609/citation-bot/pull/672; thoughts welcome before I commit... Martin  (Smith609 – Talk)  07:54, 25 August 2018 (UTC)

fixed

Bug archiving is off temporarily
Once Cluebot III is running again this page will get cleaned up.
 * notabug flagging so that this goes away when fixed. AManWithNoPlan (talk) 20:25, 24 August 2018 (UTC)
 * Was just wondering about that myself. Thanks for checking! Martin  (Smith609 – Talk)  20:19, 24 August 2018 (UTC)
 * still down. https://en.wikipedia.org/wiki/Special:Contributions/ClueBot_III   AManWithNoPlan (talk) 20:24, 24 August 2018 (UTC)
 * It is back online.  It may take a while to get to us. AManWithNoPlan (talk) 13:22, 27 August 2018 (UTC)

in cite journal, convert work/magazine/website to journal, in cite magazine, convert work/journal/website to magazine
We don't handle Cite Magazine. the bot'll now rename magazine to journal in cite journal. Martin  (Smith609 – Talk)  07:31, 25 August 2018 (UTC)

Gadget API is down--fix submitted
https://github.com/ms609/citation-bot/pull/676 AManWithNoPlan (talk) 19:45, 26 August 2018 (UTC)

fixed

WTF citation change
Weird. Must be new multi-template code since it works fine on a single template AManWithNoPlan (talk) 02:36, 27 August 2018 (UTC)
 * I cannot reproduce it. Double weird. AManWithNoPlan (talk) 03:54, 27 August 2018 (UTC)


 * Was caused by an invisible NSBP, which WP:WikEd exposed when I edited the page. Removing it fixed the cause of the issue locally, but  seems to fix the issue being triggered in the first place everywhere. Headbomb {t · c · p · b} 11:54, 27 August 2018 (UTC)

Adds year when there is date
When it finds an arXiv year it adds it. AManWithNoPlan (talk) 02:35, 27 August 2018 (UTC)
 * Dupe of report above User talk:Citation bot/Archive_9. <span style="background: turquoise;font-family: 'Segoe Script', 'Comic Sans MS';">(t) Josve05a  (c) 11:57, 27 August 2018 (UTC)

Upgrade journal names

 * This may need some research to see if there are exceptions. <span style="background: turquoise;font-family: 'Segoe Script', 'Comic Sans MS';">(t) Josve05a  (c) 13:14, 27 August 2018 (UTC)
 * I have always felt this would be good idea, but J Food is probably not Journal Food, but journal of Food or the journal of food.  Every journal would be a special case.  AManWithNoPlan (talk) 13:19, 27 August 2018 (UTC)
 * If the  is of the end of a journal and the word   (or magazine) is not present, then it sounds to me a good bet that it should be replaced with Foo Journal. <span style="background: turquoise;font-family: 'Segoe Script', 'Comic Sans MS';">(t)  Josve05a  (c) 13:26, 27 August 2018 (UTC)


 * Very, very bad idea, per WP:CONTEXTBOT, and per lack of consensus. If you want this to be done on a specific page, delete the abbreviations and run the bot again. Headbomb {t · c · p · b} 13:34, 27 August 2018 (UTC)
 * that’s what I do.  Many people would consider this upgrade to be ‘worse than vandalism’. I consider them wrong, but there would blood everywhere  AManWithNoPlan (talk) 13:52, 27 August 2018 (UTC)
 * (We could start a (small) list, and maintain it, and adding new journals one by one...but that's way much work) <span style="background: turquoise;font-family: 'Segoe Script', 'Comic Sans MS';">(t) Josve05a  (c) 13:53, 27 August 2018 (UTC)
 * The list or journal is huuuge, and you may run awry of WP:CITEVAR by messing consistently abbreviated journals in an article to a mish-mash of abbreviated-and-not-abbreviated journals in the same article. But if you get consensus for something like this, this would be better addressed by a different, possibly new bot. Headbomb {t · c · p · b} 14:10, 27 August 2018 (UTC)
 * Yes, a separate bot with separate approval. One page of php code.  300 pages of journal names. AManWithNoPlan (talk) 15:00, 27 August 2018 (UTC)

Support new parameter bot-deny
Note, the parameter doesn't yet exist, but I've made a request for it. Coding support for it should be rather trivial though. Headbomb {t · c · p · b} 15:40, 21 August 2018 (UTC)


 * No need for this, I found a better solution: User:Citation bot/use. Headbomb {t · c · p · b} 03:33, 29 August 2018 (UTC)

Journal caps

 * There is no consensus for messing around with valid abbreviations. Fixing caps / title case is fine, but converting abbreviations to non-abbreviations is not, at least not without a strong consensus to do so. Headbomb {t · c · p · b} 13:31, 27 August 2018 (UTC)
 * In that case, it should keep caps. <span style="background: turquoise;font-family: 'Segoe Script', 'Comic Sans MS';">(t) Josve05a  (c) 13:39, 27 August 2018 (UTC)

Add support for non-standard jstor stable IDs
That has actually been debated and the belief at the time was that DOI was better than JSTOR and if they were exactly the same, then only list the DOI. This DOI is not owned by JSTOR, so it is not truly stable. JSTOR owns 10.2307 AManWithNoPlan (talk) 23:20, 27 August 2018 (UTC) notabug

Bibcodes still not reliably expanded
The bot does sometimes get throttled by databases. Also, sometimes they just fail. AManWithNoPlan (talk) 14:51, 29 August 2018 (UTC)

Notice: updated user page
I updated the User:Citation bot user page to be prettier / more user friendly. Feel free to tweak. I also gave a major revamp the User:Citation bot/use guide. Please review/tweak, you can discuss the revamped guide at User talk:Citation bot/use. Headbomb {t · c · p · b} 17:17, 25 August 2018 (UTC)
 * Great! Thanks! Martin  (Smith609 – Talk)  09:00, 27 August 2018 (UTC)
 * after a while just change that talk page to a redirect to here AManWithNoPlan (talk) 22:14, 27 August 2018 (UTC)

fixeded it up very nice!!! AManWithNoPlan (talk) 16:11, 31 August 2018 (UTC)

Both converts Template:Ndash to hyphen... and then to ndash

 * it has a page number without a - character so the bot assumes a single page number and thus when it gets a range of pages it upgrades to the range. Pages are one of the few things we might blow away and replace.  AManWithNoPlan (talk) 04:45, 25 August 2018 (UTC)

Don't overwrite comments

 * The bot has been overwriting comments for a long time. AManWithNoPlan (talk) 15:12, 21 August 2018 (UTC)
 * Oh dear; I wonder how much dud information has been introduced as a result! Even if there is a better way to deal with false positives, I'm not sure how we might make a transition now that a standard has been set...  Martin  (Smith609 – Talk)  15:20, 21 August 2018 (UTC)

https://github.com/ms609/citation-bot/pull/630 page numbers AManWithNoPlan (talk) 20:44, 21 August 2018 (UTC)

https://github.com/ms609/citation-bot/pull/629 series and ISSN AManWithNoPlan (talk) 20:44, 21 August 2018 (UTC)

I think that once these two pulls are in, the code should respect comments again. AManWithNoPlan (talk) 23:59, 21 August 2018 (UTC)


 * Still does it btw . Headbomb {t · c · p · b} 16:37, 24 August 2018 (UTC)
 * of course it does. The code has not been changed yet AManWithNoPlan (talk) 17:12, 24 August 2018 (UTC)


 * Well it's pretty hard to tell what's in the bot and what isn't. Headbomb {t · c · p · b} 17:15, 24 August 2018 (UTC)

More than one of |at= and |pages= specified
Why have one page keyword when you can have five! (“At” is the only one that is special) https://github.com/ms609/citation-bot/pull/692 AManWithNoPlan (talk) 19:03, 29 August 2018 (UTC)

Fails to cleanup CiteSeerX url
https://github.com/ms609/citation-bot/pull/691 AManWithNoPlan (talk) 16:34, 28 August 2018 (UTC)

Removes periods in abbreviations when at the end of a title
Duplicate of User_talk:Citation_bot. Headbomb {t · c · p · b} 23:24, 31 August 2018 (UTC)