User talk:Citation bot/Archive 12

Request: Strip commas at the end of parameters
This should apply to all parameters. There's no reason for any parameter to end with a comma. Headbomb {t · c · p · b} 23:43, 22 August 2018 (UTC)
 * Except perhaps ? Martin  (Smith609 – Talk)  07:45, 25 August 2018 (UTC)
 * Yes, that would be the exception. Headbomb {t · c · p · b} 13:31, 25 August 2018 (UTC)
 * AManWithNoPlan (talk) 16:46, 1 November 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/1011 AManWithNoPlan (talk) 16:51, 1 November 2018 (UTC)
 * Presumably you mean parameter values. And Martin's special case would ,, where the comma is the value, not some trailing cruft. &diams; J. Johnson (JJ) (talk) 20:32, 2 November 2018 (UTC)
 * cs1|2 does not have or support author-sep, author-name-separator, or separator.
 * Those parameters were deprecated and removed when mode was instituted.
 * One might legitimately set ,:
 * —Trappist the monk (talk) 20:43, 2 November 2018 (UTC)
 * Those parameters were deprecated and removed when mode was instituted.
 * One might legitimately set ,:
 * —Trappist the monk (talk) 20:43, 2 November 2018 (UTC)
 * —Trappist the monk (talk) 20:43, 2 November 2018 (UTC)
 * —Trappist the monk (talk) 20:43, 2 November 2018 (UTC)
 * —Trappist the monk (talk) 20:43, 2 November 2018 (UTC)

https://github.com/ms609/citation-bot/pull/1020 AManWithNoPlan (talk) 20:59, 2 November 2018 (UTC)

Request: chapter-format
https://github.com/ms609/citation-bot/pull/1017 AManWithNoPlan (talk) 14:06, 2 November 2018 (UTC)

Bug: Spaces in ref-tags
https://github.com/ms609/citation-bot/pull/1018 AManWithNoPlan (talk) 17:00, 2 November 2018 (UTC)

Interesting new ability for Citoid
T124610 or rather T198567 might be of interest for the bot as well. (t) Josve05a  (c) 16:14, 2 November 2018 (UTC)
 * Please keep an eye on it.  When it gets really reliable; then we might add it.  For now notabug.  AManWithNoPlan (talk) 14:10, 5 November 2018 (UTC)

PMID low numers

 * In https://en.wikipedia.org/w/index.php?title=User%3AJosve05a%2Fcite-sandbox&diff=prev&oldid=867085176 the bot did not add 1 or 11442. (t) Josve05a  (c) 15:20, 3 November 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/1021 AManWithNoPlan (talk) 18:52, 3 November 2018 (UTC)

Request: Expand doi citations in cite news as well
https://github.com/ms609/citation-bot/pull/1019 AManWithNoPlan (talk) 17:09, 2 November 2018 (UTC)

Do not override specific page with page range
https://github.com/ms609/citation-bot/pull/1026 AManWithNoPlan (talk) 15:36, 5 November 2018 (UTC)

Remove doi.org if adding doi parameter
we only remove the URL if the doi is in CrossRef. Probably should make an execption for doi.org. AManWithNoPlan (talk) 14:29, 5 November 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/1025 AManWithNoPlan (talk) 15:33, 5 November 2018 (UTC)

Emergency blacklist
zenodo.org has been blacklisted on English Wikipedia for copyright reasons (at least for now). Please disable the addition of it (and allow other edits to be made; the bot currently fails on Radon). See Special:PermanentLink/867438103. (Courtesy ping and  ) (t)  Josve05a  (c) 21:42, 5 November 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/1029 (t) Josve05a  (c) 21:51, 5 November 2018 (UTC)
 * and yet researchgate is cool. AManWithNoPlan (talk) 22:20, 5 November 2018 (UTC)
 * i am glad we currently clean up those urls, so we convert pdf links to landing pages. AManWithNoPlan (talk) 22:22, 5 November 2018 (UTC)
 * Well, even cleaning up https://zenodo.org/record/1000677/files/article.pdf to https://zenodo.org/record/1000677 is blacklisted (/me mumbles something angrily) (t) Josve05a  (c) 22:34, 5 November 2018 (UTC)
 * Yes. A friend cannot upload her papers to ResearchGate but can upload them to Zenodo. I think that may be telling. Guy (Help!) 23:40, 5 November 2018 (UTC)
 * Blocked and fixed. Also, a second pull is in place to turn it back off, if that is possible.  You are correct, it is one thing to violate your own papers' copyright; but it is another thing to violate everyone's papers copyrights. AManWithNoPlan (talk) 18:25, 6 November 2018 (UTC)

Bug: Publisher weirdness

 * Did you verify that there are not whites that got changed? i cannot look right now. AManWithNoPlan (talk) 15:44, 13 October 2018 (UTC)


 * I'm not sure what 'changing whites' would be here, but it did something similar in the previous edit, where normally it removes publisher in cite journals. Headbomb {t · c · p · b} 18:14, 13 October 2018 (UTC)
 * If it does than that's another bug. It shouldn't remove the publisher parameter in cite journal templates unless the publisher value would be the same as the journal value. (And, actually, for optimal meta data it shouldn't even remove it then for as long as it is correct, so that both meta data entries journal and publisher can be populated. Instead, seemingly duplicate values should be detected in the cite template and one of the values suppressed in the output, but not in meta data.)
 * --Matthiaspaul (talk) 11:44, 15 October 2018 (UTC)
 * i got auto corrected.  whitespaces not whites. AManWithNoPlan (talk) 22:37, 13 October 2018 (UTC)
 * it drops publisher then google books adds it back AManWithNoPlan (talk) 03:55, 14 October 2018 (UTC)
 * I just saw it remove a publisher from a "journal" that is really a newsletter whose publisher should not have been removed: Special:Diff/866664956. For major well-established academic journals, removal of publisher may be a good thing, but blindly doing it to all journal citations is not. Citation bot absolutely should not be making this kind of decision, and should not even be suggesting it to human editors (as they too-often fail to exercise any judgement of their own). —David Eppstein (talk) 21:02, 31 October 2018 (UTC)

cite magazine seems like the proper template is what i am hearing from you. AManWithNoPlan (talk) 21:32, 31 October 2018 (UTC)
 * This should stop the drop/add cycle https://github.com/ms609/citation-bot/pull/1024 AManWithNoPlan (talk) 15:27, 5 November 2018 (UTC)

Adding publishers to journal
https://github.com/ms609/citation-bot/pull/1024 AManWithNoPlan (talk) 15:27, 5 November 2018 (UTC)

Bug: bot has to be run twice
https://github.com/ms609/citation-bot/pull/1028 AManWithNoPlan (talk) 15:48, 5 November 2018 (UTC)

Better europepmc.org support
Try running the bot on. You would expect to get something like: but instead you will get

Completely missing the authors and IDs. (t) Josve05a  (c) 23:49, 4 November 2018 (UTC)


 * https://github.com/ms609/citation-bot/pull/1027 AManWithNoPlan (talk) 15:40, 5 November 2018 (UTC)
 * fixed

CAPS-insensitive URLs
https://github.com/ms609/citation-bot/pull/1030 AManWithNoPlan (talk) 23:21, 5 November 2018 (UTC)

Remove Pubmed URL without www.
https://github.com/ms609/citation-bot/pull/1030/files AManWithNoPlan (talk) 04:23, 6 November 2018 (UTC)

Do not add URL if PMC exists
https://github.com/ms609/citation-bot/pull/1035 AManWithNoPlan (talk) 13:56, 6 November 2018 (UTC)

Request: The New York Times
https://github.com/ms609/citation-bot/pull/1043 AManWithNoPlan (talk) 23:56, 7 November 2018 (UTC)

Reuters
https://github.com/ms609/citation-bot/pull/1044 AManWithNoPlan (talk) 00:03, 8 November 2018 (UTC)

Bug: Failed to remove doi.org URL
If running the bot again, it removes it. Perhaps a "run bot muliple times, until no changes is attemeted, before saving the edit" rule should be implemented. (t) Josve05a  (c) 22:01, 8 November 2018 (UTC)
 * That is a horrible idea. Although I too have considered it. AManWithNoPlan (talk) 23:06, 8 November 2018 (UTC)
 * Yeah (hence it being in small). I can just imagine the bot edit warring with it self back-and-forth...however, it logically feels as if "all possible edits should be made" before saving the change. (t) Josve05a  (c) 23:09, 8 November 2018 (UTC)
 * And the bot gets banned from database access for repeats and edits take two to three times longer..... AManWithNoPlan (talk) 23:11, 8 November 2018 (UTC)
 * And during periods of high use, big edits fail since the bot is too busy double checking itself.... AManWithNoPlan (talk) 23:12, 8 November 2018 (UTC)
 * A (short) time-out for "second round" could be added, or only do it for "small" articles (i.e. if running it manually on a short section), or only run twice if there is not high-use (if that could be "tracked"). Not advocating this be implemented here, though. The issue at hand can (hopefully) be patched this time. Just a thought.(t) Josve05a  (c) 23:15, 8 November 2018 (UTC)

https://github.com/ms609/citation-bot/pull/1049 AManWithNoPlan (talk) 23:53, 8 November 2018 (UTC)

Request: More caps
https://github.com/ms609/citation-bot/pull/1042 AManWithNoPlan (talk) 23:13, 8 November 2018 (UTC)

Bug: DOI broken, or not broken - that is the question
Impossible to know, since I do not know what your sent to the bot. AManWithNoPlan (talk) 14:59, 10 November 2018 (UTC)
 * notabug probably is one time fluke AManWithNoPlan (talk) 19:01, 10 November 2018 (UTC)

Bug: Unlinking
drops one them adds other. weird.AManWithNoPlan (talk) 04:39, 31 October 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/1050 AManWithNoPlan (talk) 05:21, 10 November 2018 (UTC)

Encyclopedia
Follow-up from User talk:Citation bot/Archive 11

CAPS: Oecd
OECD should always be capitalized. I've seen it both in last1 and publisher. adds Oecd (t) Josve05a  (c) 10:47, 12 November 2018 (UTC)


 * https://github.com/ms609/citation-bot/pull/1053 AManWithNoPlan (talk) 16:31, 12 November 2018 (UTC)

fixed

Ultra slow
It has been that way for about a day. Sometimes you will get lucky and get a 500 error instead of timeout. AManWithNoPlan (talk) 16:56, 13 November 2018 (UTC)


 * Yup. Either, this is very annoying. Headbomb {t · c · p · b} 17:04, 13 November 2018 (UTC)


 * Although frustrating, these very slow runs do often perform the requested edits even if they never return to display a result. Lithopsian (talk) 20:19, 13 November 2018 (UTC)

fixed

Request: Single quotes misused as arrows should be unchanged
https://github.com/ms609/citation-bot/pull/1059 AManWithNoPlan (talk) 16:27, 14 November 2018 (UTC)

Similar names in newspaper/via/publisher are redundant

 * This could be generalized to anything that differs only by a leading 'the'. Headbomb {t · c · p · b} 16:25, 14 November 2018 (UTC)
 * Good idea The Headbomb. I might want to create a case-intensive str_is_basically_the_same function.  AManWithNoPlan (talk) 16:32, 14 November 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/1066 AManWithNoPlan (talk) 21:12, 14 November 2018 (UTC)

Bug: doi.library.ubc.ca is a DOI resolver, not a website
https://github.com/ms609/citation-bot/pull/1058 AManWithNoPlan (talk) 16:08, 14 November 2018 (UTC)
 * and https://github.com/ms609/citation-bot/pull/1072 AManWithNoPlan (talk) 18:16, 16 November 2018 (UTC)

Bug: Google Maps publisher is Google


https://github.com/ms609/citation-bot/pull/1070 AManWithNoPlan (talk) 16:19, 15 November 2018 (UTC)

Series/Title
https://github.com/ms609/citation-bot/pull/1073 (at this moment there is a backlog of 18 pull requests, since I am working half-days and the bot operator is not). AManWithNoPlan (talk) 18:22, 15 November 2018 (UTC)

similar data in title/journal/publisher/etc
Will this fix this as well? (t) Josve05a  (c) 00:34, 16 November 2018 (UTC)


 * That can't be fixed by the bot, no. Headbomb {t · c · p · b} 02:19, 16 November 2018 (UTC)
 * wontfix AManWithNoPlan (talk) 17:29, 16 November 2018 (UTC)

Change cite book to Cite book
Huh? &diams; J. Johnson (JJ) (talk) 20:32, 15 November 2018 (UTC)
 * Obviously the type of template gets changed more than once as the bot does its thing. I think we can fix. AManWithNoPlan (talk) 20:50, 15 November 2018 (UTC)
 * Also added journal to a . That sounds odd...(t) Josve05a  (c) 21:00, 15 November 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/1074 AManWithNoPlan (talk) 21:30, 15 November 2018 (UTC)

Request: Minimal cite conference support (doi url)
https://github.com/ms609/citation-bot/pull/1075 AManWithNoPlan (talk) 21:33, 15 November 2018 (UTC)

wrong message
In: https://en.wikipedia.org/w/index.php?title=Quarter-inch_cartridge&curid=895102&diff=869312464&oldid=866498559 it changed from ISBN-10 to ISBN-13, which is fine, but the message says that it removed the access-date. That seems a different question, so should have appropriate message. Gah4 (talk) 21:23, 17 November 2018 (UTC)
 * The full message is "Alter: isbn. Removed accessdate with no specified URL". It covers both, but admittedly the amount of texted changed appears to be inversely proportional to the length of the message text. AManWithNoPlan (talk) 22:22, 17 November 2018 (UTC)
 * notabug AManWithNoPlan (talk) 19:34, 18 November 2018 (UTC)

Weird doi and hdl formats which works
What should be done with

The links works, but it is not allowed formats, and the bot does not expand from them. (t) Josve05a  (c) 21:15, 19 November 2018 (UTC)


 * Probably nothing? Let users fix the errors themselves? Headbomb {t · c · p · b} 21:51, 19 November 2018 (UTC)
 * The valid identifiers, btw, are and . Headbomb {t · c · p · b} 21:52, 19 November 2018 (UTC)
 * The doi link works since dx.doi.org will resolve non-doi hdl. AManWithNoPlan (talk) 22:55, 19 November 2018 (UTC)


 * Maybe so, but it's still not a valid DOI. Headbomb {t · c · p · b} 23:02, 19 November 2018 (UTC)
 * What should be done is that the DOI should be fixed by a human to conform with the DOI specifications. DOI.org is under no obligation to support non-conforming DOI values, and they could remove their de facto support at any time. – Jonesey95 (talk) 05:42, 20 November 2018 (UTC)

wontfix and will leave for humans to fix. AManWithNoPlan (talk) 21:30, 20 November 2018 (UTC)

Bug: SSRN Electronic Journal
yeah. that accidently got fixed in a case-sensitive way. waiting for another pull. AManWithNoPlan (talk) 13:24, 20 November 2018 (UTC)
 * fixed AManWithNoPlan (talk) 15:29, 20 November 2018 (UTC)

Request: clean up google search so-called references
While I'm not sure why a citation to Google Search should ever appear in an article, they do quite a lot. It would be good if the bot would remove unnecessary parameters for such URLs as well, as it does with Google Books.

Unnecessary parameters in https://www.google.com/search?q=%22institute+for+sustainable+weight+loss%22&oq=%22institute+for+sustainable+weight+loss%22&aqs=chrome..69i57j69i59.14823j0j7&sourceid=chrome&ie=UTF-8
 * Assisted Query Stats - used for logging purposes only
 * Where the search originated from - used for logging purposes only
 * input encoding; default is UTF-8

Unnecessary parameters in http://www.google.com/search?hl=en&safe=off&client=firefox-a&rls=com.ubuntu%3Aen-US%3Aunofficial&q=%22west+coast+hotel+co.+v.+parrish%22+(site%3Anewsweek.com+OR+site%3Apost-gazette.com+OR+site%3Ausatoday.com+OR+site%3Awashingtonpost.com+OR+site%3Atime.com+OR+site%3Areuters.com+OR+site%3Aeconomist.com+OR+site%3Amiamiherald.com+OR+site%3Alatimes.com+OR+site%3Asfgate.com+OR+site%3Achicagotribune.com+OR+site%3Anytimes.com+OR+site%3Awsj.com+OR+site%3Ausnews.com+OR+site%3Amsnbc.com+OR+site%3Anj.com+OR+site%3Atheatlantic.com)&aq=o&oq=&aqi=
 * Where the search originated from - used for logging purposes only
 * (Not sure if we want to keep this and point to the English version specifically of the search result or not)
 * version of the client - used for loggig purposes
 * Original query - i.e. previous search query
 * Original query - i.e. previous search query

Unnecessary parameters in https://www.google.com/search?q=roosevelt+in+hoxsey%27s+plane (not mentioned earlier): (t) Josve05a  (c) 19:37, 7 October 2018 (UTC)
 * On VERY rare occasions they are valid (example: the term xyz is more popular/common than zyx on the Internet). Almost all the time, it would be more honest to just say AManWithNoPlan (talk) 20:01, 7 October 2018 (UTC)
 * While I don't disagree with you (at all), I still feel we (read: the bot) should act as if they are all valid, and clean them, and hope that someone else comes along and finds (any) better references. (t) Josve05a  (c) 20:05, 7 October 2018 (UTC)

aqs=chrome..69i57j69i59.14823j0j7 Assisted Query Stats - used for logging purposes only sourceid=chrome Where the search originated from - used for logging purposes only ie=UTF-8 input encoding; default is UTF-8

Unnecessary parameters in http://www.google.com/search?hl=en&safe=off&client=firefox-a&rls=com.ubuntu%3Aen-US%3Aunofficial&q=%22west+coast+hotel+co.+v.+parrish%22+(site%3Anewsweek.com+OR+site%3Apost-gazette.com+OR+site%3Ausatoday.com+OR+site%3Awashingtonpost.com+OR+site%3Atime.com+OR+site%3Areuters.com+OR+site%3Aeconomist.com+OR+site%3Amiamiherald.com+OR+site%3Alatimes.com+OR+site%3Asfgate.com+OR+site%3Achicagotribune.com+OR+site%3Anytimes.com+OR+site%3Awsj.com+OR+site%3Ausnews.com+OR+site%3Amsnbc.com+OR+site%3Anj.com+OR+site%3Atheatlantic.com)&aq=o&oq=&aqi=


 * So, these are the ones to lose client (assume language= is actually important)

rls= aq= oq= aqi= tbm= sa= ved= biw= bih= aqs= sourceid= client= ie=UTF-8 (Since this is default, but keep any other)
 * AManWithNoPlan (talk) 02:24, 20 October 2018 (UTC)


 * https://github.com/ms609/citation-bot/pull/1079 and https://github.com/ms609/citation-bot/pull/1080 AManWithNoPlan (talk) 05:09, 16 November 2018 (UTC)
 * fixed AManWithNoPlan (talk) 17:18, 21 November 2018 (UTC)

Bad Data detection: SSRN Electronic Journal
https://github.com/ms609/citation-bot/pull/1057 AManWithNoPlan (talk) 15:10, 14 November 2018 (UTC)
 * And https://github.com/ms609/citation-bot/pull/1081 AManWithNoPlan (talk) 19:37, 16 November 2018 (UTC)

existing wrong information is not fixed
Please see. It seems extremely improbable that the issue number and page number would both be 061102. JRSpriggs (talk) 20:59, 21 November 2018 (UTC)
 * notabug the bot generates perfect output and leaves user input fields alone. AManWithNoPlan (talk) 21:22, 21 November 2018 (UTC)

Removal of reference link not justified
The style guides are very clear on not including publishers for Journals. 99% of the time the pdf links to publisher pdfs do not work, and even when they do, they often do not last for long. Anyway, it adds nothing that the doi already provides. AManWithNoPlan (talk) 16:30, 25 November 2018 (UTC)
 * the correct publisher is tandy anyway. As usual, it was wrong. AManWithNoPlan (talk) 16:40, 25 November 2018 (UTC)
 * wontfix

Citing magazine
I do not think this fixable, since the only way is to maintain a list of 10,000 magazines. Also, the template are actually exactly the same. AManWithNoPlan (talk) 15:40, 24 November 2018 (UTC)


 * Why did it convert from cite web to cite journal? -- Green  C  15:47, 24 November 2018 (UTC)
 * They are not exactly the same. The rendering of issue and number differs, and you cannot set none in cite magazine (there may be other differences). --Izno (talk) 18:41, 24 November 2018 (UTC)
 * That's news to me. I see that this is fairly new change.  AManWithNoPlan (talk) 19:09, 24 November 2018 (UTC)
 * This pull will help a lot https://github.com/ms609/citation-bot/pull/1104 AManWithNoPlan (talk) 03:33, 27 November 2018 (UTC)
 * Much better and mostly fixed

Publisher
All style guides reject including that information for journals. Also, it is often incorrect. The bot has been doing this for over a decade, so I am sure there are other examples. AManWithNoPlan (talk) 15:34, 27 November 2018 (UTC)
 * Flagging as notabug until debate is over and this is finalized once and for all. AManWithNoPlan (talk) 17:27, 27 November 2018 (UTC)

CAPS: A.I.E.E.
https://github.com/ms609/citation-bot/pull/1060 AManWithNoPlan (talk) 16:46, 14 November 2018 (UTC)

"Om" vs "om"
Example Meddelelser om Grønland (t) Josve05a  (c) 16:59, 14 November 2018 (UTC)
 * I have have to think about Om vs. om. For now, adding journal titles is good first step.  https://github.com/ms609/citation-bot/pull/1060 AManWithNoPlan (talk) 17:03, 14 November 2018 (UTC)
 * Cf. "A" used uppercase for initialism in . Nemo 07:48, 15 November 2018 (UTC)

https://github.com/ms609/citation-bot/pull/1060 AManWithNoPlan (talk) 16:34, 15 November 2018 (UTC)

Request: JMIR mHealth and uHealth
The Feedback from the Maintainers is that publishers need to be less self-important. 😀😬😜😂 AManWithNoPlan (talk) 23:56, 23 November 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/1060 AManWithNoPlan (talk) 00:00, 24 November 2018 (UTC)

More caps: PeerJ
Added to https://github.com/ms609/citation-bot/pull/1060 AManWithNoPlan (talk) 19:34, 25 November 2018 (UTC)

Request: Bot adds - and . as names
Authors: - -. is and interesting author. AManWithNoPlan (talk) 14:52, 26 November 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/1103 AManWithNoPlan (talk) 01:20, 27 November 2018 (UTC)
 * and now https://github.com/ms609/citation-bot/pull/1107 AManWithNoPlan (talk) 16:40, 28 November 2018 (UTC)

Fails to remove PMC url
https://github.com/ms609/citation-bot/pull/1106 AManWithNoPlan (talk) 16:40, 28 November 2018 (UTC)

Bug: JSTOR API is a cookie monster and wants his cookies
What gives? Temporary issues, or a bug somewhere? <span style="background: turquoise;font-family: 'Segoe Script', 'Comic Sans MS';">(t) Josve05a  (c) 23:27, 5 November 2018 (UTC)
 * About seven hours ago the test suite suddenly stopped working. AManWithNoPlan (talk) 23:29, 5 November 2018 (UTC)
 * We are blocked. AManWithNoPlan (talk) 23:57, 5 November 2018 (UTC)

Request: Better UvA-DARE (Digital Academic Repository) support
Is there an API or a way to "find this"? Or is it too much work? <span style="background: turquoise;font-family: 'Segoe Script', 'Comic Sans MS';">(t) Josve05a  (c) 13:10, 22 November 2018 (UTC)
 * Many users prefer direct links to PDF files rather than records (although librarians and website owners prefer links to HTML pages so that they can track the users more easily). That said, this repository attempts to provide the handle in its HTML metadata, but is misconfigured: <meta name="DC.identifier" content="http://hdl.handle.net11245/1.345005"> (slash missing). I suggest to warn the repository administrators. Their records on BASE are also all broken, some OAI-PMH fixes are in order. Nemo 14:47, 22 November 2018 (UTC)

wontfix at this time. AManWithNoPlan (talk) 22:38, 7 December 2018 (UTC)

Slow
Also not working for me. I asked it to check The Bill, so far 25 minutes and it's done nothing.-- 5 albert square (talk) 13:43, December 2018 (UTC)

wontfix shared server and sadly when it gets slow people often just start trying again and again thus making it worse (similar to shooting someone because they are bleeding and hoping it will help) AManWithNoPlan (talk) 15:32, 7 December 2018 (UTC)


 * Would displaying an error message of some kind be possible here? Something like " is at capacity, try again in <ammount of time depending on server load>"? Headbomb {t · c · p · b} 23:38, 7 December 2018 (UTC)
 * I have an idea. AManWithNoPlan (talk) 00:34, 8 December 2018 (UTC)

Bug: Do not add OA link if adding free ID
This is not a regression. The URL is added before the PMC is present. Will have to think about this. Perhaps move adding Open URL to the end would be best. AManWithNoPlan (talk) 16:46, 23 November 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/1099 AManWithNoPlan (talk) 22:49, 23 November 2018 (UTC)

Added additional alias of "work" paramter
very rare. fix https://github.com/ms609/citation-bot/pull/1108 AManWithNoPlan (talk) 04:02, 30 November 2018 (UTC)

Sackur–Tetrode equation
On November 19th, you removed two wikilinks from Sackur–Tetrode equation. Both wikilinks seem to be useful; so I restored them. To me, the removal of the wikilinks indicates a bug. 81.153.242.15 (talk) 15:41, 30 November 2018 (UTC)
 * removal of partial wikilinks is not a bug. you need to wikilink the entire journal name or it will be removed by the bot. AManWithNoPlan (talk) 16:27, 30 November 2018 (UTC)

Weird Citation bot bug
https://github.com/ms609/citation-bot/pull/1109 once implemented this will stop bot from trying to expand this website. AManWithNoPlan (talk) 18:27, 1 December 2018 (UTC)

Non-names
https://github.com/ms609/citation-bot/pull/1110 AManWithNoPlan (talk) 15:57, 6 December 2018 (UTC)

Bug: doi's with plus signs
In this edit: <span style="background: turquoise;font-family: 'Segoe Script', 'Comic Sans MS';">(t) Josve05a  (c) 09:30, 12 November 2018 (UTC)
 * Removed  in the 10.1002/1097-0142(19840201)53:3+<815::AID-CNCR2820531334>3.0.CO;2-U
 * Regression of https://github.com/ms609/citation-bot/pull/993


 * it is interesting that Wiley cannot handle the doi either. plus signs are a horrible choice. AManWithNoPlan (talk) 19:06, 12 November 2018 (UTC)
 * Anyway to get the cite template to enclode the url better so Wiley can resolve it, or is this up to crossref/Wiley to fix? <span style="background: turquoise;font-family: 'Segoe Script', 'Comic Sans MS';">(t) Josve05a  (c) 22:56, 15 November 2018 (UTC)
 * waiting for bot to come alive to debug AManWithNoPlan (talk) 03:21, 13 November 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/1054 AManWithNoPlan (talk) 16:47, 14 November 2018 (UTC)
 * Not only conveting existing doi's, but also adding bad doi's :/ <span style="background: turquoise;font-family: 'Segoe Script', 'Comic Sans MS';">(t) Josve05a  (c) 22:53, 15 November 2018 (UTC)
 * That is no surprise. AManWithNoPlan (talk) 23:30, 15 November 2018 (UTC)
 * No, but still sad. A bit surprised though that it didn't add doi-broken-date, but I guess it tests if broken before parsing what to write. <span style="background: turquoise;font-family: 'Segoe Script', 'Comic Sans MS';">(t) Josve05a  (c) 23:47, 15 November 2018 (UTC)
 * when it gets url encoded, the space becomes a plus sign. When people start using doi with spaces and emojis it is going to suck AManWithNoPlan (talk) 00:02, 16 November 2018 (UTC)
 * Ugggh! Horrible thoughts! Burn them before they end up in doi's! <span style="background: turquoise;font-family: 'Segoe Script', 'Comic Sans MS';">(t) Josve05a  (c) 11:38, 16 November 2018 (UTC)

fixed AManWithNoPlan (talk) 14:30, 10 December 2018 (UTC)

Request: Process website dates more
https://en.wikipedia.org/w/index.php?title=School-to-prison_pipeline&diff=872325415&oldid=872325081

https://en.wikipedia.org/w/index.php?title=Standard_of_living_in_Israel&diff=870417470&oldid=868413825


 * https://github.com/ms609/citation-bot/pull/1098 AManWithNoPlan (talk) 23:08, 23 November 2018 (UTC)

Reuters

 * When the actuall website is Reuters.com, it whould be the work (such as newspaper), but while Reuters is the author of an article on another website (such as theguardian/nytimes) it should be agency. In this case Reuters be removed. Both Reuters and Reuters should not be present. <span style="background: turquoise;font-family: 'Segoe Script', 'Comic Sans MS';">(t) Josve05a  (c) 14:59, 24 November 2018 (UTC)
 * Same proble as with assocaited press AManWithNoPlan (talk) 17:54, 24 November 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/1102 AManWithNoPlan (talk) 19:31, 24 November 2018 (UTC)

fixed

be less exact with agency

 * In the same edit, it did not add an extra parameter for the Associated Press of Pakistan and for Agence France-Presse. All these agencies can often be called a couple of different names (e.g. AP, the Associated Press, or Associated Press), so that might be an issue. w umbolo   ^^^  19:44, 24 November 2018 (UTC)
 * I have added to pull 1102 some code to make it less exact. AManWithNoPlan (talk) 23:16, 24 November 2018 (UTC)

fixed

Date format
notabug the accessdate is formatted wrong, not what we did. The page says undefined AManWithNoPlan (talk) 16:47, 10 December 2018 (UTC)

Request: handle non-escaped dx.doi.org URL
https://github.com/ms609/citation-bot/pull/1072 AManWithNoPlan (talk) 16:44, 15 November 2018 (UTC)

and https://en.wikipedia.org/w/index.php?title=Instar&type=revision&diff=872352613&oldid=872351168

fixed

Cite arXiv should have capital X
fixed

Bot renames parameters to create duplicate alias of existing "work" parameter

 * https://github.com/ms609/citation-bot/pull/1100 AManWithNoPlan (talk) 23:07, 23 November 2018 (UTC)
 * Not sure how to tell if this is fixed, but if it was, it didn't work: edit at 19:57, 29 November 2018, see citation with title beginning "USA cyclist Tejay van Garderen". DferDaisy (talk) 01:31, 30 November 2018 (UTC)

fixed

Running bot twice (again)
https://github.com/ms609/citation-bot/pull/1101 AManWithNoPlan (talk) 17:13, 24 November 2018 (UTC)

fixed

The tool seever is down
fixed

Server got Nuuk'd?
fixed

Unexpected data found in parse_plain_text_reference. Citation bot cannot parse.

 * Thank you for the report. This comes from arXiv data.  We support about a dozen formats that they use.   This helps us decode new ones (or in some cases detect and not decode). AManWithNoPlan (talk) 15:33, 21 November 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/1118 AManWithNoPlan (talk) 19:19, 15 December 2018 (UTC)

chapter= added to Cite encyclopedia without removing title=
chapter is not a documented parameter in cite encyclopedia. title is supposed to be used for the encyclopedia entry. The bot should probably not add chapter at all when title is present, and it definitely should not add chapter and leave title in place. – Jonesey95 (talk) 18:53, 8 December 2018 (UTC)
 * The bot's edit summary was also partially incorrect in this edit, in that it claimed to have "Removed parameters", but it did not do so. – Jonesey95 (talk) 18:54, 8 December 2018 (UTC)
 * This should help a lot https://github.com/ms609/citation-bot/pull/1121 AManWithNoPlan (talk) 00:59, 16 December 2018 (UTC)

Convert worlcat.org urls with titles in urls to oclc also
https://github.com/ms609/citation-bot/pull/1114 AManWithNoPlan (talk) 04:00, 14 December 2018 (UTC)

GBIF
(since reverted) had a number of issues, not least that no journal is involved. Andy Mabbett ( Pigsonthewing ); Talk to Andy; Andy's edits 16:46, 14 December 2018 (UTC)
 * Debugging now. AManWithNoPlan (talk) 18:59, 15 December 2018 (UTC)
 * Weird. i cannot get it to reproduce AManWithNoPlan (talk) 00:46, 16 December 2018 (UTC)
 * will just flag wontfix

Removal of valid URL from tag
The doi link points to the exact same page and is not prone to breaking as publisher links are. also, this case the pdf file is actually free which is a very unusual for a publisher website. AManWithNoPlan (talk) 21:04, 15 December 2018 (UTC)

Fails on Probiotic
notabug banned urls cannot be in modified text. z e n o d o.  AManWithNoPlan (talk) 22:49, 19 December 2018 (UTC)

eScholarship
code change submitted AManWithNoPlan (talk) 01:49, 20 December 2018 (UTC)

Wish list
Wish list wontfix they said no

Discussion: non-functional jstor dois
Any thoughts on which of these is better:

Should that bot remove the non-functional doi when it the same as the jstor link with 10.2307 added in front of it? AManWithNoPlan (talk) 16:30, 18 November 2018 (UTC)
 * I prefer the second version only, or at least not displaying inactive doi's if other IDs exists. <span style="background: turquoise;font-family: 'Segoe Script', 'Comic Sans MS';">(t) Josve05a  (c) 21:16, 19 November 2018 (UTC)


 * Non-functional DOI links of the form 10.2307/<JSTORID> can be removed if they are broken. Working JSTOR dois, or JSTOR dois of a different form should be left alone. I believe JSTOR used to have internal redirects, but no longer do, so that's why we've got a bunch of crap 10.2307/<JSTORID> DOIs laying around. Headbomb {t · c · p · b} 21:49, 19 November 2018 (UTC)
 * Anecdotally, sometimes the works where the JSTOR ID doesn't correspond to a working DOI actually have another DOI from a publisher. I'm not sure if these DOIs were never issued or what. Nemo 23:04, 20 November 2018 (UTC)
 * That is correct, some do not actually have the doi issued. Some have one from the publisher and one from jstor (and maybe one from researchgate and and who knows who else.  AManWithNoPlan (talk) 01:22, 21 November 2018 (UTC)


 * https://github.com/ms609/citation-bot/pull/1127 AManWithNoPlan (talk) 17:20, 20 December 2018 (UTC)

fixed

Request: clean up sciencedirect URLs
Search results for ?via%3Dihub. Perhaps remove all "via" URL-parameters for that link. <span style="background: turquoise;font-family: 'Segoe Script', 'Comic Sans MS';">(t) Josve05a  (c) 13:08, 21 November 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/1128 AManWithNoPlan (talk) 17:34, 20 December 2018 (UTC)

Newspapers with multiple names
https://github.com/ms609/citation-bot/pull/1129 AManWithNoPlan (talk) 22:10, 20 December 2018 (UTC)
 * and https://github.com/ms609/citation-bot/pull/1131 AManWithNoPlan (talk) 17:02, 21 December 2018 (UTC)

BBC Sport
Why do you say that? AManWithNoPlan (talk) 15:33, 7 December 2018 (UTC)
 * Please refer to this discussion. Mattythewhite (talk) 13:51, 8 December 2018 (UTC)
 * I will think about the solution since bbc (not bbc sports) is the publisher. Newspaper is one of the many work aliases.  AManWithNoPlan (talk) 21:16, 8 December 2018 (UTC)
 * See also this discussion. The use of BBC Sport is a well-established norm and there is consensus for it. Nzd   (talk)  08:46, 13 December 2018 (UTC)


 * https://github.com/ms609/citation-bot/pull/1130 AManWithNoPlan (talk) 02:16, 21 December 2018 (UTC)

Fails to convert urls with library proxies
https://github.com/ms609/citation-bot/pull/1115 AManWithNoPlan (talk) 04:24, 14 December 2018 (UTC)


 * Two jstor areas in code. Now need to do part right after plants. AManWithNoPlan (talk) 16:58, 21 December 2018 (UTC)

please add more entrez support
https://github.com/ms609/citation-bot/pull/1116 AManWithNoPlan (talk) 19:06, 15 December 2018 (UTC)

API: Silent/Verbose mode for category
Add a 'silent' mode. This would simplify the output to simply -- [12:13:02] Processing page '2018 FFA Cup preliminary rounds' – edit – history when there is no changes made and -- [12:13:02] Processing page '2018 FFA Cup preliminary rounds' – edit – history when there is a change made. This could probably made 'default' for categories, with  to disable it. Or alternatively,  to enable verbose logs. Headbomb {t · c · p · b} 12:25, 21 August 2018 (UTC)
 * 1) No changes required.
 * 1) Updating the page (diff).
 * difficult to fix: pages that take a while to process will cause an HTTP disconect.  AManWithNoPlan (talk) 13:13, 31 October 2018 (UTC)
 * not sure what's that got to do with a simplified output in general? Headbomb {t · c · p · b} 13:36, 31 October 2018 (UTC)
 * perhaps output dots as the bot runs. let me think about it. AManWithNoPlan (talk) 13:50, 31 October 2018 (UTC)

way to many places in the code would need changed. also likley to drop connection while running. wontfix AManWithNoPlan (talk) 17:03, 22 December 2018 (UTC)

Do not touch any parameter with comments
In this edit.
 * Removed/touched a parameter with a comment, which should "block out" the bot from touching it. <span style="background: turquoise;font-family: 'Segoe Script', 'Comic Sans MS';">(t)  Josve05a  (c) 09:30, 12 November 2018 (UTC)

I think more fixed now. AManWithNoPlan (talk) 16:50, 22 December 2018 (UTC)

Spanish title case
Note that the journal themselves capitalize, e.g. "Tomado de Joaquín Galarza, “Los códices mexicanos”, Arqueología Mexicana, Edición especial núm. 31, Códices prehispánicos y coloniales tempranos. Catálogo, pp. 6 - 9.". The main issue here is that you're trying to add the issue title to the journal. It really should just be


 * or similar.

Headbomb {t · c · p · b} 14:09, 21 December 2018 (UTC)


 * Just because the original source has one style does bot mean we follow it. Thouhts? AManWithNoPlan (talk) 17:14, 21 December 2018 (UTC)


 * notabug Jounal titles in many styles are capitalized. No the bots fault that the template was used wrong. AManWithNoPlan (talk) 16:56, 22 December 2018 (UTC)

API: add &via= option (also what does &edit= do?)
In a call like, does edit=toolbar do anything? Because I'd like to have some ways to tell the bot that it was triggered via Draft article or citation expander, or similar. We might want to rename the parameter to allow something like This way we could give a summary like Headbomb {t · c · p · b} 03:43, 26 August 2018 (UTC)
 * Add: website. Removed parameters. You can use this bot yourself. Report bugs here. | Headbomb via User:Headbomb/citations.js
 * Add: website. Removed parameters. You can use this bot yourself. Report bugs here. | Headbomb via the citation expander
 * Add: website. Removed parameters. You can use this bot yourself. Report bugs here. | Headbomb via Template:Draft article
 * I wonder what the audience of this additional message would be? To most users, what is important is the content and motivation of an edit, rather than the circumstances in which an editor came to make it. If I have a clear understanding of the motivation for this change, I'll be able to consider the best way to implement it.  Martin  (Smith609 – Talk)  08:56, 27 August 2018 (UTC)
 * The goal is mostly to have a way to see where Citation bot is used from. How many of those edits were triggered by the web interface? How many were from user scripts and from which userscript, or how many from templates and which templates (and do any need updating)? How many were done via the Citation Expander gadget? It's not necessarily to have 'official' stats (it would be nice though), but knowing where the bot is used from is nice, and could let us give help to newbies that run into issues with the bot. Headbomb {t · c · p · b} 10:32, 27 August 2018 (UTC)
 * For example, was most likely triggered from Draft article, present on Draft:Lil ginger ale (we sadly can't feed who used the Template from the template because we don't have a   magicword/variable), but knowing it was triggered from the template means it has a fairly high chance of being used by a newbie, and was probably triggered by one of these people. So that lets us (or at least me) customize feedback to people. If I see someone doing something weird/unusual with the bot from Draft article vs Web Interface vs Gadget vs User Scripts, well you more or less have a continuum of likely noob vs likely noob/intermediate vs likely intermediate vs likely advanced user dealing with the bot. And you'd have an idea of who could have triggered the bot in that scenario. Headbomb {t · c · p · b} 10:45, 27 August 2018 (UTC)

mostly fixed up. Draft actually comes from several templates deep in https://en.wikipedia.org/w/index.php?title=Template:Automated_tools/core&action=edit AManWithNoPlan (talk) 21:52, 23 December 2018 (UTC)


 * what's the syntax? Headbomb {t · c · p · b} 22:25, 23 December 2018 (UTC)

Blacklisted URL
If the bot tries to "reformat" a blacklisted link (e.g. https://zenodo.org/record/1223952 /files/article.pdf to https://zenodo.org/record/1223952 the bot will not be able to save the edit. We should stop to reformat these URLs, in order to be able to edit such pages. Editing pages with existing links aren't stopped, but formatting them turns them in to new links - which are blacklisted. <span style="background: turquoise;font-family: 'Segoe Script', 'Comic Sans MS';">(t) Josve05a  (c) 07:06, 28 December 2018 (UTC)


 * https://github.com/ms609/citation-bot/pull/1142 AManWithNoPlan (talk) 14:05, 28 December 2018 (UTC)


 * A better approach would be to find what causes this blacklisting, and see if edit filters can't be tweaked to let Citation Bot work around them. Headbomb {t · c · p · b} 14:44, 28 December 2018 (UTC)


 * Given that there are multiple ever changing black lists that would be hard.  Awesome, but hard.  AManWithNoPlan (talk) 14:54, 28 December 2018 (UTC)

fixed for now. AManWithNoPlan (talk) 05:25, 29 December 2018 (UTC)

time out on Life extension
https://github.com/ms609/citation-bot/pull/1145 and https://github.com/ms609/citation-bot/pull/1144 and https://github.com/ms609/citation-bot/pull/1143 AManWithNoPlan (talk) 18:03, 28 December 2018 (UTC)

Update naldc.nal.usda.gov URLs

 * Make a WP:BOTREQ and someone can take care of this with AWB. Headbomb {t · c · p · b} 16:02, 29 December 2018 (UTC)


 * Yes please use BOTREQ for URL updates, but be careful using AWB it typically breaks archive URLs and/or doesn't undo previous archivals of the broken URL. -- Green  C  16:08, 29 December 2018 (UTC)


 * Example what is required. Job done. -- Green  C  18:00, 29 December 2018 (UTC)


 * one time focused tasks like this are not optimal for the citation bot. AManWithNoPlan (talk) 23:54, 29 December 2018 (UTC)
 * I meant, the job is done. It has been completed. Special:Search/insource:"naldc.nal.usda.gov/naldc/download.xhtml" shows zero hits. I posted the diff to illustrate for Headbomb what is required when modifying URLs - it's not a search-replace with AWB because that causes problems with archives. -- Green  C  00:16, 30 December 2018 (UTC)
 * Thank you for the fix and for the wayback "medication". Nemo 00:24, 30 December 2018 (UTC)

Request: Shove "additional information" stuff after the pipe in edit summaries
It would probably make more sense to shove "additional information" stuff after the pipe or if &via= and category mentions are implemented. Headbomb {t · c · p · b} 04:10, 26 August 2018 (UTC)
 * | Activated by Headbomb. You can use this bot yourself! Report bugs here.
 * | Activated by Headbomb on Category:Livestock stubs via the web interface. You can use this bot yourself! Report bugs here.
 * | Activated by Headbomb on Category:Livestock stubs via the citation expander. You can use this bot yourself! Report bugs here.


 * https://github.com/ms609/citation-bot/pull/1134 AManWithNoPlan (talk) 21:05, 23 December 2018 (UTC)
 * Via= can't really be implemented in any useful manner, since edit= is currently unused and all usages use edit=toolbar and the draft article uses edit=toolbar and draft does not directly include it (it is two templates deeper, so we can't even tag it as from draft). We have done what we can for now. AManWithNoPlan (talk) 23:03, 23 December 2018 (UTC)
 * How about the way I suggest above? Headbomb {t · c · p · b} 23:34, 23 December 2018 (UTC)
 * There’s no way to easily detect how it was run. We already specify category as different. We have improved user detection though.   AManWithNoPlan (talk) 23:57, 23 December 2018 (UTC)
 * Yes, but there is a way to recognize what is fed in &via=, or if a &via= is declared. Also, since it got archived, what's the syntax for via? Headbomb {t · c · p · b} 00:15, 24 December 2018 (UTC)
 * it does not exist. there’s no reliable way to do it.  We can detect category vs toolbar, but nothing else.  That is why edit= is not used.   AManWithNoPlan (talk) 00:28, 24 December 2018 (UTC)
 * Well that's what the request in via was about. To add support for &via=. Headbomb {t · c · p · b} 03:54, 24 December 2018 (UTC)
 * I know and we’ve done all we really can. Unless we have some way of actually getting reliable information (which we do not) there’s really no point to adding it.  AManWithNoPlan (talk) 04:08, 24 December 2018 (UTC)
 * What do you mean 'reliable information'? what's wrong with just displaying the information that's passed in &via=! That'd be the whole point of via. Headbomb {t · c · p · b} 04:54, 24 December 2018 (UTC)
 * we would need an approved list of options to choose from and not just accept random strings. AManWithNoPlan (talk) 04:59, 24 December 2018 (UTC)
 * I honestly doubt anyone would set it, since the toolbar and the citation toolset core that draft pulls information from both set toolbar. AManWithNoPlan (talk) 05:01, 24 December 2018 (UTC)
 * Why would we need a list of options / pre-approved stringers? 99%+ of usages would be from templates and scripts. Headbomb {t · c · p · b} 06:19, 24 December 2018 (UTC)
 * I think the pre-approved strings would serve as a kind of input sanitisation. Otherwise at some point you may need to check that you're not inserting junk or spam in edit summaries (where it's hard to remove). I don't know how important a concern this is, but it's not unreasonable to keep it mind. Nemo 10:08, 27 December 2018 (UTC)

fixed pipe added. AManWithNoPlan (talk) 19:38, 30 December 2018 (UTC)

Fix spacing in page/issue rages
If it is only long dashes and numbers and spaces then remove spaces.  Correct? AManWithNoPlan (talk) 21:54, 23 December 2018 (UTC)


 * Could be letters too, like A23 - A48. Convert/fix that to A23–A48. Headbomb {t · c · p · b} 22:02, 23 December 2018 (UTC)
 * that get dangerous could be junk like ii - iii, 5-7 or the evil look at pages 5 to seven and browse around pages in the early teens.....  I will think about how many letters to allow. AManWithNoPlan (talk) 22:06, 23 December 2018 (UTC)
 * I can start simple and move on from there. AManWithNoPlan (talk) 22:53, 23 December 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/1135 AManWithNoPlan (talk) 23:01, 23 December 2018 (UTC)

miscellaneous
1. French journals like Archives de sciences sociales des religions do NOT capitalise

2. Why change page= to pages=, when the article is one page only?

3. The name of this website is HannahArendt.net


 * many style guides actually specify capitalization of Foreign journals independent of the what the journal itself is called.  It’s an odd thing.  Specific journals can be submitted for capitalization as needed. AManWithNoPlan (talk) 19:16, 24 December 2018 (UTC)
 * pages vs. pages is odd. Jstor gives us a range and then we fix that and so it is temporarily a range of pages  AManWithNoPlan (talk) 19:16, 24 December 2018 (UTC)
 * websites are not case-sensitve, but I can add a capitalization exception. the initial reference being a mix of a journal and a website confused the bot. AManWithNoPlan (talk) 19:16, 24 December 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/1136 AManWithNoPlan (talk) 19:39, 24 December 2018 (UTC)

Do not add null
https://github.com/ms609/citation-bot/pull/1137 and https://github.com/ms609/citation-bot/pull/1138 AManWithNoPlan (talk) 00:13, 27 December 2018 (UTC)

When journals ends in 'Des', don't uncapitalize
https://github.com/ms609/citation-bot/pull/1146 AManWithNoPlan (talk) 21:36, 28 December 2018 (UTC)

Prioritize DOI > Bibcode > Arxiv when fetching journal information
If arXiv has a doi then process that before the rest of the record AManWithNoPlan (talk) 18:52, 27 December 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/1140 AManWithNoPlan (talk) 19:04, 27 December 2018 (UTC)

Bot adds deprecated class= to Template:Citation
I see that the template documentation has changed again. AManWithNoPlan (talk) 18:25, 29 December 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/1148 AManWithNoPlan (talk) 18:51, 29 December 2018 (UTC)

Messes with a pubmed it shouldn't mess with
https://github.com/ms609/citation-bot/pull/1149 AManWithNoPlan (talk) 04:40, 30 December 2018 (UTC)

Capitalize after (
https://github.com/ms609/citation-bot/pull/1150 AManWithNoPlan (talk) 17:11, 30 December 2018 (UTC)

More JSTOR support
https://github.com/ms609/citation-bot/pull/1154 AManWithNoPlan (talk) 04:24, 1 January 2019 (UTC)

ePrint
See also User_talk:Citation_bot/Archive_10 (e?Prints should read e-?Prints?) Headbomb {t · c · p · b} 18:03, 31 December 2018 (UTC)
 * will add ePrint, e-Prints, and e-Print to the existing ePrints. AManWithNoPlan (talk) 18:22, 31 December 2018 (UTC)

Bot generates template errors
Don’t convert to cite arXiv when incompatible parameters exist. AManWithNoPlan (talk) 18:26, 31 December 2018 (UTC)
 * https://github.com/ms609/citation-bot/pull/1153 AManWithNoPlan (talk) 23:47, 31 December 2018 (UTC)

Request: Better Citoid like capabilities
Investigating. AManWithNoPlan (talk) 21:41, 23 December 2018 (UTC)
 * If the Citation bot can easily read & process the HTML Meta tags for the page, it could sometimes do better for these cases.
 * For the example Nature URL, prism.doi, dc.identifier, DOI meta tags all provide the doi, though the first two give doi:....
 * I think the prism. tags may also work for some newspapers.
 * Scripts/Perl scripts/hdump-head.pl was my ugly attempt at this. Hope this is useful. RDBrown (talk) 08:13, 28 December 2018 (UTC)
 * can you point to a single page where prism works better? Also, we need php, not perl generally. AManWithNoPlan (talk) 19:42, 30 December 2018 (UTC)


 * I needed to fix the perl script for https after that, this is the output for the Nature url Josve05a referenced

Page title: @0.0.0 "Books in brief | Nature" msapplication-tilecolor #940720 dc.identifier  doi:10.1038/546031a wt.z_primary_atype     Books and Arts robots noarchive dc.publisher   Nature Publishing Group wt.cg_s Article access_endpoint https://www.nature.com/platform/readcube-access theme-color    #940720 twitter:description    Barbara Kiser reviews five of the week's best science picks. wt.cg_n Nature dc.date 2017-05-31 doi    10.1038/546031a wt.template    oscar twitter:card   summary twitter:title  Books in brief prism.startingpage     31 twitter:site   @naturenews dc.creator     Barbara Kiser prism.rightsagent      permissions@nature.com dc.rights      ©2019 Macmillan Publishers Limited. All Rights Reserved. prism.url      https://www.nature.com/articles/546031a description    Books & Arts wt.page_categorisation Article_HTML wt.z_bandiera_abtest   a prism.issn      1476-4687 prism.issn     1476-4687 dc.subject     Climate sciences dc.subject     Engineering dc.subject     Medical research dc.subject     Nuclear physics dc.description Barbara Kiser reviews five of the week's best science picks. dc.type Books and Arts wt.z_cg_type   Nature Research Journals wt.z_subject_term      Climate sciences;Engineering;Medical research;Nuclear physics dc.copyright   2017 Nature Publishing Group prism.copyright 2017 Nature Publishing Group dc.language    En dc.title        Books in brief viewport       width=device-width,initial-scale=1.0,maximum-scale=2.5,user-scal able=yes dc.rightsagent permissions@nature.com prism.volume   546 wt.z_subject_term_id   climate-sciences;engineering;medical-research;nuclear-physics twitter:image  https://media.springernature.com/full/nature-static/assets/v1/image-assets/546031a-i1.jpg dc.source      Nature 2017 546:7656 msapplication-tileimage /static/images/favicons/nature/favicon-144x144.3e61d1f755.png prism.number   7656 prism.publicationdate  2017-05-31 journal_id     nature access Yes prism.doi      doi:10.1038/546031a prism.section  Books and Arts dc.format      text/html prism.publicationname  Nature
 * 1)    https://www.nature.com/articles/546031a#bk4

Barbara Kiser[au] 2017-05-31[dp] Books in brief[ti] Nature[ta] 546[vi] 7656[ip] 31[pg]
 * author=Barbara Kiser |date=2017-05-31 |title=Books in brief |journal=Nature |volume=546 |issue=7656 |pages=31 |doi=10.1038/546031a

I'm not a PHP programmer, but this StackOverflow answer may be useful, if you're not already retrieving the meta tag data. I think PRISM may include the Dublin Core dc. tags as a subset, but the BMJ & maybe the Oxford journals also add useful citation_ tags. dc.contributor Gordon C S Smith dc.contributor Jill P Pell dc.identifier  10.1136/bmj.327.7429.1459 citation_title Parachute use to prevent death and major trauma related to gravitational challenge: systematic review of randomised controlled trials citation_public_url    https://www.bmj.com/content/327/7429/1459 citation_mjid  bmj;327/7429/1459 citation_lastpage      1461 citation_doi   10.1136/bmj.327.7429.1459 citation_section       Hazardous journeys citation_article_type  Other citation_pmid  14684649
 * Hope this is useful RDBrown (talk) 12:34, 1 January 2019 (UTC)

I have submitted code to find the doi. https://github.com/ms609/citation-bot/pull/1156 AManWithNoPlan (talk) 00:58, 2 January 2019 (UTC)

Smarter duplicate handling
This is not actually a bug. The bot leaves the citation unchanged in its current method. AManWithNoPlan (talk) 02:38, 3 January 2019 (UTC)


 * Not saying it's a bug, but it would be an improvement. Headbomb {t · c · p · b} 13:40, 3 January 2019 (UTC)

weird ResearchGate thing
That is actually the correct information. Hard to deal with people who do not know how to spell when inputting data! AManWithNoPlan (talk) 19:01, 5 January 2019 (UTC)


 * Correct information? There is no journal named 'peprint' out there, and that doesn't seem to be anywhere on the RG page either. Is this GIGO? Headbomb {t · c · p · b} 19:05, 5 January 2019 (UTC)
 * yes indeed it is correct. That’s the journal the author entered.  Obviously GIGO. AManWithNoPlan (talk) 19:14, 5 January 2019 (UTC)

TY - BOOK AU - Petit, Jean-Pierre PY - 2016/07/04 SP - T1 - Schwarzschild 1916 seminal paper revisited : A virtual singularity JO - peprint ER -

Anyway, close this one then. No need to code an exception for such uncommon GIGO. Headbomb {t · c · p · b} 19:33, 5 January 2019 (UTC)
 * wontfix as you said. Would have been a lot funnier if they had spelled it will one more e.  AManWithNoPlan (talk) 19:34, 5 January 2019 (UTC)


 * PEE PINTS FOAR EVRYONE!! Headbomb {t · c · p · b} 19:38, 5 January 2019 (UTC)

Timeout on Red imported fire ant
Blocked zenodo dot org url is trying to be added. Not sure how or why. AManWithNoPlan (talk) 23:29, 5 January 2019 (UTC)
 * https://github.com/ms609/citation-bot/pull/1171 AManWithNoPlan (talk) 00:51, 6 January 2019 (UTC)
 * https://github.com/ms609/citation-bot/pull/1175 AManWithNoPlan (talk) 02:39, 6 January 2019 (UTC)

CAPS: Fluids and Barriers of the CNS
https://github.com/ms609/citation-bot/pull/1173 AManWithNoPlan (talk) 02:40, 6 January 2019 (UTC)

URLs containing an ISSN-DOI replaced with link to incorrect article in a different journal

 * This mostly happens with Wiley's "fake DOI" ISSN links (which are often rather spammy by the way, as in this article) and can be conclusively solved only by actually resolving DOI links. Nemo 10:32, 6 January 2019 (UTC)
 * Look where the incorrect reference goes. Even though the bot put "journal=Genes, Brain and Behavior", it was to an article in a completely different journal that had "Genes, Brain and Behavior" as title. It didn't go to one of Wiley's URLs at all. Wiley doesn't use these fake DOI URLs any more, although these generally are still functional but redirect to the new (non-DOI) URLs. All that the bot should have done was replace the "fake DOI URL" with the new URL. --Randykitty (talk) 10:45, 6 January 2019 (UTC)
 * That's a problem with fake DOIs that resolve. AManWithNoPlan (talk) 15:14, 6 January 2019 (UTC)
 * I could probably add code to detect DOIs in the form of 10.xxxxx/(ISSN)xxxx-xxxx which are obviously just an ISSN. AManWithNoPlan (talk) 15:21, 6 January 2019 (UTC)
 * Thanks for maintaining this invaluable tool. BTW, I'm still curious why the bot took those fake DOI links and arrived at an old book review, mixing up the review title and the journal name... --Randykitty (talk) 15:35, 6 January 2019 (UTC)
 * The Bot took the journal title which was in the title parameter and did a PMC search and found an exact match and went with it. We do have rare false positives like this. AManWithNoPlan (talk) 15:55, 6 January 2019 (UTC)
 * I see. Yes, that must be rare :-) Thanks again. --Randykitty (talk) 16:06, 6 January 2019 (UTC)

Once this is accepted and pushed to wikipedia, the bug will be gone. https://github.com/ms609/citation-bot/pull/1177 AManWithNoPlan (talk) 19:55, 6 January 2019 (UTC)

Timeout at Edward M. Fram
notabug I ran it successfully and there were no changes to make. AManWithNoPlan (talk) 04:31, 7 January 2019 (UTC)