Wikisource:Bot requests
Move all subpages of Who's Who in the Far East to use title case
[edit]I was informed by User:Beeswaxcandle that I should use title case instead of all caps in article names. So I request to move all subpages of Who's Who in the Far East to use title case. Although I can use a bot to move it myself, that would leave tons of redirects for admins to delete. But if an admin can easily batch-delete a list of pages, I can move it myself and then provide the list of pages to delete. I'm sorry for the inconvenience. Thanks, --Stevenliuyi (talk) 08:58, 6 May 2021 (UTC)
- @Stevenliuyi: Please review the list at Wikisource:Bot requests/sandbox. I notice that there is at least one English name that needs to be fixed, and the Chinese names didn't convert on the regex that I used. Would you fix or create the target (only) in the list in the pair list, and I will get it done. No need to fix those that are broken though you should fix the previous/next links of the articles either side. To note that as I did for your other work, I will look to get a work specific template in place, though will do that afterwards. — billinghurst sDrewth 13:10, 24 May 2021 (UTC)
- I suppose that I really to want to ensure that the Chinese names are capitalised properly. — billinghurst sDrewth 02:57, 25 May 2021 (UTC)
- @Stevenliuyi and @Billinghurst: Has this request been actioned (i.e. can it be closed as resolved)? Xover (talk) 10:34, 10 April 2022 (UTC)
- @Stevenliuyi: Please see Billinghurst's request (above) for quality control of the list of targets in Wikisource:Bot requests/sandbox. They have done the legwork to prepare for the move, but it is unable to progress until you've checked and corrected the target page names. Xover (talk) 05:33, 3 September 2022 (UTC)
- @Stevenliuyi: This is blocked on your input here. Xover (talk) 10:19, 7 November 2023 (UTC)
- @Stevenliuyi: Please see Billinghurst's request (above) for quality control of the list of targets in Wikisource:Bot requests/sandbox. They have done the legwork to prepare for the move, but it is unable to progress until you've checked and corrected the target page names. Xover (talk) 05:33, 3 September 2022 (UTC)
Wikidata bulk edit
[edit]I made a query for works on enWS that have WD items with no "instance of" statement. The criteria I used are:
- Pages in mainspace
- No redirects or disambiguation pages (this includes Versions and Translations btw)
- Does not contain a forward slash in the page name (in order to exclude subpages)
- Is linked to Wikidata, and linked Wikidata item does not have a P31 statement
This query returns 13889 results, which is more than even QuickStatements can handle. Would it be possible for a bot to update these Wikidata items with P31=Q3331189 (instance of = version, edition, or translation)?
Thanks :) —Beleg Tâl (talk) 13:22, 1 November 2021 (UTC)
- I think we could be more specific for certain groups, e.g I have addressed "Presidential Radio Address" articles as "instance of speech". There are several groups of articles that can be identified and then addressed with QuickStatements. After that, the bot can be run on what is left. Mpaa (talk) 23:13, 1 November 2021 (UTC)
- @Mpaa: Except they are editions as we host them, the speech would be the parent to the item, per d:WD:Books as there may be other published editions of the same speech. — billinghurst sDrewth 12:17, 5 September 2022 (UTC)
- @Billinghurst I see. I saw other were linked that way and I followed along. If it is not correct, it should be cleaned up but I do not master wikidata tools enough to write a bot for it. Mpaa (talk) 21:34, 5 September 2022 (UTC)
- We desperately need better Wikidata tools (so we're not dependent on Billinghurst to be on eternal vigilance here). But the current gadget we have for this is loaded from some user's personal page on Russian Wikisource (which is kinda iffy in itself these days), and its code is completely incomprehensible. If anybody knows of or runs across good API docs for how to talk to Wikidata I'd be very interested. As far as I can tell, the only existing API is the main MW:API with some very minor additions for WD, and that's way way too painful to use for our purposes. Xover (talk) 06:15, 6 September 2022 (UTC)
- @Xover: Maybe we should just be bold and create a phabricator task and see where we go. We probably should have put this into the desired toys to be built for 2023, though we have missed that boat as it is currently in final stages of voting (I think). — billinghurst sDrewth 05:40, 22 February 2023 (UTC)
- We desperately need better Wikidata tools (so we're not dependent on Billinghurst to be on eternal vigilance here). But the current gadget we have for this is loaded from some user's personal page on Russian Wikisource (which is kinda iffy in itself these days), and its code is completely incomprehensible. If anybody knows of or runs across good API docs for how to talk to Wikidata I'd be very interested. As far as I can tell, the only existing API is the main MW:API with some very minor additions for WD, and that's way way too painful to use for our purposes. Xover (talk) 06:15, 6 September 2022 (UTC)
- @Billinghurst I see. I saw other were linked that way and I followed along. If it is not correct, it should be cleaned up but I do not master wikidata tools enough to write a bot for it. Mpaa (talk) 21:34, 5 September 2022 (UTC)
- @Mpaa: Except they are editions as we host them, the speech would be the parent to the item, per d:WD:Books as there may be other published editions of the same speech. — billinghurst sDrewth 12:17, 5 September 2022 (UTC)
- User:Beleg Tâl why not just do it with Petscan itself, from memory it could additions. Also note that there is the interwiki Petscan: for these. — billinghurst sDrewth 12:14, 5 September 2022 (UTC)
Comment wondering whether we need to chip out components of this task. For example, something like petscan:23959659 shows works using {{Act of Congress}} which would not be edition, and would instead by another item, and they also have components that could have other elements added through QuickStatements. Yes, this will still need a large slab of works that need version, edition or translation (Q3331189) added, though at least it will allow for something less than the blunderbuss approach. — billinghurst sDrewth 05:24, 27 February 2023 (UTC)
Copy the proofread text from Index: The last man (Second Edition 1826 Volume 1).djvu to Index:The last man vol 1.djvu
[edit]The three volumes of The Last Man only have a different title page between the first and second edition, could the proofread text of the three-volumes of the second edition be copied to the scans of the first edition. Languageseeker (talk) 23:29, 16 July 2022 (UTC)
- @Mpaa If it is OK to copy also the Page status, better wait for all 3 vols to be validated. Mpaa (talk) 13:53, 18 July 2022 (UTC)
- Makes Sense. Languageseeker (talk) 13:49, 22 July 2022 (UTC)
Collected works of Ibsen moves
[edit]- Index:The collected works of Henrik Ibsen (Volume 5).djvu --> Index:The collected works of Henrik Ibsen (Heinemann Volume 5).djvu
- Index:The collected works of Henrik Ibsen (Volume 6).djvu --> Index:The collected works of Henrik Ibsen (Heinemann Volume 6).djvu
- Index:The collected works of Henrik Ibsen (Volume 7).djvu --> Index:The collected works of Henrik Ibsen (Heinemann Volume 7).djvu
- Index:The collected works of Henrik Ibsen (Volume 8).djvu --> Index:The collected works of Henrik Ibsen (Heinemann Volume 8).djvu
- Index:The collected works of Henrik Ibsen (Volume 9).djvu --> Index:The collected works of Henrik Ibsen (Heinemann Volume 9).djvu
- Index:The collected works of Henrik Ibsen (Volume 10).djvu --> Index:The collected works of Henrik Ibsen (Heinemann Volume 10).djvu
- Index:The collected works of Henrik Ibsen (Volume 11).djvu --> Index:The collected works of Henrik Ibsen (Heinemann Volume 11).djvu
Each Index above needs to have (a) the File on Commons renamed; (b) the Index on Wikisource moved; (c) all Page(s) moved. The issue is that we have an incomplete (UK) Heinemann edition set and a partial (US) Scribner's edition set, with further scans from the Scribner's on Commons, and the two sets of editions ought to be disambiguated before any further confusion or ensues.
Each File / Index / Page should be renamed so that "Heinamann: is added in the position to make the following change:
- "The collected works of Henrik Ibsen (Volume 5).djvu" --> "The collected works of Henrik Ibsen (Heinemann Volume 5).djvu"
The template {{The collected works of Henrik Ibsen}} is the only item linked to these Indices right now, and therefore it should be updated following the change(s). None of the Pages seem to be transcluded, so no mainspace editing should be required. --EncycloPetey (talk) 18:51, 13 July 2024 (UTC)
- @EncycloPetey: Sigh. I went to move the files and found the following:
- File:The collected works of Henrik Ibsen (Heinemann Volume 5).djvu
- File:The collected works of Henrik Ibsen (Heinemann Volume 6).djvu
- File:The collected works of Henrik Ibsen (Heinemann Volume 7).djvu
- File:The collected works of Henrik Ibsen (Heinemann Volume 8).djvu
- File:The collected works of Henrik Ibsen (Heinemann Volume 9).djvu
- File:The collected works of Henrik Ibsen (Heinemann Volume 10).djvu
- File:The collected works of Henrik Ibsen (Heinemann Volume 11).djvu
- That is, Languageseeker apparently uploaded different scans of vols. 5, 6, 8, and 9—but not vols. 7, 10, and 11—using that naming schema. And since Commons policy doesn't permit overwriting these we're kind of stuck on that.Maybe we could put these at…
- File:The collected works of Henrik Ibsen, vol. 5 (Heinemann).djvu
- File:The collected works of Henrik Ibsen, vol. 6 (Heinemann).djvu
- File:The collected works of Henrik Ibsen, vol. 7 (Heinemann).djvu
- File:The collected works of Henrik Ibsen, vol. 8 (Heinemann).djvu
- File:The collected works of Henrik Ibsen, vol. 9 (Heinemann).djvu
- File:The collected works of Henrik Ibsen, vol. 10 (Heinemann).djvu
- File:The collected works of Henrik Ibsen, vol. 11 (Heinemann).djvu
- … possibly along with moving vols. 1–4 to the same schema? --Xover (talk) 11:56, 21 July 2024 (UTC)
- Gah, this is an even bigger mess than I thought. Can we not request an admin to move the low-quality scans for 5, 6, 8, 9 out of the way, and then move the good scans into the now vacant names? We've had to deal with complex moves like that before, and from what I can see, none of those four files have an Index page or other bits associated with them requiring a move. Otherwise, we'd not only have to make the move you suggest but also move our current scans on volumes 1, 2, 3, 4, and 12, as well as all their pages, to make the naming convention consistent. --EncycloPetey (talk) 16:39, 21 July 2024 (UTC)
- You'd need to make a move request on Commons, and hope you get an admin to look at it within a reasonable time plus to understand the point of the request (I am not certain they would accept this move reason without a lot of explanation). Most file moves are made by file movers, not admins, but they don't have the permissions for this kind of thing (they can technically suppress redirects, but aren't allowed to use the permission for this kind of move).BTW, while not trivial, bulk moving pages from one index to another isn't too bad with the pywikibot script Inductiveload made, and usually you can just run a search&replace by bot in mainspace to fix the transclusions. If you decide to go that route it should be fine (even if annoying). Xover (talk) 17:09, 21 July 2024 (UTC)
- Gah, this is an even bigger mess than I thought. Can we not request an admin to move the low-quality scans for 5, 6, 8, 9 out of the way, and then move the good scans into the now vacant names? We've had to deal with complex moves like that before, and from what I can see, none of those four files have an Index page or other bits associated with them requiring a move. Otherwise, we'd not only have to make the move you suggest but also move our current scans on volumes 1, 2, 3, 4, and 12, as well as all their pages, to make the naming convention consistent. --EncycloPetey (talk) 16:39, 21 July 2024 (UTC)
┌─────────┘
@EncycloPetey: Ok, to get some progress on this I propose we do as follows.
1. Move the File:s as follows:
2. Optionally upload a new DjVu of vol. 1 to make the set consistent.
3. Move all the Index: pages from the old file name to the new one.
4. Move all the Page: pages from the old file names to the new ones.
5. Replace all the links (including transclusions) to the old file names in mainspace so they point at the new file names.
All these new file names are currently unused so we can just move them directly there without needing multiple requests to admins at Commons. Making file names consistent within a set is one of the permissible move reasons for file movers (i.e. I can make the moves myself). And since we don't need to old file names for anything we can leave redirects behind which widens the scope of what I'm permitted to do with my filemover rights.
Moving the Index: pages will be done manually (12x), but for Page: pages and references/transclusions in mainspace I should be able to do it by bot for each file (i.e. I have to set stuff up per file, but for that file it will be two bot tasks that takes care of each Page: and each replacement). It's not a trivial amount of work, but I can do all of it myself and I can do it one file at a time, as and when I have time, until we have them all moved.
Thoughts? --Xover (talk) 07:57, 13 August 2025 (UTC)
- @PWidergren: who has been working on these volumes. --EncycloPetey (talk) 17:33, 13 August 2025 (UTC)
- @EncycloPetey: @Xover:
- I will try to pay attention to what happens to this multi-volume work and, of course, defer to whatever you do. I might point out the vol. 13 from the Scribner's version has already been proofread once and needs find a place somewhere. Also there is this entry page The_Collected_Works_of_Henrik_Ibsen which would have to be updated, but do to a lack of understanding about bots, perhaps that is already in the mix. My major focus on Wikisource will continue to be contributing to Swedish Wikisource], but I will try to keep an eye on what happens here.
- PWidergren (talk) 12:09, 14 August 2025 (UTC)
Replace simple uses of override-contributor
[edit]Please make the following replacements in Category:Pages with override contributor type:
<override contributor pattern>is(\|\s*(contributor|section(\-|_|\s)author)\s*=\s*)?\|\s*override(\-|_|\s)(contributor|section(\-|_|\s)author)\s*=\s*.- Assume patterns are on a single line which otherwise contains only whitespace.
<override contributor pattern>\[\[Author:<author pattern>\|<author display pattern>\]\]→\| section\-author = <author pattern> \| section\-author\-display = <author display pattern><override contributor pattern>\[\[Author:<author 1 pattern>\|<author 1 display pattern>\]\]\s+and\s+\[\[Author:<author 2 pattern>\|<author 2 display pattern>\]\]→\| section\-author1 = <author 1 pattern> \| section\-author1\-display = <author 1 display pattern> \| section\-author2 = <author 2 pattern> \| section\-author2\-display = <author 2 display pattern><override contributor pattern>\[\[Author:<author pattern>\|<author display pattern>\]\], translated by \[\[Author:<translator pattern>\|<translator display pattern>\]\]→\| section\-author = <author pattern> \| section\-author\-display = <author display pattern> \| section\-translator = <translator pattern> \| <translator display pattern><override contributor pattern>\{\{anon\}\}→\| section\-author = anon
—CalendulaAsteraceae (talk • contribs) 08:15, 2 October 2024 (UTC)
US Supreme Court decisions with no license
[edit]This PetScan query lists 440 US Supreme Court decisions that are missing license tags. Would it be possible for a bot to add {{PD-EdictGov}} to each of these pages? —Beleg Tâl (talk) 21:29, 6 July 2025 (UTC)
- @Beleg Tâl: No it doesn't. It lists 7464 more or less random other texts. Xover (talk) 07:36, 9 August 2025 (UTC)
- @Xover: filtering by Category:United States Supreme Court decisions, though, we do get 460 pages that don't look like false positives. — Alien 3
3 3 11:03, 9 August 2025 (UTC)- Ok. And are we sure these are actually {{PD-EdictGov}}? The Opinion of the Court, concurrances, and dissents are, certainly, but the top level pages appear to be syllaby by the reporter and those are not generally PD. Xover (talk) 11:14, 9 August 2025 (UTC)
- @Beleg Tâl / @Beleg Âlt: I can probably (havn't tested, but assume so) add the template, but I'd like a double check that the licensing is correct first. Could you take a look? Xover (talk) 17:29, 12 August 2025 (UTC)
- Wouldn't the syllaby by the reporter be a work under their offical duties as a government employee and be {{PD-USGov}}. ToxicPea (talk) 17:53, 12 August 2025 (UTC)
- Nope. Xover (talk) 18:23, 12 August 2025 (UTC)
- Why not? ToxicPea (talk) 18:24, 12 August 2025 (UTC)
- The reporter is not necessarily a government employee and the reporting (authoring of the syllabus) may not be within the scope of their duties if they are (see w:Reporter of Decisions of the Supreme Court of the United States). Xover (talk) 19:14, 12 August 2025 (UTC)
- wait, are we saying that these are potential copyvio? —Beleg Âlt BT (talk) 20:01, 12 August 2025 (UTC)
- The reporter is not necessarily a government employee and the reporting (authoring of the syllabus) may not be within the scope of their duties if they are (see w:Reporter of Decisions of the Supreme Court of the United States). Xover (talk) 19:14, 12 August 2025 (UTC)
- Why not? ToxicPea (talk) 18:24, 12 August 2025 (UTC)
- Nope. Xover (talk) 18:23, 12 August 2025 (UTC)
- {{PD-EdictGov}} says that "judicial decisions" are in the public domain - so if these are not {{PD-EdictGov}}, then they should not be in Category:United States Supreme Court decisions, unless I am missing something —Beleg Âlt BT (talk) 20:00, 12 August 2025 (UTC)
- @Beleg Âlt: Supreme Court cases consist of the "Opinion of the Court", which is the main thing; a common-but-not-always "Concurrence by …" one or more of the other judges who agree with the Opinion but want to ge their own points in; and a also common-but-not-always "Dissent by …" someone on the court that disagrees. And, finally, there is often a short summary of the case known as the syllabus of the case. The top level pages in the search above are the syllaby (the ones I checked, anyway), and links on to the Opinions, Concurrences, Dissents, etc. as subpages. All the Opinions, Concurrences, and Dissents are written by judges on the court and are unquestionably, by the court's own repeated rulings, public domain. The syllabus, however, is written by a reporter, not a judge, and so may be eligible for copyright. There's an official reporter that is paid by the government and whose duties it is to publish this in the United States Reports. The syllaby appearing here you can assume are public domain unless there's evidence of some extraordinary circumstance. But there are also unofficial reporters who make their living by adding better summaries, notes, etc. and publishing them (see Reporter of Decisions of the Supreme Court of the United States). The additional parts added by these are not public domain.So… Before we go bot-adding {{PD-EdictGov}} to the ~450 texts in this category we need to have at least some idea where these came from and whether the syllaby are actually public domain or not. I'll bet these are all BenchBot imports, which may or may not count in their favour. Xover (talk) 05:11, 13 August 2025 (UTC)
- Should we move this discussion to WS:CV then, do you think? —Beleg Âlt BT (talk) 14:11, 13 August 2025 (UTC)
- @Beleg Tâl / @Beleg Âlt: Not neessarily, allthough in-depth research on these (and all of BenchBot's imports, if that's what these are) would be good to get done at some point (not to mention scan-backing this stuff to the actual United States Reports volumes). I just need someone to do some due dilligence on these to exclude obvious signs of them coming from something like the Lawyer's Edition (or Westlaw, or…) and at least some evidence making it likely that they're actually covered by {{PD-EdictGov}} before I go bot-adding the template to them. I just don't have the spare cycles to do it myself. Xover (talk) 15:29, 13 August 2025 (UTC)
- Should we move this discussion to WS:CV then, do you think? —Beleg Âlt BT (talk) 14:11, 13 August 2025 (UTC)
- @Beleg Âlt: Supreme Court cases consist of the "Opinion of the Court", which is the main thing; a common-but-not-always "Concurrence by …" one or more of the other judges who agree with the Opinion but want to ge their own points in; and a also common-but-not-always "Dissent by …" someone on the court that disagrees. And, finally, there is often a short summary of the case known as the syllabus of the case. The top level pages in the search above are the syllaby (the ones I checked, anyway), and links on to the Opinions, Concurrences, Dissents, etc. as subpages. All the Opinions, Concurrences, and Dissents are written by judges on the court and are unquestionably, by the court's own repeated rulings, public domain. The syllabus, however, is written by a reporter, not a judge, and so may be eligible for copyright. There's an official reporter that is paid by the government and whose duties it is to publish this in the United States Reports. The syllaby appearing here you can assume are public domain unless there's evidence of some extraordinary circumstance. But there are also unofficial reporters who make their living by adding better summaries, notes, etc. and publishing them (see Reporter of Decisions of the Supreme Court of the United States). The additional parts added by these are not public domain.So… Before we go bot-adding {{PD-EdictGov}} to the ~450 texts in this category we need to have at least some idea where these came from and whether the syllaby are actually public domain or not. I'll bet these are all BenchBot imports, which may or may not count in their favour. Xover (talk) 05:11, 13 August 2025 (UTC)
- Wouldn't the syllaby by the reporter be a work under their offical duties as a government employee and be {{PD-USGov}}. ToxicPea (talk) 17:53, 12 August 2025 (UTC)
- @Beleg Tâl / @Beleg Âlt: I can probably (havn't tested, but assume so) add the template, but I'd like a double check that the licensing is correct first. Could you take a look? Xover (talk) 17:29, 12 August 2025 (UTC)
- Ok. And are we sure these are actually {{PD-EdictGov}}? The Opinion of the Court, concurrances, and dissents are, certainly, but the top level pages appear to be syllaby by the reporter and those are not generally PD. Xover (talk) 11:14, 9 August 2025 (UTC)
- @Xover: filtering by Category:United States Supreme Court decisions, though, we do get 460 pages that don't look like false positives. — Alien 3
Remove spurious breaks from Once a Week (magazine) section parameters
[edit]A number of Once a Week articles have a <br /> following the section parameter. I'd like to fix this. To that end, could a bot operator please make the following replacement in subpages of Once a Week?
\|\s*section\s*=\s*([^\n\|\<]*)\<br \/\>(\n) →
| section = $1$2
—CalendulaAsteraceae (talk • contribs) 21:47, 29 August 2025 (UTC)
Add missing Source parameters to indexes
[edit]There are over 400 PDF and DJVU indexes that don't have the Source parameter filled out. I would like for the following replacements to be made:
- In results for insource:/\|Source=\n/ intitle:/\.pdf/, replace
\|Source=\nwith|Source=pdf\n - In results for insource:/\|Source=\n/ intitle:/\.djvu/, replace
\|Source=\nwith|Source=djvu\n
Thanks! —CalendulaAsteraceae (talk • contribs) 02:10, 9 September 2025 (UTC)
Migrate two-parameter invocations of Template:RunningHeader
[edit]Following up on Wikisource:Scriptorium/Archives/2024-01#Updates to Template:RunningHeader, please make the following replacement in the pages in Category:Running headers with two entries:
\{\{([Rr]unning[ ]?[Hh]eader|[Rr][Hh]|[Rr]unningFooter|[Rr]f)\|([^\{\}\|\n]*)\|([^\{\}\|\n]*)\}\} → {{rh|$2|$3|}}
—CalendulaAsteraceae (talk • contribs) 19:24, 15 September 2025 (UTC)
- The proposed regex encompasses some edge cases, so an entirely automated process will not be sufficient to migrate all uses as intended. In a sample batch undertaken using AWB recently, The above regex would migrate some misformed running headers, omitting page numbers present in the scan, or which had failed to include a blank first parameter need to match the scan. If the proposal was to allow tool-assisted, (with manual confirmation of egde cases) The concern is mitigated. ShakespeareFan00 (talk) 20:54, 21 September 2025 (UTC)
In Index:For Women Scotland Ltd v The Scottish Ministers (2025, UKSC).pdf the paragraph numbers are formatted with {{pn}} followed by a "." followed by a " ". Please replace the " " with {{gap}}. ToxicPea (talk) 00:46, 15 October 2025 (UTC)
