Wikisource:Bot requests

From Wikisource

Jump to: navigation, search
Community pages Bot requests archives (current)→
Shortcut:
WS:BOTR
WS:BR
This page allows users to request that an existing bot accomplish a given task. Note that some tasks may require that an entirely new bot or script be written. This is not the place to ask for help running or writing a bot.

A bot operating performing a task should make note of it so that other bots don't attempt to do the same. Tasks that are permanently assigned or scheduled for long-term execution are listed on Persistent tasks.

Contents

[edit] Unassigned requests

[edit] Interlinking sections of Alabama State Constitution of 1901

Within the text of this constitution, numerous amendments refer to previous amendments. EG. Here, at the bottom is "amendment III [3] to the Constitution of Alabama", which should link back to here. Also, things like "section 260 of article XIV of the Constitution", should link back to the appropriate section. Note that later on the Roman numerals are discarded. Is this possible to do (relatively easyly) with a bot? 68.39.174.238 22:50, 6 July 2008 (UTC)

There are many variations, and the links sometimes refer to the constitution and sometimes to the amendment:
  • articles: "Article X, Section 10";
  • sections: "section 10 of this /Constitution|article/", "section 10", "section 10.09", "sections 10.13, 10.14 and 10.15", "Section 10 1/2", "section 10 of article X", "section 10, article X", "section 10 of article 10", "Section 10. SECTION 10";
  • amendments: "amendment 10", "amendment X", "third amendment";
  • links to other texts: "Title 22, section 189, Code of Alabama 1940", "Code of Alabama, Title 22, section 189", "section 265 of Title 37 of the Code of Alabama of 1940";
  • exceptions: "...title to that certain sixteenth section of school lands described as follows: section 16, township 4 south,...".
A script could do most of the work, but it be very heuristic. It would require a human willing to carefully review the changes to make sure they're in the correct context, and possibly search through the text for missing links. If you're willing to do this followup work, Pathosbot can take care of this task. —{admin} Pathoschild 13:07:26, 02 October 2008 (UTC)

[edit] Index linking

Would it be possible to run a bot to check WhatLinksHere, and if it finds a Wikisource: index as listed at Wikisource:Works, and the current |previous= parameter is empty, then it inserts a back-link to the index? It would help tremendously, especially with things like poetry which are going to be a pain in the ass to get Indexes running. Sherurcij Collaboration of the Week: Author:Augustus John Cuthbert Hare 05:32, 19 February 2008 (UTC)

I suggest implementing {{indexes}}, since using the "previous" parameter provides incorrect metadata. See my proposal on the Scriptorium. —{admin} Pathoschild 23:10:09, 15 May 2008 (UTC)

[edit] Normalize US patents

US patents have been added with various naming conventions and formats, which should be synchronized. —{admin} Pathoschild 06:11:53, 08 May 2008 (UTC)

[edit] Template:Indent

Migrate all use of the old single parameter invocation of {{Indent}} to either the new calling convention, or some other approach. John Vandenberg (chat) 01:01, 29 May 2008 (UTC)

Could you explain the new calling convention? The template page still advocates a single-parameter invocation. —{admin} Pathoschild 01:17:40, 29 May 2008 (UTC)
Documented on the template page. The other approach is to replace these invocations with <poem> . John Vandenberg (chat) 01:32, 29 May 2008 (UTC)
I see you changed "{{indent|number}} text" to "{{indent|text|number}}". Do you have any objection to using "{{indent|number|text}}" instead, to maintain consistency and line up the indentation amounts and texts? The bot can easily correct any pages using either old format. —{admin} Pathoschild 12:07:37, 04 July 2008 (UTC)
"number" is currently optional, and I have converted quite a few to the new syntax, and have been like that for quite a while, so I would rather not have the syntax change, as the current template is what is called when old revisions are viewed. Most cases where a number is specified would be better served with <poem>. John Vandenberg (chat) 12:21, 4 July 2008 (UTC)
I am not sure of the status of this job, and its specific requirements. Is it to be <poem> or modification of {{indent}}. -- billinghurst (talk) 13:29, 15 March 2009 (UTC)

[edit] Cut up formated pages

All the remaining Easton's Bible Dictionary articles (maybe 90% by count) are formated and on these 6 pages below. They need to be cut up and put in separate pages. Can a bot do it? --Carlaude (talk) 15:56, 21 August 2008 (UTC)

[edit] Importing text from DJVU to pages

Texts layers now underlying.

-- billinghurst (talk) 16:25, 26 September 2009 (UTC)

[edit] Updating CIA World Fact Book pages

Would it be possible for a botop to duplicate this format and create new pages with 2008 information? Thanks. Stepshep (talk) 22:50, 29 September 2008 (UTC)

I'd like to take a crack at it. With 2009 info, of course. :-) I'll experiment with coding this, assuming someone else hasn't already. --LarryGilbert (talk) 19:23, 14 November 2009 (UTC)
Go for it. I also note that recently a number of images were removed from the earlier version, so we may just wish to be aware of those links. billinghurst (talk) 23:19, 14 November 2009 (UTC)

[edit] Basic OCR Fix

Could somebody run a bot to replace all instances of tiie or Tiie or TIIE with "The", with the same capitalisation? Google tells me, there are well over a hundred such instances, all from OCR-ed texts. Sherurcij Collaboration of the Week: Author:Romain Rolland. 22:06, 23 March 2009 (UTC)

Yes check.svg Done I think I got them all, but we'll see when special:search catches up. -Steve Sanbeg (talk) 21:19, 20 April 2009 (UTC)

nope, still more than 400 "tiie"s on WS Sherurcij Collaboration of the Week: Author:Carl Linnaeus. 12:05, 15 September 2009 (UTC)
Yes check.svg Done tiie, need recheck


Hgures/hgures = figures Sherurcij Collaboration of the Week: Author:Carl Jung. 05:21, 21 April 2009 (UTC)
WiUiam=William, over 400 errors on WS. Sherurcij Collaboration of the Week: Author:Carl Jung. 06:14, 21 April 2009 (UTC)
Yes check.svg Done OK -Steve Sanbeg (talk) 22:43, 21 April 2009 (UTC)
Nope, still 70 remaining Sherurcij Collaboration of the Week: Author:Carl Linnaeus. 12:05, 15 September 2009 (UTC)
Didn't find 70, though updated those that my Google search discovered. Will need to db check. -- billinghurst (talk) 14:56, 15 September 2009 (UTC)
"bv" = "by", only on Page: namespace, not in main or other, and only in lowercase. There are nearly 5000 instances of this OCR typo on Page: namespace it seems. Sherurcij Collaboration of the Week: Author:Carl Jung. 19:10, 23 April 2009 (UTC)
Tentatively Yes check.svg Done only had about 500, not the extra order of magnitude. May need a rescan in a while. -- billinghurst (talk) 10:13, 16 September 2009 (UTC)
"tiiis" to "this", 85 instances[1] Sherurcij Collaboration of the Week: Author:Carl Linnaeus. 12:03, 15 September 2009 (UTC)
tentative Yes check.svg Done . My google search only showed 67, these are done, though we should db check. -- billinghurst (talk) 14:38, 15 September 2009 (UTC)
Yes check.svg Done -- billinghurst (talk) 13:37, 3 October 2009 (UTC)
Yes check.svg Done where worthwhile, skipped a lot of big ugly works needing some splitting and cleanup -- billinghurst (talk) 13:37, 3 October 2009 (UTC)

[edit] Years of works

As I update many {{PD-old-70}} to show why PD in the USA, I sometimes move categories of years of works into the {{header}} with the parameter "year=", but I would like to ask if anyone can make a bot to do this task, such as converting [[Category:1900 works]] to | year = 1900 in the header. Manually changing these is time-consuming.--Jusjih (talk) 02:11, 14 October 2009 (UTC)

[edit] Swapping header templates

Is there any way to automate the swapping out of {{header}} and {{header2}} for {{Potus-eo}} for those Executive Order articles not already using the custom header template? The default parameters for minimal problems (I'm guessing) in such a conversion are...

{{Potus-eo
 | eo         = >4 or 5 digit EO #<
 | title      = Executive Order >same 4 or 5 digit EO #<
 | section    = 
 | year       = 
 | month      = 00
 | day        = 
 | fr-vol     = 
 | fr-page    = 
 | fr-year    = 
 | fr-month   = 
 | fr-day     = 
 | notes      = 
}}

... so whatever may currently exist in {{{section}}}, {{{notes}}}, etc. should transfer to like parameter in the new. George Orwell III (talk) 08:51, 26 November 2009 (UTC)

Pretty sure that we can get a bot to do the translation from one template to the other
Questions
  • do the blank values need to be in the template? Or can they be omitted?
  • is the >4 or 5 digit EO #< a variable that can be pulled from the work?
  • are there other values that may exist that can be grabbed from the file, eg. categories that complete the data?
billinghurst (talk) 05:18, 27 November 2009 (UTC)
  • Well the blank values would prevent any bangs from occuring but still makes the citation bar appear & be useful. Any existing {{{section}}} or {{{notes}}} values shoud transfer to the new template just fine.
  • The series of articles currently follow the page naming format "Executive Order 13388" for the most part (13388 being the 5 digit EO number in this case) but the {{{title}}} field can have "Executive Order 13388 - To order something for something by something" at times.
  • The template automatically adds the 2 relevant CATs using a helper template {{Potus-eo-data}}when applied. The "PD-USGov" one already exists in most of them but not all unfortunately.
  • and as a starting point, Executive Order 7532 and higher are to be swapped. I think we've done all those earlier than 1937 (lower than EO # 7532} manually by now anyway.
George Orwell III (talk) 19:44, 27 November 2009 (UTC)
Just adding a couple bits... the "notes" field needs to be blank if not supplied; the others do not need to be there. Every EO has a date though, and all from early 1936 and on have a Federal Register citation, so those parameters probably should be left in for easier editing later (if the bot is *really* good, it can pick up the FR citation and date (and subtitle for that matter) from the archives.gov executive order listing pages). The "title" value is defaulted to "Executive Order (num)" so, strictly speaking, it is not necessary -- but doesn't hurt. The "section" field is for the subtitle of the executive order; it is often already there but I have seen it sometimes be in the "notes" section. Sometimes the date is in the notes as well. Probably no way to really determine that though, and may be best just to leave it, and keep the same values for "notes" and "section" if they are already there.
Also, the template adds a DEFAULTSORT setting, so if the main body of the text already has one (never seen one yet) it should be removed. Carl Lindberg (talk) 01:57, 28 November 2009 (UTC)

[edit] Author:Foo

Wikisource:Caliphs - can somebody change [[Author:Foo]] to [[Author:Foo|Foo]]? Merci. Sherurcij Collaboration of the Week: Author:David Livingstone. 18:41, 15 October 2009 (UTC)

Yes check.svg Done . Didn't need a bot, just used the Custom Regex in the sidebar. You are aware that you can code those like [[Author:Foo|]]? -- billinghurst (talk) 23:07, 15 October 2009 (UTC)

[edit] Assigned requests

In other languages