User talk:Mpaa

From Wikisource
(Redirected from User talk:MpaaBot)
Jump to navigation Jump to search

(Archives index, Last archive) Welcome

Hello, Mpaa, and welcome to Wikisource! Thank you for joining the project. I hope you like the place and decide to stay. Here are a few good links for newcomers:

Carl Spitzweg 021-detail.jpg

You may be interested in participating in

Add the code {{active projects}}, {{PotM}} or {{CotW}} to your page for current wikisource projects.

You can put a brief description of your interests on your user page and contributions to another Wikimedia project, such as Wikipedia and Commons.

Have questions? Then please ask them at either


I hope you enjoy contributing to Wikisource, the library that is free for everyone to use! In discussions, please "sign" your comments using four tildes (~~~~); this will automatically produce your IP address (or username if you're logged in) and the date. If you need help, ask me on my talk page, or ask your question here (click edit) and place {{helpme}} before your question.

Again, welcome! — billinghurst sDrewth 12:00, 7 April 2011 (UTC)


Arthur Devine DoB[edit]

I don't suppose you remember where you got 1860 as the DoB of Author:Arthur Devine (diff). He's being enquired about at wikidata here where a second record indicates an Arthur Devine with an 1849 DoB. (An answer, if you have one, at Wikidata, would be appreciated; thx. Equally, 2013 was a long time ago.) --Tagishsimon (talk) 09:10, 2 July 2019 (UTC)

@Tagishsimon: 1849 per Page:The Catholic encyclopedia and its makers.djvu/73billinghurst sDrewth 21:43, 2 July 2019 (UTC)
updated here with refs on talk page, and there. — billinghurst sDrewth 22:40, 2 July 2019 (UTC)

Resolving fostered content issues in a table based structure...[edit]

https://en.wikisource.org/w/index.php?title=Page%3ABench_and_bar_of_Colorado_-_1917.djvu%2F80&type=revision&diff=9412925&oldid=8110232

Is this something that could be automated to be fixed by a bot or AWB? I'll hold off on manual repairs for a few days, to let you see it's feasible. ShakespeareFan00 (talk) 22:15, 3 July 2019 (UTC)

Please go ahead, I have nothing in place that I might quickly use.— Mpaa (talk) 08:40, 6 July 2019 (UTC)

Fine should be nested inside RH; AWB repairable?[edit]

https://en.wikisource.org/w/index.php?title=Page%3APalmore_v._State.pdf%2F5&type=revision&diff=9429246&oldid=9216687

Is there a regexp that could be applied to fix an issue like this en-masse? (either with a bot or by an AWB user)?

There are other DIV-span swaps that could also be dealt with en-masse.

DIV SPAN Swap repairs and solutions..
original replacement
smaller|<poem>
smaller block|<poem>
larger|<poem>
larger block|<poem>
{{smaller|{{c|
{{c|1={{smaller|

And so on , is there a page for listing this en-masse fixes?

ShakespeareFan00 (talk) 12:21, 12 July 2019 (UTC)

I spent a lot of time cleaning up these things. What I learned is that parsing nested templates is not easy and there is always an unforeseen corner case. And it is always more complex than this. E.g. if larger|<poem> is wrapped in another span-template, the replacement would give error anyhow. And also, changes shall be done in a way that format is uniform over a single work. So there is no easy way to automate this. Just proceed in small chunks at a time.— Mpaa (talk) 21:37, 12 July 2019 (UTC)

Stripped tags...[edit]

I note a LOT of the entries for a 'stripped tag' are down to <references /></div> which should read simply <references />. Is this a task that a BOT could do, namely the removal of a spurious closing tag? Resolving this specfic use case would considerably reduce the backlog of LintErrror hunting? ShakespeareFan00 (talk) 01:31, 25 July 2019 (UTC)

Why are we even worrying about such tags? They have no visual presentational harm, so are just noise. It is a total waste of time to chase them and fix them. — billinghurst sDrewth 02:23, 25 July 2019 (UTC)
The concern you express is essentially the same as one of the strands of w:WP:COSMETIC. If you are saying that many of the detected concerns aren't worth the time to fix them, then perhaps we should have a local version of the equivalent policy? ShakespeareFan00 (talk) 06:57, 25 July 2019 (UTC)

A new madness[edit]

Not certain what is going on around. We used to keep things simple and let the words of the authors rule. Users maddeningly adding unnecessary complexity and hanging on every element of the printer and typographer. — billinghurst sDrewth 21:39, 7 August 2019 (UTC)

Me neither. Mpaa (talk) 12:53, 8 August 2019 (UTC)

Another madness[edit]

Hi, and greetings. I want to ask if these pages can be moved? If not, how do I go about getting the bot to do it. — Ineuw (talk) 01:11, 15 August 2019 (UTC)

Hi. I saw comments at Wikisource:Bot_requests#A_possible_bot_request, so it is unclear to me if this should be done or not. Waiting for clarifications.Mpaa (talk) 17:33, 15 August 2019 (UTC)
One more thing. New file at commons is 828 pages, your list is 830 pages. Is everything 1:1 except the last 2 pages or is there some shift to consider?Mpaa (talk) 17:39, 15 August 2019 (UTC)
I don't understand why user:billinghurst marked the job as manual, unless he read this post and removed it from the bot requests. Which means I made too many requests, here two, and one SQL request on the Quarry which is moot. So, let's wait and see.
The original file which should have been commons:File:The Conquest of Mexico (Bancroft)-Volume 1.djvu was 830 pages originally, but the page by page match was the same until page 827 so I removed two blank pages at the end to match the count. — Ineuw (talk) 21:31, 15 August 2019 (UTC)

Guide to the Bohemian exhibition again[edit]

Thanks again for fixing the problem with two pages at Index:Guide to the Bohemian section and to the Kingdom of Bohemia - 1906.djvu, which really helped me. Just now I have found out that there are two more pages missing completely (I did not notice it before, because there is an unnumbered page with a picture, due to which the fact that some pages were missing was not visible). The missing pages are no. 169 + the empty page before it. I have got only a pdf file of another copy of the book with the missing pages, which I have uploaded to https://drive.google.com/file/d/1icVy2acRT4V28LbZRije3qlO-Hm9o3yD/view?usp=sharing . Do you think you could extract the missing pages and add them to the djvu file?

Thank you very much. --Jan Kameníček (talk) 19:51, 16 August 2019 (UTC)

Hi. I have no tools to extract images from pdf. If you store the two image files, I can process them and add them to the djvu file.Mpaa (talk) 19:53, 16 August 2019 (UTC)
My software is pretty bad but finally I found out how to extract it. The empty page is at https://drive.google.com/file/d/1eElaYU7W_TbXdQN1NSKA_-b5KWqhEsbU/view?usp=sharing and the following page no. 169 is at https://drive.google.com/file/d/1LkKaETLQz2pNy5eIcWVArwFlPYKqqKck/view?usp=sharing . Thanks!!! --Jan Kameníček (talk) 20:08, 16 August 2019 (UTC)
Done.Mpaa (talk) 20:26, 16 August 2019 (UTC)
Oh, perfect, you are extremely quick! :-) Thanks very much!--Jan Kameníček (talk) 20:32, 16 August 2019 (UTC)

Automated repairs...[edit]

diff Around Line 24 - Automatable repair? If not I can do these by a manual check anyway. Doing this so I can greatly simplify the templates concerned.

The conversion at the start from {{nop}} to a and (LF/CR) at the start of the body is also a relatively simple fix to implement. As well as a related conversion in the footer. Again I can do these manually if needed (it will just take longer.) ShakespeareFan00 (talk) 14:04, 20 August 2019 (UTC)

No, line 24 it is not simple. I cannot reuse available tools.Mpaa (talk) 19:02, 20 August 2019 (UTC)
I figured that might be the case, Thanks for looking. I needed to manually check for typos anyway. ShakespeareFan00 (talk) 21:22, 20 August 2019 (UTC)