Wikisource:Scan Lab/Archives/2021-10

Warning Please do not post any new comments on this page.
This is a discussion archive; the comments contained were likely posted both before and after the archive was first created.
See current discussion or the archives index.

Index:The Dial (Volume 75).pdf

Missing pages available at [1] Languageseeker (talk) 05:13, 8 October 2021 (UTC)

Done (rederived from a different scan, realised that was also missing pages and generally made a mess in the process). I think it's sorted now: Index:The_Dial_(Volume_75).djvu.
We might as well use the DJVU since if we rip the rest of the bunch, they'll come in as DJVUs too, most likely. Inductiveloadtalk/contribs 16:16, 8 October 2021 (UTC)
This section was archived on a request by: Inductiveloadtalk/contribs 16:16, 8 October 2021 (UTC)

The fables of Aesop, as first printed by William Caxton in 1484

Notifying all members of Scan Lab (more info · opt out): (User:Inductiveload, User:Xover, User:Mpaa) Greetings, everyone. This scan is missing two pages after page 26: the first one is the end of the Preface, the second one is a chart. They can be found in here and also here. Unfortunately, the file already has proofreading. Can something be done? thanks in advance. —Genesis Bustamante (talk) 17:23, 9 October 2021 (UTC)

@Genoskill Done. Existing pages shifted to match the new scan. ^_^ Inductiveloadtalk/contribs 19:56, 9 October 2021 (UTC)
@Inductiveload: Thank you, very much! —Genesis Bustamante (talk) 20:31, 9 October 2021 (UTC)
This section was archived on a request by: —Genesis Bustamante (talk) 20:31, 9 October 2021 (UTC)
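The "existing pages shifted" step in the reply above comes down to simple position arithmetic. A minimal sketch, assuming positions are 1-based scan positions (not printed page numbers) and that the missing pages go in as one contiguous run; the function names are hypothetical and not part of any Wikisource tool:

```python
def shifted_position(old_pos: int, insert_after: int, n_inserted: int) -> int:
    """Return where a previously existing page ends up in the repaired scan."""
    if old_pos < 1 or insert_after < 0 or n_inserted < 0:
        raise ValueError("positions are 1-based and counts must be non-negative")
    # Pages at or before the insertion point keep their position;
    # everything after it moves down by the number of inserted pages.
    if old_pos <= insert_after:
        return old_pos
    return old_pos + n_inserted


def move_plan(last_pos: int, insert_after: int, n_inserted: int) -> list[tuple[int, int]]:
    """List (old, new) pairs, highest position first.

    Working from the end backwards means no move lands on a page
    that has not been moved out of the way yet.
    """
    return [
        (pos, shifted_position(pos, insert_after, n_inserted))
        for pos in range(last_pos, insert_after, -1)
    ]


# Two pages inserted after position 26: old 27 becomes 29, old 26 stays put.
print(shifted_position(27, insert_after=26, n_inserted=2))
print(move_plan(last_pos=30, insert_after=26, n_inserted=2))
```

Stating the insertion position in the request lets the person repairing the scan check this arithmetic instead of rediscovering it.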

Two requests from MarkLSteadman

This section was archived on a request by: --Xover (talk) 15:19, 17 October 2021 (UTC)

The Origin of Continents and Oceans (1924)

This section was archived on a request by: Inductiveloadtalk/contribs 08:55, 22 October 2021 (UTC)

creation of djvu from scans?

Notifying all members of Scan Lab (more info · opt out): (User:Inductiveload, User:Xover, User:Mpaa) There wasn't a heading for my question, so I decided "having page scans" is close.

If page scans of a book are supplied (perhaps in PNG format or TIF) can those be turned into DJVU here? And if so, what is the ideal format and if that is not available, what would be a second (and maybe third) choice (png, tiff, xcf?)

Also, I just read the instructions and am sorry that I have not closed any of my previous requests....--RaboKarbakian (talk) 19:02, 8 October 2021 (UTC)

@RaboKarbakian: Yes. We can build a DjVu file with an OCR text layer from scan images. Images can pretty much be any file format and we'll figure it out, but it's important that the images are as high-resolution as possible. It is far better to get whatever format your scanning process produced than to have them re-encoded to a different format after the fact (every re-encoding loses quality, and we'll have to re-encode at least once for the DjVu). If your scanning software offers you options, TIFF, JPEG, and PNG are all good options for format, just so long as the encoding / quality settings are reasonable. For example: JPEGs at "80%" quality are often plenty good enough, and higher values have rapidly diminishing returns in most cases. XCF is not a good format for this purpose.
If you're scanning yourself you should also either make sure the images are cropped just inside the page border (no black edge around the page, and the paper edge should just be cropped out, but the gutter retained), or you should make sure the page is in the same position within the image for every page. We can bulk crop a series of images, but in almost all cases that requires giving a set of fixed pixel coordinates to apply to each image in the series. If using a camera (vs. a flatbed) also try to make each page as flat as possible, since OCR has trouble with text that isn't on a straight line. Xover (talk) 06:25, 9 October 2021 (UTC)
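To illustrate what "a set of fixed pixel coordinates to apply to each image" means in practice, here is a rough sketch that only builds the command lines for a bulk crop. It assumes ImageMagick is installed and that its `convert` command is used (newer installs may call it `magick`); the file names, crop size and offsets are made-up values, and this is not necessarily how Scan Lab members actually do it:

```python
from pathlib import PurePosixPath


def crop_commands(images, width, height, x_off, y_off, out_dir="cropped"):
    """Build one ImageMagick crop command per image, all with the same box.

    The geometry string WxH+X+Y keeps a width x height region whose
    top-left corner is at pixel (x_off, y_off). "+repage" drops the
    leftover canvas offset so the output is a plain image of that size.
    """
    geometry = f"{width}x{height}+{x_off}+{y_off}"
    commands = []
    for name in images:
        src = PurePosixPath(name)
        dst = PurePosixPath(out_dir) / src.name
        commands.append(
            ["convert", str(src), "-crop", geometry, "+repage", str(dst)]
        )
    return commands


# Hypothetical file names and coordinates, purely for illustration.
for cmd in crop_commands(["scans/p001.tif", "scans/p002.tif"], 2400, 3600, 150, 90):
    print(" ".join(cmd))
```

Because the same box is applied to every file, this only works well when the page sits in the same place in each image, which is why consistent positioning matters when scanning.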
@Xover: commons:Category:Goblin Market (Rackham) 45 files. TIFF, with LZW. 600ppi. And the text is straight even if the page edges are not. I did not want to hurt the book and I was confused about the size of the scanner glass and the scan area, so many of the edges are not really there and some were reconstructed. Various sizes but within a certain error which I have not calculated. The TIFF are as large or larger than the PNG but they were made in a fraction of the time it took to make PNG, so TIFF it is. The scans were to PNG though. I have been dealing with some artifacts on another project and did not want to have them in this one also. libtiff requires libjpeg to build....
At one time, I had a script that would download a whole category of images -- sure wish I had that now to give to you. Let me know if you would like me to rewrite that.... And be quick with the ping if you need anything else!! I surely thank you greatly for this--RaboKarbakian (talk) 04:33, 10 October 2021 (UTC)
@RaboKarbakian: According to enwp, Goblin Market illustrated by Arthur Rackham was first published in 1933 in London. Rackham died in 1939, so his pma. 70 UK term of copyright didn't expire until 2010. Being in copyright in its country of origin on the URAA date (January 1, 1996 for the UK) its US copyright would have been restored to a pub. + 95 year term, which will not expire until 2029. In other words, going by these the scan cannot be hosted either here or on Commons until 2029. Xover (talk) 14:00, 18 October 2021 (UTC)
Thanks for getting back with this. PseudoSkull mentioned something about "being printed in the same two weeks" and that being difficult to prove. I had an idea (about efficiency) that this had been printed at the same time, in Edinburgh, just for efficiency and costliness of setting up the printer a second time. My foray in the "difficult to prove part"; it is also difficult to disprove....--RaboKarbakian (talk) 14:14, 18 October 2021 (UTC)
@RaboKarbakian: If a work was published in the US within 30 days of being published in another country, it is ineligible for restoration under the URAA. If that is the case its US copyright status will depend on whether it met all the formal requirements for copyright protection that were in effect at the time of publication. Simplifying down to rule-of-thumb level, that means it had to have a visible copyright notice and a renewal filed with the copyright office in the 28th year after first publication.
Proving publication within 30 days is hard, but you can often get a reasonable level of certainty by looking for advertisements, books received, or reviews that pinpoint the publication in the respective countries. Xover (talk) 14:52, 18 October 2021 (UTC)
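The date arithmetic in the two replies above can be written out as a small sketch. It encodes only the rule-of-thumb figures quoted in this thread (pma. 70, publication + 95, the 30-day window, and 1 January 1996 as the URAA date for the UK); the function names are hypothetical, terms are assumed to run to the end of the calendar year, and real copyright determinations have more conditions than this:

```python
from datetime import date

URAA_DATE_UK = date(1996, 1, 1)  # URAA date for the UK, as stated in the thread


def pma_free_year(death_year: int, term: int = 70) -> int:
    """First calendar year a work is free under a life + term rule."""
    # The term runs to the end of the year, so the work is free from 1 January after.
    return death_year + term + 1


def us_restored_free_year(pub_year: int) -> int:
    """First calendar year a work is free under a publication + 95 year term."""
    return pub_year + 95 + 1


def in_copyright_on_uraa_date(death_year: int) -> bool:
    """True if a pma. 70 work was still protected at home on the URAA date."""
    return pma_free_year(death_year) > URAA_DATE_UK.year


def published_in_us_within_30_days(first_pub: date, us_pub: date) -> bool:
    """True if US publication followed first publication by at most 30 days."""
    return 0 <= (us_pub - first_pub).days <= 30


print(pma_free_year(1939))              # 2010
print(in_copyright_on_uraa_date(1939))  # True
print(us_restored_free_year(1933))      # 2029
```

With the figures given above (death in 1939, first publication in 1933) this reproduces the 2010 and 2029 dates. The 30-day check needs actual publication dates in both countries, which is exactly the part that is hard to establish.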
@Xover: I'm going to close this and have those tif deleted. Just this one question tho': Is this where to put other similar (with the exception of being provably legal) requests?--RaboKarbakian (talk) 16:34, 20 October 2021 (UTC)
@RaboKarbakian: Yes, indeed. The Scan Lab was set up specifically to ask for various kinds of help with scans. Xover (talk) 05:44, 22 October 2021 (UTC)
This section was archived on a request by: RaboKarbakian (talk) 13:43, 21 October 2021 (UTC)

Index:Spencer - The Shepheardes Calender, conteining twelue æglogues proportionable to the twelue monethes, 1586.djvu

Notifying all members of Scan Lab (more info · opt out): (User:Inductiveload, User:Xover, User:Mpaa) This is missing two pages that I know of. Two pages that should go between page 27 and 28. While not the same book, the missing pages match perfectly from https://archive.org/details/shepheardscalend00spenc The missing pages are:

for page 28 and
for page 29.

I don't know about the rest of the scan; it is like another language and the missing pages had an image on them which was a big clue that there was a problem. If you would prefer to just upload the other text, I would be perfectly happy making whatever changes to make the already proofed text match. --RaboKarbakian (talk) 00:29, 13 October 2021 (UTC)

Done and pages shifted. No changes should be needed (other than proofreading the pages) unless there are transclusions I haven't seen. Inductiveloadtalk/contribs 20:55, 19 October 2021 (UTC)
Thanks!! --RaboKarbakian (talk) 14:00, 20 October 2021 (UTC)
This section was archived on a request by: --Xover (talk) 05:45, 22 October 2021 (UTC)

File:Draft Constitution of the Republic of the United States of Indonesia.pdf

Notifying all members of Scan Lab (more info · opt out): (User:Inductiveload, User:Xover, User:Mpaa) The pages in this scan are doubled up, could they be split, please? TE(æ)A,ea. (talk) 19:39, 19 October 2021 (UTC)

Doing… Inductiveloadtalk/contribs 19:43, 19 October 2021 (UTC)
Done: Split, dewarped and OCR'd: Index:Draft Constitution of the Republic of the United States of Indonesia.djvu. Inductiveloadtalk/contribs 20:41, 19 October 2021 (UTC)
This section was archived on a request by: --Xover (talk) 05:46, 22 October 2021 (UTC)
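For readers wondering what splitting "doubled up" pages involves, the core of it is computing two crop boxes per scanned spread. A minimal sketch, assuming the gutter sits exactly at the horizontal centre of each image (real scans usually need a per-file or detected offset); the function name is hypothetical and this is not a description of the tooling actually used for the file above:

```python
def split_boxes(width: int, height: int) -> tuple[tuple[int, int, int, int], tuple[int, int, int, int]]:
    """Return (left, upper, right, lower) crop boxes for the two halves of a spread.

    Boxes use the common image-library convention where the right and
    lower edges are exclusive, so the two halves never overlap.
    """
    if width < 2 or height < 1:
        raise ValueError("image is too small to split")
    middle = width // 2  # assumed gutter position
    left_page = (0, 0, middle, height)
    right_page = (middle, 0, width, height)
    return left_page, right_page


# A 3000 x 2000 pixel spread becomes two 1500 x 2000 pixel pages.
print(split_boxes(3000, 2000))
```

Each box can then be passed to whatever cropping tool is in use, after which the pages are dewarped and OCR'd as described in the reply.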

File:Code Revision Commission v. Public.Resource.Org, Inc. (F.Supp.3d).pdf and File:Code Revision Commission v. Public.Resource.Org, Inc. (F.3d).pdf

The following discussion is closed:

Resolved by User:Xover

Notifying all members of Scan Lab (more info · opt out): (User:Inductiveload, User:Xover, User:Mpaa) Please do the following for these scans:

For the first one, please remove the first page (which relates to the whole volume), and censor the following: the West Key logos (on pp. 1350 and 1361); the syllabus and headnotes to the case (from “background” on p. 1350 to just above the rule at the top of the second column on p. 1352); and the syllabus and headnote to the following case on p. 1361. For the second one, please remove the first two pages (once again, relating to the volume) and censor the following: the West Key logos (on pp. 1229 and 1255); the syllabus and headnotes to the case (from “background” on p. 1229 to just above the rule at the bottom of the first column on p. 1231); and the syllabus to the following case (on p. 1255).

Thank you. The old file/file version will need to be deleted, as the above-mentioned materials are copyrighted. TE(æ)A,ea. (talk) 15:43, 21 October 2021 (UTC)

@TE(æ)A,ea.: Done File:Code Revision Commission v. Public.Resource.Org, Inc. (F.Supp.3d).djvu and File:Code Revision Commission v. Public.Resource.Org, Inc. (F.3d).djvu. --Xover (talk) 08:51, 22 October 2021 (UTC)
This section was archived on a request by: Inductiveloadtalk/contribs 10:45, 12 November 2021 (UTC)

File:COMBAT1.tif, File:COMBAT2.tif, and File:COMBAT3.tif

Notifying all members of Scan Lab (more info · opt out): (User:Inductiveload, User:Xover, User:Mpaa) Please collate these scans into one file, preferably not a multi-page TIF. In addition, please remove the last page of the last scan, as that was made to show the text on that page. (Also, rotate the actual last page of the last scan; that was rotated incorrectly, for some reason.) TE(æ)A,ea. (talk) 20:15, 21 October 2021 (UTC)

Done see File:COMBAT.djvu. Mpaa (talk) 21:01, 21 October 2021 (UTC)
This section was archived on a request by: --Xover (talk) 05:47, 22 October 2021 (UTC)

File:The collected works of Henrik Ibsen (Volume 5).djvu

The following discussion is closed.

This section was archived on a request by: Inductiveloadtalk/contribs 09:18, 12 November 2021 (UTC)

Scan Repair for The Elizabethan stage (Volume 1).pdf

The following discussion is closed:

Resolved

The scan is missing pages xviii and xix after Page:The Elizabethan stage (Volume 1).pdf/25. Would it be possible to insert the missing pages from [3]? Languageseeker (talk) 19:35, 25 October 2021 (UTC)

@Languageseeker Is the scan otherwise intact? Inductiveloadtalk/contribs 20:13, 25 October 2021 (UTC)
@Inductiveload: I think so. Languageseeker (talk) 20:23, 25 October 2021 (UTC)
@Languageseeker then it is Done! Inductiveloadtalk/contribs 20:37, 25 October 2021 (UTC)
@Inductiveload: Thank you! Queued up the match-and-split. Languageseeker (talk) 20:39, 25 October 2021 (UTC)
This section was archived on a request by: Inductiveloadtalk/contribs 10:45, 12 November 2021 (UTC)

File:On the border with Crook (IA onborderwithcroo00bourrich).pdf

After Page:On the border with Crook (IA onborderwithcroo00bourrich).pdf/227, pages 196 and 197 need to be inserted from [4]. Many Thanks! Languageseeker (talk) 22:11, 25 October 2021 (UTC)

Done
By the way, when requesting page insertions, please always include the position the missing pages should be inserted at, as otherwise the repairing user has to figure it out for themselves. If you have already confirmed it, it allows that user to double-check their maths. Durrr you did this. Sorry! Inductiveloadtalk/contribs 19:13, 11 November 2021 (UTC)
@Inductiveload: Could you also insert pg 490 and 491 after Page:On the border with Crook - Bourke - 1892.djvu/527 from [5]. Sorry, did not notice earlier. Languageseeker (talk) 00:27, 12 November 2021 (UTC)
Done Inductiveloadtalk/contribs 07:19, 16 November 2021 (UTC)
@Inductiveload: I think that you might have forgotten to upload the file to Commons. :) Languageseeker (talk) 14:55, 16 November 2021 (UTC)
Odd, I must have closed the tab or something. Anyhoo, it's there now. Inductiveloadtalk/contribs 16:41, 16 November 2021 (UTC)
Thank you! Languageseeker (talk) 11:26, 23 November 2021 (UTC)
This section was archived on a request by: Languageseeker (talk) 11:26, 23 November 2021 (UTC)

File:Travels through the states of North America, and the provinces of Upper and Lower Canada, during the years 1795, 1796, and 1797 (IA travelsthroughst01weld).pdf

After Page:Travels through the states of North America, and the provinces of Upper and Lower Canada, during the years 1795, 1796, and 1797 (IA travelsthroughst01weld).pdf/25, pages xviii and xix need to be inserted from [6]. Many Thanks! Languageseeker (talk) 22:11, 25 October 2021 (UTC)

Done: Index:Travels through the states of North America - Weld - 1799 - Volume 1.djvu and Index:Travels through the states of North America - Weld - 1799 - Volume 2.djvu Inductiveloadtalk/contribs 16:59, 16 November 2021 (UTC)
This section was archived on a request by: Inductiveloadtalk/contribs 08:48, 3 December 2021 (UTC)