Talk:A Treatise on Electricity and Magnetism

From Wikisource
Jump to navigation Jump to search

Various[edit]

This looks like a great project! I personally like to disregard the Volume notations when breaking things up by chapter as it hardly matters in digital texts. Also it would break the index up between volumes and force them all to be treated as entirely seprate works as far as the site syntax. Whereas if this page was moved to A Treatise on Electricity and Magnetism and all chapters where done as A Treatise on Electricity and Magnetism/Part I/Chapter I etc it is created as one work with one index. Right now with the current setup you are only allow the the relative linking features to be partially used. Does this make any sense?--BirgitteSB 19:51, 2 June 2006 (UTC)[reply]

It does make sense. However, I'm not the person putting up the texts, so I do not know if volume II starts with Part I or Part III, i.e. is each volume number serially or sequentally. I'll point Red Rooster to this so that he can check. I was debating whether to include the "Parts" (i.e. Part I and Part II) in the relative linking. It doesn't seem useful if there's no introductory text for the Parts, which there might be. Red Rooster, can you check this? Or I can check the pages Red Rooster's put up so far and check for Part I. - illy 20:19, 2 June 2006 (UTC)[reply]

If there is such a reason to include the Volumes make sure they are done with the relative linking and it will still work. One of my reasons for disliking them is how long it gets to do A Treatise on Electricity and Magnetism/Volume I/Part I/Chapter I. If there are only small prefaces for some of the divisions that are otherwise arbitary we can still leave them out of the relative scheme and just enter the text as A Treatise on Electricity and Magnetism/Preface to Foo in the proper order on the title page.--BirgitteSB 20:32, 2 June 2006 (UTC)[reply]
I checked "Part I" and there is no introductory text. Unfortunately, in each part the Chapters start numbering at 1, so you'd end up with multiple A Treatise on Electricity and Magnetism/Chapter 1. So the "Parts" will at least need to be referenced. Part of the structure will have to be determined after looking at the text. If the Volumes number the Parts like the Parts number the Chapters, we'll need to capture that somehow. - illy 20:46, 2 June 2006 (UTC)[reply]
And BTW, I do agree that it would be nice if we can get both volumes related. I do understand why that is desireable. - illy 20:51, 2 June 2006 (UTC)[reply]

So far, so good[edit]

I think at this stage the page titles are good like they are. We should read and correct the OCR-Text at first. If the texts are corrected, we should move it and glue them together in the chapters which are already created on the index page. Red Rooster 20:35, 2 June 2006 (UTC)[reply]

It is fine how you have the scanned pages set-up now. That is not how most people will be accesing this work in the end. My above comments are concened with the syntax used on this title page illy set-up. You see when pages use / in the title; they become related which is impotant for many reasons. But now the way things are set-up Volume I and Volume II will not be related when they are put together according to this title page. Since they are really just one work they really should be related. I am not good at explaining why, but it is important. I will try and get someone who understands it better to come comment tonight. Keep on as you are with the scanned pages, without worry. This just needed to be settled before we fill in the pages on this title page.--BirgitteSB 20:46, 2 June 2006 (UTC)[reply]
Yes, definiately continue going as you are. What Birgitte and I are discussing needs to hashed out before we start moving the proof-read text into the final pages, but the other work can continue. - illy 20:48, 2 June 2006 (UTC)[reply]

I understand why the plain text pages should be ordered by title, because the search-engine (and auto-navigation too) both need a good seperation of the different book parts. I could need some help to set up page 101 to the last page 489, because this is a real copy-job. You just have to insert the template in the pages and type in the numbers. I stopped at page 100 and will continue later. Another point on this manuscript are the formulas, indexes and pictures. My OCR-Tool (OmniPage SE) is not really good in scanning indexes and formulas. Is there anybody out who can type in the formulas and math expressions in the Wikipedia-LaTex-Language (I have not really an idea how this works!)? Red Rooster 23:19, 2 June 2006 (UTC)[reply]

Process?[edit]

Hi, I'm somewhat at loss concerning the process of digitising these tomes. I see a lot of scanned pages, some with OCR'd text, some without. Some text was already moved into the future article structure but still has OCR errors... huh? Basically I have two questions:

  • Is there any list/indicator as to which pages have not been OCR'd yet and which pages have been OCR'd but not yet been proofread by humans?
  • Is there any guideline what to do with a page once I OCR/proofread it?

Thanks!--GrafZahl 20:49, 16 June 2006 (UTC)[reply]


Until now there is nothing like an indicator if the Pages are proof-readed or not. If you can, please help me to indicate this sites. Red Rooster 10:30, 28 July 2006 (UTC)[reply]

I've created some category structure at Category:A Treatise on Electricity and Magnetism. Also, I've started to write a page for newcomers wanting to help.--GrafZahl 14:01, 28 July 2006 (UTC)[reply]

Page progress tracker[edit]

Information about this edition
Edition: Oxford Clarendon Press, 1873.
Source: scan, partially OCR
Contributor(s): Red Rooster (initial upload, some OCR dumps)

(note: this list is incomplete atm--GrafZahl (talk) 10:18, 12 August 2007 (UTC))[reply]

Level of progress: incomplete

(see below for more information)

Notes:
Proofreaders: For easier tracking, progress is recorded on a per page basis below. Once all pages have reached the 100% level, the individual proofreaders will be listed here.

(the following tables are incomplete atm--GrafZahl (talk) 15:35, 10 August 2007 (UTC))[reply]

Volume I[edit]

page icon progress proofreaders
001 complete, formatted, proofread once GrafZahl
002 complete, formatted, proofread once GrafZahl
003 complete, formatted, proofread once GrafZahl
004 blank page GrafZahl
005 complete, formatted, proofread once GrafZahl
006 complete, formatted, proofread once GrafZahl
007 complete, formatted, proofread once GrafZahl
008 complete, formatted, proofread once GrafZahl
009 complete, formatted, proofread once GrafZahl
010 complete, formatted, proofread once GrafZahl
011 complete, formatted, proofread once GrafZahl
012 complete, formatted, proofread once GrafZahl
013 complete, formatted, proofread once GrafZahl
014 complete, formatted, proofread once GrafZahl
015029 complete, partially proofread (once). These are the TOC pages. They need to be wikilinked to the actual contents. This should be done when everything else is at 75% or more. Some of these pages are erroneously classified as being at the 50% quality level. GrafZahl (only those pages which are in Category:A Treatise on Electricity and Magnetism: Proofread pages)
030 blank page GrafZahl
031 complete, formatted, proofread once GrafZahl
032 blank page GrafZahl
033 Errata; have been proofread but need to be formatted. Some thinking is required as to how these errata should be presented. For example, should we present each erratum on the page where the error occurred? Martin Gradwell [1]
034 more errata none
035 complete, formatted, proofread once GrafZahl
036 complete, formatted, proofread once GrafZahl
037 complete, formatted, proofread once GrafZahl
038 complete, formatted. Essentially proofread, but one letter in a formula is hardly readable on the scan. Someone will have to check the original. GrafZahl (except one unreadable letter)
039 complete, formatted, proofread once GrafZahl
040 complete, formatted, proofread once GrafZahl
041 complete, formatted, proofread once GrafZahl
042 complete, formatted, proofread once GrafZahl
043 complete, formatted, proofread once GrafZahl
044 complete, formatted, proofread once GrafZahl
045 complete, formatted, proofread once GrafZahl
046 complete, proofread once; formatting is complete up to a missing wikilink GrafZahl
047 complete, formatted, proofread once GrafZahl
048 complete, proofread once; uses obsolete template, needs to be formatted Acidbass [2]
049 complete, proofread once; uses obsolete template, needs to be formatted Acidbass [3]
050 complete, proofread once; uses obsolete template, needs to be formatted Acidbass [4]
051 complete, wrongly classified as 75%, initial formatting by ThomasV needs some final touches None
052 complete, initial formatting by ThomasV needs some final touches None
053 uncorrected OCR dump None
054 complete, initial formatting by ThomasV needs some final touches None

Volume II[edit]

page icon progress proofreaders
001 complete, formatted, proofread once GrafZahl
002 complete, formatted, proofread once GrafZahl
003 complete, formatted, proofread once GrafZahl
004 blank page GrafZahl

Google Books[edit]

The google books that I can see are "snippet only": [5] or "Preview only": vol. 1: [6] [7], vol. 2: [8] [9]. Maybe an US citizen will have better results, and be able to access the Google Books text layer. John Vandenberg (chat) 07:33, 7 June 2008 (UTC)[reply]

djvu files on archive.org[edit]

As we have only ~100 pages of text for each volume in the current pagescans, I think it is worth moving to these DJVU files, where User:JVbot can upload the djvu text layer. Alternatively I could try to upload the text from the djvu files to the individual images. John Vandenberg (chat) 09:32, 7 June 2008 (UTC)[reply]

The DJVU files have been set up: Index:A Treatise on Electricity and Magnetism - Volume 1.djvu and Index:A Treatise on Electricity and Magnetism - Volume 2.djvu. John Vandenberg (chat) 11:01, 3 July 2008 (UTC)[reply]

The Volume 1 djvu that I uploaded has two missing pages between xxiii and xxvi - these correspond to xxiv and xxv. John Vandenberg (chat) 07:03, 5 July 2008 (UTC)[reply]

this is annoying, because page numbering will be wrong. perhaps you can createe and upload a djvu that has the missing pages as blank pages, so that if we find them later and upload a new version of the file, we will not screw up all the numbering ? ThomasV 07:11, 5 July 2008 (UTC)[reply]
I've paused the process for the moment. John Vandenberg (chat) 07:31, 5 July 2008 (UTC)[reply]
I think I can insert the pages... John Vandenberg (chat) 07:43, 5 July 2008 (UTC)[reply]
I've inserted the old images into the djvu, and I've checked that there are no further surprises in volume I, so I am ready to upload a new version of the DJVU. I'll start inspecting Volume II to give you a little time to verify that uploading a new version of Volume I is the right solution.
There are extra pages in the djvu after Page:A Treatise on Electricity and Magnetism - Volume 1.djvu/483, which equates to Image:A Treatise on Electricity and Magnetism Volume 1 479.jpg (still an offset of 4 pages). John Vandenberg (chat) 08:11, 5 July 2008 (UTC)[reply]
I've uploaded a new copy of the djvu, so now p30 and p31 are the old images inserted in the correct spots. I'n now in the process of fixing up the 20 or so text-pages which are now on the wrong image-pages. John Vandenberg (chat) 10:19, 5 July 2008 (UTC)[reply]

Migration complete. There was very little in the way of transcription for Vol. II. The remaining pages will be automatically created when someone gets around to it: Wikisource:Transcription_Projects#DJVU_files_with_a_text_layer. John Vandenberg (chat) 11:44, 5 July 2008 (UTC)[reply]

order[edit]

I've found another difference between the two. The old set had a erratta after the title page, whereas the new djvu has the erratta before the title page:

29 Table of contents 35 Table of contents
30 Both then have a blank page. 36
31 title page 37 Errata
32 blank page 38 Errata
33 Errata 39 title page
34 Errata 40 blank page

Any idea which order is correct? John Vandenberg (chat) 10:19, 5 July 2008 (UTC)[reply]

Template:A Treatise style[edit]

I wrote a Template:A Treatise style to manage page style from outside the chapters code, since both prose class and leftalign class are too small for some long formulas.

It simply writes a div tag with left and right margin and indent. Default values are 10em, 10em, 1em, but they can be changed into the template itself. I added a font-size:0.9pc parameter and I'm happy from the result: the font size of TeX png is now very similar to that of the main text and much of graphic derangement has gone. Please tell your opinion.

Please stop me if any of you think I'm a little too bold. ;-) --Alex brollo 20:58, 6 July 2008 (UTC)[reply]

The template makes the book very narrow, both for display and for printing. It is unclear to me what exact problem the template is trying to solve. You mention some long formulas, but what about them? Can you link to the pages that show the problems with the long formulas? Can the problems with the long formulas be described in detail to be understandable and possibly solved in another way? --Dan Polansky (talk) 16:29, 22 April 2009 (UTC)[reply]