User talk:Mattwj2002

Collaboration of the Week

The current Collaboration of the Week is collecting the works of...
Niccolò Machiavelli.

Last week: Thomas Carlyle: see the improvements!
The next scheduled collaboration will begin January 8th.


Wikisource has a number of active Wikiprojects that could use
your help in tackling these large additions to our library.


Dictionary of National Biography Project
Work: Dictionary of National Biography


The current Proofread of the Month is
The Passenger Pigeon  (1907)
by William Butts Mershon.

Last month: Handel
The next scheduled collaboration will begin in February.

Welcome to Wikisource!
Now that you're here, you're probably wondering...

Welcome! Thank you for joining Wikisource; we'd love for you to stick around and get more involved. We are a small community of approximately a hundred key people, with infinite help from random passers-by. You might be wondering which of the two classes we consider you...well, I guess that's going to be up to you.

You'll find we are our own little corner of the Wikimedia Foundation, free from all the drama, arguments and policy violations you may be used to seeing elsewhere. In fact, since we largely just republish exactly what others before us have already written, there is very little concern about "neutrality" for example. After all, if the text of a notorious speech is inflammatory and biased...wasn't that its purpose?

If you're looking for a specific topic, you'll likely find it by navigating through Wikisource:Works, whether it's Wikisource:Islam or Wikisource:Mermaids. For overarching categories, you might be better looking at something like Category:Poems or Category:Novels. Of course, if you know the author's name, that's easiest of all, just plug in "Author:Rudyard Kipling" and you'll see everything he ever wrote (or was written about him!).

Chances are, you have a favourite subject we don't cover very well...here's how to change that!

So, your favourite author or subject isn't very well represented on the project? Well, as long as you make sure the texts fit the standards of Public Domain, you can add them yourself! (Like all rules, those are basic guidelines; if you want to play with exceptions to the rule, just ask any of the administrators for help.)

If the text doesn't already exist, just enter its name below and it will pre-load an editing page for you to set to work! Be sure to add {{no header}} to the top of the page, and then include categories so people can find it.


If you can't think of any particular corners to improve on Wikisource, how about taking a look at Wikisource:Religious texts, Wikisource:Wars or Wikisource:Texts by Country for some ideas? Don't forget to list your contributions on those pages as well so others will find and read them in the future!

Reading when you want, how you want
Places to go, people to meet

Well, if you've clicked all the way to this tab, you might as well plan on spending a few more hours acquainting yourself with our massive library. It's not perfect, sometimes there's an occasional misspelling or you'll see a text sorted incorrectly. So help us out, let us know, or fix it yourself!

If you're bored and just wanting to grab a mop and bucket, then there are plenty of corners that need tidying. Works that need to be split into chapters, Works that need their licensing clarified, Works that need machine-read words corrected, Works that need page-numbers removed and Authors whose full names we don't know would all be a great place to start!

Help us out

Dombey and Son

Hi, Mattwj2002,

I noticed that you are working on Dombey and Son. Since it seems like it's your first time here, I recommend that you read our general guidelines for adding texts. It will only take a few minutes to do. Specifically, I was wondering if you'd mind going back and adding {{header}} to all the pages of Dombey and Son. As we are currently undergoing a massive project to add this template to every page on WS, it would really help the few of us out who are undertaking this initiative. Thanks for your interest in and contributions to Wikisource!—Zhaladshar (Talk) 21:15, 10 July 2006 (UTC)

Littell's Living Age

I haven't actually worked on this particular project, but I heard that the plain text they provide is low quality OCR. If you are not finding errors, however, that is great. The Making of America project has a huge inventory so it makes sense if some of it was done using older OCR and some other things are better quality.--BirgitteSB 12:53, 6 September 2006 (UTC)

cookbook

Hi, this can be uploaded onto Commons, as it is commons:Template:PD-1923. Categorise English DJVU files into commons:Category:Scanned English texts. Cheers, John Vandenberg (chat) 01:09, 29 May 2008 (UTC)

ah, I see I am too late. :-) John Vandenberg (chat) 01:10, 29 May 2008 (UTC)
The next step is to create Index:The Pilgrim Cookbook.djvu, and then User:JVbot can upload the djvu text. John Vandenberg (chat) 01:20, 29 May 2008 (UTC)

Wind in the Willows

congrats on finishing it! John Vandenberg (chat) 00:01, 5 June 2008 (UTC)

adminship

Pls accept at Wikisource:Administrators#Mattwj2002. --John Vandenberg (chat) 01:28, 11 June 2008 (UTC)

Congratulations

You now have a sysop flag!--BirgitteSB 00:33, 19 June 2008 (UTC)

Congrats! giggy (:O) 08:06, 19 June 2008 (UTC)

Final push for the Proofread of the Month...

This month's Proofread of the Month, Index:The Pilgrim Cookbook.djvu, is still a ways away from being fully validated. However, we're within striking distance.

If all ten members proofread just two (but preferably three) pages a day, we'll be able to finish the book before the end of the month.

We can do it. :) EVula // talk // 01:20, 24 September 2008 (UTC)

Index:Montesquieu - The spirit of laws.djvu

Hi Mattwj2002,

Thanks, yes, if you can upload this OCR I will be very happy :) - --Zyephyrus (talk) 23:21, 16 November 2008 (UTC)

Task ordering for the 'bot?

Hi, Matt. Thanks for your 'bot. It appears to be extremely useful. I am one of the participants on the DNB project. Billinghurst, by dint of heroic effort, found DJVU files for all 63 original volumes and got them into Wikisource somehow, and then asked your 'bot to work its magic on them: that's more than 20,000 pages. I notice that the 'bot managed to get through vol 1 while also working on vols 6 and 37, but it seems to have stopped.

Questions:

  1. What is the order in which the 'bot attacks these pages?
  2. Roughly how long will it take to grind its way to the end?
  3. Will I mess anything up if I create a text page "by hand" before the 'bot gets to it?

-Arch dude (talk) 02:52, 29 November 2008 (UTC)

Thanks for the reply. Fortunately, most of the volumes were not originally from Google Books and therefore do not have the problem you mentioned. I will avoid doing any pages "by hand" since these will cause you extra work. You mentioned "a few hours" per text, so at 63 texts, we are looking at at least a week: I can find useful stuff to do while I wait, no problem. -Arch dude (talk) 10:49, 29 November 2008 (UTC)
Thanks again. I'm not the uploader, and I have no clue as to how to upload the "right" files. Do you have some guidance? -Arch dude (talk) 11:27, 29 November 2008 (UTC)

I have a minor conundrum. You and your friendly 'bot are used to helping editors who intend to transcribe entire single books from start to finish. The DNB is not small, and the tiny group that is working on it is not working front-to-back. Instead, we tend to work on specific articles of interest based on "what is needed most." In particular, I started the formal project as a place to hold the originals for articles that we need over at Wikipedia. This means that I jump all over the place within the 63 volumes. Based on the work you are doing, I was going to suspend my manual transcriptions until you finished the 63 volumes, and then shift to using the pagescan/proofread/transclude scheme. Apparently, you do not intend to blindly forge through all 63 volumes: I'm guessing that this is because you need to do a certain amount of babysitting of the 'bot to get this to work, and your work will be directed at texts that someone will actually look at in the next month or two. So, my problem is: wait for the 'bot (and quit making forward progress on the Wikipedia project) or continue manual transcriptions, knowing that I'm increasing the number of articles that I will eventually need to convert to the pagescan/proofread/transclude scheme. If you have a guesstimate on when you and the 'bot will finish the transmogrification, I can make a better assessment of which makes sense. It does not matter how long you intend to take: I appreciate what you are doing in any event, but it might make my planning a bit simpler if I know your intentions. -Arch dude (talk) 03:31, 2 December 2008 (UTC)

Hi AD and Matt. AD, Matt is doing the volumes, and is now up to vol.19. He has been waiting for me to reload the image files as they needed to be degooglefied[1], so hopefully we are talking days to a couple of weeks, not weeks or months. If you want to keep track of where they are up to then please follow progress at Wikisource:WikiProject DNB/Djvu files, which I am updating on a daily basis. -- billinghurst (talk) 22:55, 2 December 2008 (UTC)

^ Matt .. FINISHED just now! :-)

Djvutxt or sed, what parameters applied

Matt, when the bot rips the text, which of the applications are you running, and what parameters are you applying? Running it on the Windoze box is giving higher character weirdness compared to what I am seeing of your operations. Now I could fiddle or I can ask. :-) -- billinghurst (talk) 00:22, 3 December 2008 (UTC)

DJVU

Hey mate, I'm hopeless with DJVUs - but I found a nice 33-page book I'd love to see one day make featured text, so would proof it all myself, if you could just help me out by uploading the DJVU for The Fight at Dame Europa's School currently at http://www.archive.org/stream/fightatdameeurop00pulluoft/fightatdameeurop00pulluoft.djvu ? I'll muddle around then and learn my way around updating it :) Sherurcij Collaboration of the Week: Author:Nostradamus‎. 01:34, 5 December 2008 (UTC)

Wrong replacement for DNB Vol 2

For reasons that will forever remain a mystery, volume 2 of the U of Toronto DNB at archive.org has a different naming convention from the other volumes. The one you have listed:

http://www.archive.org/details/dictionaryofnati02leesuoft

is actually volume 2 of the supplement.

The actual Volume 2 is at:

http://www.archive.org/details/dictionaryofnat02stepuoft

Note the missing "i" immediately before the "02".

There is also a set of volumes with names like:

http://www.archive.org/details/dictionaryofnati03stepuoft

The set is not complete. Note the "i" is there for all of these. -Arch dude (talk) 14:43, 7 December 2008 (UTC)


After investigation, I notice that all of the "lees" are supplements, not original volumes. I am unclear on the copyright status of the supplements after 1923. I made notes in your table. -Arch dude (talk) 14:53, 7 December 2008 (UTC)
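
A minimal sketch of the naming pattern described above, assuming it generalises beyond the three identifiers quoted in this thread. The regular expression and the `classify` helper are hypothetical, written only for illustration; they are not an archive.org API. The mapping of "lees" to supplements comes from the note above, while treating "step" as the marker for original volumes is an inference from the examples given.

```python
import re

# Pattern inferred from the identifiers quoted in this thread (an assumption,
# not a documented archive.org scheme): an optional "i", a two-digit number,
# then "lees" or "step", then "uoft".
IDENT_RE = re.compile(
    r"^dictionaryofnat(?P<i>i?)(?P<num>\d{2})(?P<editor>lees|step)uoft$"
)


def classify(identifier):
    """Return a dict describing a DNB identifier, or None if it does not match."""
    m = IDENT_RE.match(identifier)
    if m is None:
        return None
    # "lees" identifiers were found to be supplements; "step" is assumed
    # to mark the original volumes.
    kind = "supplement" if m.group("editor") == "lees" else "original"
    return {
        "kind": kind,
        "number": int(m.group("num")),
        "has_i": m.group("i") == "i",
    }
```

For instance, `classify("dictionaryofnati02leesuoft")` reports a supplement, while `classify("dictionaryofnat02stepuoft")` reports an original volume whose name lacks the "i".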

Hi, Matt. I notice that you have updated the table. What is the current plan (if any) for replacing the page scans? I would like to re-start my transcription effort. If there is no plan, I'll start using what we have, but I'll also keep a "private" copy of any pages I transcribe. That way I will be able to restore any page that you replace during your replacement effort: that should in turn allow you to eventually update to better OCRs without worrying about overlaying my work. Does this sound reasonable? -Arch dude (talk) 19:17, 14 December 2008 (UTC)

Arch dude, to answer your question, I am trying to get a list of texts to be uploaded here. These are going to be replacement texts of the best quality, without any missing pages, bad pages, or other problems. If you can help with this, I would really appreciate it. --Mattwj2002 (talk) 07:59, 15 December 2008 (UTC)
Sure! How can I help? -Arch dude (talk) 23:15, 15 December 2008 (UTC)

Hero!

Thanks for those and so promptly. Have a great New Year, and don't get caught being bad. :-) -- billinghurst (talk) 09:48, 31 December 2008 (UTC)

WikiMoney

Hey, following your (and everybody's) contributions to the discussion on the Scriptorium about buying books online specifically for Wikisource, I've created Wikisource:Purchases and request you all check it out; add books you see for sale anywhere online (not just eBay) that you'd like to see some collaborative interest on, and sign up to help on existing listings. Sherurcij Collaboration of the Week: Author:Bahá'u'lláh. 15:05, 29 January 2009 (UTC)

Moving pages

Hello,

Could you please move pages with your bot from Index:Principiaethican00mooruoft.djvu to Index:Principia Ethica 1922.djvu, see [1]. Thanks, Yann (talk) 12:04, 5 February 2009 (UTC)

DNB vol. 59 reload text

Matt. Referring to Wikisource talk:WikiProject DNB#Slippage, would you please get the bot to reload the text from the trimmed upload. Thx. -- billinghurst (talk) 01:41, 12 February 2009 (UTC)

Suck in text ...

As per your forthright instruction :-P Index:Highlights of Copyright Amendments Contained in the URAA Circular 38B Rev07-2006.djvu would appreciate it if its textual components could be united with its image. Thx. -- billinghurst (talk) 04:04, 15 February 2009 (UTC)

Tech head book <g>

Omnibuses and cabs : their origin and history

Re: The American Illustrated Medical Dictionary

So far as I can find, there are three editions on Google Books. However, none of them looks as though they will be particularly easy to pick up, as they all have the same types of scanning issues. BD2412 T 05:25, 1 April 2009 (UTC)

Index:A Brief History of Modern Philosophy.djvu

For Due. Watch him. -- billinghurst (talk) 04:31, 4 April 2009 (UTC)

I hate cabbage

Guinea pig food! -- billinghurst (talk) 03:23, 8 April 2009 (UTC)

Please bot

---^

Thanks. :) Jude (talk) 12:12, 10 April 2009 (UTC)

Index:What will he do with it.djvu and the question mark

Hi, after your name change I researched the question mark issue a little. In the book itself the question mark is missing on page 11, only to appear on page 13. The British Library uses a question mark most of the time, but not so when the title is cited in other works. amazon.co.uk is quite discordant on whether to use a question mark or not. Anyway, do what you please with this info.--GrafZahl (talk) 09:49, 13 April 2009 (UTC)

Copied the following reply from my talk page.--GrafZahl (talk) 17:01, 14 April 2009 (UTC)
Is it okay to leave it as it is now? I just saw page 11 and that is why I changed it. Please let me know what your input is on changing it. Thanks. --Mattwj2002 (talk) 01:41, 14 April 2009 (UTC)
I don't think it's that important whether we include the question mark or not (I did my little research because I thought you might be wrong, but the various sources are really undecided). Maybe it's best to leave it as it is now as I've started to proofread some pages.--GrafZahl (talk) 17:01, 14 April 2009 (UTC)

Re: alphabetisation

That's fine, unfortunately, it'll get nuked the next time I update it. :) I'll change the SQL to actually order the results instead of spitting them out randomly. Jude (talk) 06:55, 28 May 2009 (UTC)
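
For what ordering in the query might look like, here is a rough sketch using Python's built-in sqlite3 module. The `works` table, its `title` column and the sample rows are made up for illustration; the actual query and database behind the list are not shown on this page.

```python
import sqlite3


def fetch_titles(conn, ordered=True):
    """Return titles from the hypothetical `works` table.

    Without ORDER BY, SQL makes no promise about row order; adding it
    makes the result alphabetical (case-insensitive here via NOCASE).
    """
    sql = "SELECT title FROM works"
    if ordered:
        sql += " ORDER BY title COLLATE NOCASE"
    return [row[0] for row in conn.execute(sql)]


# Sample data for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE works (title TEXT)")
conn.executemany(
    "INSERT INTO works VALUES (?)",
    [("Wind in the Willows",), ("Dombey and Son",), ("Principia Ethica",)],
)
```

Calling `fetch_titles(conn)` returns the three titles alphabetically, so a regenerated list would no longer need sorting by hand.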

Index:Vairagyasatakam.djvu

Hello Matt, Can you pls run the OCR bot on this one? Regards, Nvineeth (talk) 10:58, 3 June 2009 (UTC)

Thanks a lot. --Nvineeth (talk) 14:47, 4 June 2009 (UTC)

For bot boy

Hey mate, would you be so kind to pop over to User talk:Capitalismojo and bot the three works that are linked to from that page. Many thanks. Hope that you enjoyed your long drive. -- billinghurst (talk) 14:24, 4 June 2009 (UTC)

And the thought about a Cheats and Walkthru for subpages and the like. Virtual beer in it for you. -- billinghurst (talk) 13:50, 5 June 2009 (UTC)

OCR button

Hello,

The OCR button is now enabled by default. You'll need to update your preferences, as it's been disabled for you. Sorry for the inconvenience. ThomasV (talk) 18:06, 8 July 2009 (UTC)

Page:Popular Science Monthly Volume 12.djvu/505

Hey Matt, when you have a moment, can you do some proofing on the Walter Bagehot sketch which is part of PSM Vol. 12. Thx. -- billinghurst (talk) 02:26, 3 August 2009 (UTC)

Popular Science

Well, I can't speak to others; but my main idea in joining was to familiarise myself with what articles are out there (even if not OCR/proofread yet) so that when a Collaboration of the Week deals with a science issue, I can run to Popular Science and proofread/OCR the three articles published in the magazine that deal with the subject. So on that vein, I would think having just a large "list of article titles" centralised somewhere would be very helpful - and I'd be willing to help. We could break it into three or four columns so it wasn't overly long. Sherurcij Collaboration of the Week: Author:Carl Linnaeus. 14:00, 11 September 2009 (UTC)

I haven't quite got the hang of DJVUs yet, but I just created the copyright page for each of the first ten issues; would appreciate if you could transclude the Indexes for them. Sherurcij Collaboration of the Week: Author:Carl Linnaeus. 09:59, 16 September 2009 (UTC)

doofus

This is the link that Mike gave you irc info

Cloak Request

I hereby officially request a cloak of Wikisource/Mattwj2002. --Mattwj2002 (talk) 15:33, 20 September 2009 (UTC)

DNB upload for vol. 9

If I'm understanding you correctly, this is just about the images (not the text, which remains the same). And the "old" djvus correspond to what is found at http://www.archive.org/details/dictionarynatio50stepgoog, for example in the "book reader" as well as the PDF download. NB that these Google-generated scans have numbers, here = 50, unrelated to the volume number, and the Commons page for some reason says "vol 48" on its link to archive.org; but the link of origin does go to the same place. With all that said, it does seem that the new scan is an improvement on some of the murkier pages, e.g. Page:Dictionary of National Biography volume 09.djvu/11 versus http://commons.wikimedia.org/w/index.php?title=File%3ADictionary_of_National_Biography_volume_09.djvu&page=11. And so if I have the right question in mind, the answer would be "yes, the older djvus can be deleted". Charles Matthews (talk) 12:57, 2 October 2009 (UTC)

Popular Science

As of today, I believe the first 30 volumes should each have their title page and copyright page proofread. I doubtless made some small errors with my wanton copy/paste/edit process, mixing up a date, or missing that both authors weren't credited in 1879, or that they changed address one issue earlier than I noticed...but just validate those when you get a chance if you can. Sherurcij Collaboration of the Week: Author:Carl Linnaeus. 18:35, 6 October 2009 (UTC)

No point sending the cash until we have somebody lined up in the United States to scan it within a few days of receiving it. Sherurcij Collaboration of the Week: Author:David Livingstone. 06:34, 15 October 2009 (UTC)
There is no webpage about it, you have to phone one of the scanning centres - of which you can find a list at the Texts division of archive.org Sherurcij Collaboration of the Week: Author:David Livingstone. 12:50, 15 October 2009 (UTC)
No response yet.
I only see "CHAPTER XLIII: 40 A.H.—Khariji conspiracy—'Ali assassinated—Mu'awiya escapes" on the Wiki copy, and on the AI copy - I don't see the issue? Sherurcij Collaboration of the Week: Author:Khwaja Kamal-ud-Din. 20:13, 20 October 2009 (UTC)

Can you check if a text is in the public domain for me?

Unfortunately not; it's published in 1927, and was renewed in 1954 (R138631).--Prosfilaes (talk) 10:38, 9 October 2009 (UTC)

A Greek English Lexicon of the New Testament

Zyephyrus, I think you're the only admin on the English Wikisource that knows Greek. I wanted to let you know about A Greek English Lexicon of the New Testament. Could you also please leave a message on the Greek Wikisource about this as well? I would let them know, but frankly it is Greek to me. Please leave a note on my talk page. Thanks. --Mattwj2002 (talk) 06:35, 10 October 2009 (UTC)

I can't write a text in Greek, Mattwj2002, only copy one that already exists, so I'm afraid that I can't be of any help. --Zyephyrus (talk) 10:13, 10 October 2009 (UTC)
  • I'm afraid that someone has been giving me more credit than I deserve! While I have had occasion to work with bits and pieces of Greek text, I am by no means capable of composing a sentence in Greek, and proofreading passages longer than that would soon overwhelm me. The work you cite is still mostly English, and to the best of my knowledge there has been no significant work done regarding how dictionaries are to be treated. That puts you on the ground floor for work about this important class of works. I apologize for being unable to be more helpful about Greek, but I do look forward to your input about dictionaries. Eclecticology - the offended (talk) 00:09, 12 October 2009 (UTC)

Glad to help

Hi, thanks for the welcome and glad to help when I can. Ineuw (talk) 02:07, 12 October 2009 (UTC)

More djvu-help?

Can you help me with another new old Swedish Bible I've found?

It's this pdf I need converted to djvu and the Google pages in the beginning removed.

A proper name for the file could be: "Bibelen eller den Heliga Skrift (1828).djvu". -- Lavallen (talk) 12:04, 2 November 2009 (UTC)

A third Bible (New Testament) of interest would be this. (Already a DjVu, but the Google pages have to be removed.) A proper name could be "Normalupplagan (1911).djvu". I will have busy days, filled with proofreading! :) -- Lavallen (talk) 16:20, 3 November 2009 (UTC)

Iran

Just one. Sherurcij Collaboration of the Week: Author:Khwaja Kamal-ud-Din. 03:12, 5 November 2009 (UTC)

The 5 volume purchase

Congrats for the 5 volume scan. I don't know if you prefer replies here, or on my talkpage. — Ineuw (talk) 02:13, 11 November 2009 (UTC)

Archaic spellings

Hi Matt; Just to keep you up to date, I posted an initial attempt to collect the archaic spellings extracted as of now, from PSM. Archaic spellings and names Have a nice day. Ineuw (talk) 16:02, 11 November 2009 (UTC)

1st Image in Volume 2

Hi Matt. How are you? I uploaded to the commons, this image I found in Volume 2, Page 8. Unfortunately, I can't make out the name. Could you please advise? I must insert it, in the image info on the commons and assign a category. Many thanks. Ineuw (talk) 01:10, 13 November 2009 (UTC)

Thanks. I will track it down. :-) How are you? Ineuw (talk) 01:25, 13 November 2009 (UTC)
You're welcome, Ineuw! I am doing well, thank you! Just about ready to do some online Christmas shopping. I am on IRC too if you'd like to chat. :) --Mattwj2002 (talk) 01:28, 13 November 2009 (UTC)

An archive.org book request

Hi Matt,

If and when it's possible, can we have this book scanned into Wikisource to be proofread? http://ia331317.us.archive.org/3/items/stormylifeoflasi007823mbp/ Thanks. Ineuw (talk) 21:50, 22 November 2009 (UTC)

DJVUs

I've proofread all three of the DJVUs you did for me, much thanks, one is validated and the other two are in queue. I hunted around for more short ones and found the following if you can spare a few minutes of time.

Sherurcij Collaboration of the Week: Author:Khwaja Kamal-ud-Din. 06:11, 28 November 2009 (UTC)

If that isn't short enough, how about Canadian Appeal for the Widows and Orphans of the South African War? Sherurcij Collaboration of the Week: Author:Thomas Carlyle. 05:41, 30 November 2009 (UTC)

Sleep

Hey, Matt. Sleep is the best. I hope you rested well. Within five minutes after my irc message, I was also soundly there. :-) — Ineuw (talk) 15:37, 2 December 2009 (UTC)

The elusive Volume 75

Hi Matt, it's been awhile since we connected. :-) Perhaps you can try the Robarts Library of the University of Toronto. They were/are one of the donors of archival to the IA. Another good library to approach may be the Toronto Public Library. — Ineuw (talk) 16:26, 7 December 2009 (UTC)

Thanks again

Thanks for the help last night. The index to Volume 1 of Science is up, and I've created the first 100 pages. Feel free to look it over for any obvious mistakes. I plan on proofing and developing the Table of Contents soon. Thanks again! --Clifflandis (talk) 20:05, 16 December 2009 (UTC)

What to do?

Hi Matt, when you get the chance, look at this page. Is there anything we can do about such scans? Page:Popular Science Monthly Volume 2.djvu/626 Sleep well. :-D — Ineuw (talk) 06:18, 18 December 2009 (UTC)

Proofreading documentation for PSM

Hi Matt, I've created this page about the proofreading process of PSM. Although it's based on The Popular Science Monthly, Volume 1, it's applicable to the style for decades to come. The idea is to introduce a standard for repeat titles, paragraphs, etc., and get the pages and page name structure finalized. I am writing a proposal for that separately.

Please look when you feel like it, and let me know what you think. Take care and sleep well. — Ineuw (talk) 03:33, 20 December 2009 (UTC)

PSM Volume 1 is proofread

Hi Matt, and first off, I missed you yesterday, and want to wish you a beautiful holiday season. Other than that the volume is proofread. I also completed some pages starting at the beginning and end:

Popular Science Monthly Volume 1/May 1872/The Study of Sociology I

Popular Science Monthly/Volume 1/Advertisements etc.

— Ineuw (talk) 00:00, 25 December 2009 (UTC)

Is it worth doing this?

Hi, Matt. Is it worth inserting images like this? If so, is it possible to eliminate the white frame? The table is already transparent. Here is the original page, Page:Popular Science Monthly Volume 1.djvu/802, resulting in Popular Science Monthly/Volume 1/Advertisements, bottom page.

Is there a possibility to fix the images so that they are not constrained by a visible frame and blend into the surroundings better?
This message was edited and updated from the original because the link to the page changed. — Ineuw (talk) 15:46, 2 January 2010 (UTC)