Wikisource:WikiProject OCR
From Wikisource
| WikiProject OCR |
| This project is for users to request for scans to be OCRed for various Wikisource-related projects. |
Contents |
[edit] Instruction
The participants listed below are users who have access to some kind of OCR software and are willing to extract text from scanned documents.
Users who desire for a text to be OCRed should place their request under the Requests section with the following format:
[[Title of the book]] (year published) - Author. # of pages. [source where pages can be found]
Note: "year published" should be when it was published in the U.S. as this will make determining the copyright status easier.
While these are the general instructions for requesting that a project be scanned, other users may have more specific instructions if they are to take on a project.
[edit] Participants
[edit] Zhaladshar
[edit] Instructions
Preference given to:
- Smaller requests
- Requests where obtaining the scans is easier (such as downloading a ZIP file instead of having to access each scan and download them all individually)
- Works that are hard to find in text form elsewhere on the Internet
- Works that I do not proofread
I will only work on two large projects at a time (they are first come, first serve) and will work smaller projects in the mix as I make time for them.
[edit] Current projects
| Title | Year published | Author | Pages | Source | Completion |
|---|---|---|---|---|---|
| Historical Library | 1814 | Diodorus Siculus (trans. G. Booth) | 677 | < 5% |
[edit] Benn Newman
[edit] Instructions
Preference given to:
- Smaller requests
- Requests where obtaining the scans is easier (such as downloading a ZIP file instead of having to access each scan and download them all individually)
- Works that are hard to find in text form elsewhere on the Internet
- Works that I have not proofread
[edit] Current projects
[edit] Requests
Cyclopaedia, or Universal Dictionary of Arts and Sciences (on Wikipedia) (1728) - Ephraim Chambers. Seems to be about 1430, according to the TOC. [1] --Rory096 02:59, 23 November 2006 (UTC)
Single European Act (on Wikipedia) a European Union treaty of 1986. It's quite short 29 pages a available in scanned PDF form. I've been looking for a text version for a while, but have never managed to find one. [2] Blue-Haired Lawyer (talk) 18:07, 21 December 2008 (UTC)
Vlas Mikhaĭlovich Doroshevich (w:ru:Дорошевич, Влас Михайлович) "The Way of the Cross" (translation by Stephen Graham, probably w:Stephen Graham (author)). Original Russian text in public domain (Doroshevich died in 1922). Book is public domain in USA (printed in 1916). --EugeneZelenko (talk) 03:41, 23 July 2009 (UTC)