Wikisource:WikiProject OCR

Shortcut: WS:OCR

This project lets users request that scans be OCRed for Wikisource-related projects.

Instructions

The participants listed below are users who have access to some kind of OCR software and are willing to extract text from scanned documents.

Users who want a text OCRed should place their request under the Requests section in the following format:

[[Title of the book]] (year published) - Author. # of pages. [source where pages can be found]

Note: "year published" should be when it was published in the U.S. as this will make determining the copyright status easier.
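
As an illustration only, the request format above can be filled in programmatically. The sketch below is not part of the project's tooling; the function name `format_request` is hypothetical, and it assumes that "# of pages" means a page count followed by the word "pages". The sample values are taken from the current project listed further down the page, with a placeholder standing in for the source link.

```python
def format_request(title, year, author, pages, source):
    """Build one request line in the format described above.

    Assumption: "# of pages" is rendered as "<count> pages".
    """
    return f"[[{title}]] ({year}) - {author}. {pages} pages. [{source}]"


# Sample values; the source is a placeholder, not a real link.
line = format_request(
    title="Historical Library",
    year=1814,  # year of U.S. publication, per the note above
    author="Diodorus Siculus (trans. G. Booth)",
    pages=677,
    source="link to scans",
)
print(line)
```

The resulting line can then be pasted under the Requests section and signed as usual.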

While these are the general instructions for requesting that a work be OCRed, individual participants may have more specific instructions for the projects they take on.

Participants

Zhaladshar

Instructions

Preference given to:

  1. Smaller requests
  2. Requests where obtaining the scans is easier (such as downloading a ZIP file instead of having to access each scan and download them all individually)
  3. Works that are hard to find in text form elsewhere on the Internet
  4. Works that I do not proofread

I will work on only two large projects at a time (first come, first served) and will fit in smaller projects as I make time for them.

Current projects

Title | Year published | Author | Pages | Source | Completion
Historical Library | 1814 | Diodorus Siculus (trans. G. Booth) | 677 | | < 5%

Benn Newman

Instructions

Preference given to:

  1. Smaller requests
  2. Requests where obtaining the scans is easier (such as downloading a ZIP file instead of having to access each scan and download them all individually)
  3. Works that are hard to find in text form elsewhere on the Internet
  4. Works that I have not proofread

Current projects

World Revolution

Requests

Cyclopaedia, or Universal Dictionary of Arts and Sciences (on Wikipedia) (1728) - Ephraim Chambers. Seems to be about 1430 pages, according to the TOC. [1] --Rory096 02:59, 23 November 2006 (UTC)

Single European Act (on Wikipedia), a European Union treaty of 1986. It is quite short (29 pages) and is available in scanned PDF form. I've been looking for a text version for a while, but have never managed to find one. [2] Blue-Haired Lawyer (talk) 18:07, 21 December 2008 (UTC)

Vlas Mikhaĭlovich Doroshevich (w:ru:Дорошевич, Влас Михайлович), "The Way of the Cross" (translation by Stephen Graham, probably w:Stephen Graham (author)). The original Russian text is in the public domain (Doroshevich died in 1922). The book is in the public domain in the USA (printed in 1916). --EugeneZelenko (talk) 03:41, 23 July 2009 (UTC)

See also