TheMorganReport:Community Portal
Equipment
- Windows XP box with 2GB RAM, AMD Athlon XP 2700+
- Plustek OpticBook 3600
Software
- Adobe Acrobat Professional 7.0
- Adobe Photoshop CS 8.0
- Abbyy FineReader 8.0 Professional
- OpticBook 3600 driver
Procedure
scan pages from opticbook 3600
300dpi, grayscale
batch rename pages
from "Image xxxx.jpg" to "xxxx-xxxx.jpg" to reflect page numbers
batch resize pages
using photoshop automation, save for web, jpeg low settings, and 38% scaled
wiki upload
batch upload jpg files to wiki
- Description: Reports of Committee on Foreign Relations 1789-1901 Volume 6 pp<xxx>-<xxx>
wiki stubs
- Navigate to page in wiki, and put in stub code
- for first page, make previous=Main Page, for last page, make next=Main Page
{{Double Page|previous=<xxx>-<xxx>|current=<xxx>-<xxx>|next=<xxx>-<xxx>}}
batch OCR full resolution pages
create PDF with FineReader
wiki upload text
- Click on the "Template:<xxx>-<xxx>" link and copy the text from the pdf
- Spell-check and copy edit the text
Notes
- Scanning in at 300ppi, 256 gray shades (8-bit grayscale)
- I'm not uploading the raw PDF files, since they're about 5 times as large as the jpgs
- Scanning pages 362-1169 (807 pages)
- 2 pages takes roughly 5 minutes to scan, convert, upload and add text
- 2017.5 minutes total required
- 33.625 hours required
- approximately 1 hour/day available
- about 40 days total
- 14 pages already scanned (but not proofed)