TheMorganReport:Community Portal
Instructions for Editors
How We Did This
Equipment
- Windows XP box with 2GB RAM, AMD Athlon XP 2700+
- Plustek OpticBook 3600
Software
- Adobe Acrobat Professional 7.0
- Adobe Photoshop CS 8.0
- Abbyy FineReader 8.0 Professional
- OpticBook 3600 driver
Procedure
scan pages from opticbook 3600
300dpi, grayscale
batch rename pages
from "Image xxxx.jpg" to "xxxx-xxxx.jpg" to reflect page numbers
batch resize pages
using photoshop automation, save for web, jpeg low settings, and 38% scaled
wiki upload
batch upload jpg files to wiki
- created wiki.cfg file with two lines:
- user=
- password=
- Had Upload perl script in the same directory as wiki.cfg
- Used OpenOffice Calc to create a list of commands
perl wiki-upload.pl "502-503.jpg" "Reports of Committee on Foreign Relations 1789-1902 Volume 6 pp502-503"
- Copied commands into dos batch file, and executed batch
wiki stubs
- Navigate to page in wiki, and put in stub code
- for first page, make previous=Main Page, for last page, make next=Main Page
{{Double Page|previous=<xxx>-<xxx>|current=<xxx>-<xxx>|next=<xxx>-<xxx>}}
batch OCR full resolution pages
create PDF with FineReader
wiki upload text
- Click on the "Template:<xxx>-<xxx>" link and copy the text from the pdf
- Spell-check and copy edit the text