I'd like to build a simple application to:1. extract images from one or more pdf file/files converting them into tiff/jpg images;
2. autocrop them (to remove a white margin), then split them, since images are two-pages scans that need to be splitted into single-page images;
3a . load the resulting images into a zip file or3b. wrap them again into a pdf file with a lossless compression. Presently I'm doing the job using a local routine into my PC, and I use nconvert for step 1, and python (with PIL) for steps 2-3a.
I better would like to to this job into Labs; the best being, to addo routines to grab files from an URL and to load them into Internet Archive. Just to avoid "rediscovering the wheel", is someone doing something similar into Labs?
If the wheel has to be built ;-) , which tools have I to (painfully) study and test?Can I install nconvert for linux into my account for personal use?
Alex (from it.wikisource)