                               Document Storage

*****************************************************************************
 PLEASE REMEMBER THIS SOFTWARE COMES WITH NO WARRANTY AND YOU USE IT AT YOUR
 OWN RISK.  See the file LICENSE for more information.
*****************************************************************************

This is part of the Objectify project documentation.
Copyright (C) 2007   J. Scott Edwards
See the file README for copying conditions.


I am currently using this system to store all of my statements, receipts, etc.

While the system is not functional to the point where it can index all of the
files the way it should, it can still store them until all of the indexing is
working.

The procedure I use is as follows:

  1) I scan them in using a command like:

         cd /tmp
         scanimage --mode Color --resolution 300dpi | pnmtojpeg --quality=90 \
         > business_name-070825.jpg

     Where 070726 is YYMMDD, in this case August 25, 2007.

  2) I then use the Gimp to trim the borders if necessary.

  3) I store them into Objectify as follows:

         import_file business_name-070825.jpg xyzzy-070824.jpg

     If the documents contain really private information, such as bank
     account numbers I may change the security level a bit depending upon
     the nature of the documents:

         import_file --high-security ssn-070707.jpg \
         --very-low-security gasoline_receipt-070809.jpg

  4) After I have stored several documents, I make a compressed file of the
     entire system:

         mkdir /tmp/20070825-001
         cd /tmp/20070825-001
         compress_sparse compressed-070825 | tee compress_sparse-070825.txt

  5) I also make a compressed diff file to update the compressed file to
     update any compreseed file on remote systems (don't need to upload the
     whole compressed file that way):

         diff_compressed /media/cdrom0/compressed-070818 compressed-070825 \
           diff-070818-070825 | tee diff-070818-070825.txt
	 md5sum compressed-070825 > compressed-070825.md5
         scp -p diff-070818-070825 compressed-070825.md5 remote-machine:

  6) I then update and verify the compressed file on the remote machine:

         patch_compressed compressed-070818 diff-070818-070825 \
           compressed-070825 | tee patch-070818-070825.txt
	 md5sum -c compressed-070825.md5

  7) I then burn the compressed file to both a DVD+R and DVD-R (who knows
     which is better for long term storage?):

         growisofs -Z /dev/hdd -speed=4 -R -V 20070825-001 /tmp/20070825-001/

  8) I take the DVD that I am going to put in a secure off-site storage area
     and put it into a different computer that has Objectify stored on it and
     I copy the compressed file to the machine:

         cp -pi /media/cdrom0/compressed-070825 /tmp

  9) I then export the files that I want to verify from Objectify:

         export_file --compressed-file /tmp/compressed-070825 \
         business_name-070825.jpg xyzzy-070824.jpg ssn-070707.jpg \
         gasoline_receipt-070809.jpg

 10) Next I open each of the files with Konqueror or (whatever else works)
     and compare them to the original documents.  I make sure I can read the
     important information, such as the dollar amount, date, etc. from the
     copy exported from Objectify.

 11) After I have verified the stored copy is legible, if I don't need the
     paper copy anymore I shred it.

 12) I also use the compressed copy to verify and remove the copies from my
     hard drive:

         cd /tmp
         verify_file --compressed-file /tmp/compressed-070825 --remove *


